CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation
Paper
•
2502.21074
•
Published
•
4
The official weight of GPT-2 trained with the CODI framework (https://arxiv.org/abs/2502.21074).
Base model
openai-community/gpt2