@appvoid on Hugging Face: "Let's keep the momentum for small models. I just published dot. It's the first…"

posted an update 3 days ago

Let's keep the momentum for small models. I just published dot. It's the first pretrained causal model that is trained on math/symbols rather than English. The goal is to get an agnostic few-shot meta-learner that learns from reality itself instead of language.

It's already decent at some tasks, with the next version coming in a few weeks.

appvoid/dot

NJX-njx

2 days ago

So my understanding is that the model was trained from the start on data that is neither an English corpus nor ordinary text, so its tokenizer also differs from a traditional one. I'm curious how this part is handled and how the model itself understands things. Does it still work on a one-dimensional token sequence, like a traditional model?

appvoid

2 days ago

Correct! It's causal modeling (for now) with a character-level tokenizer that has only 8 tokens.

The model learns by looking for relationships between sequences when predicting a single token, so the only way it learns is by literally nudging its weights toward a generalized solution using pure sequences.

In short, it learns to learn.
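
For readers who want something concrete, here is a minimal sketch of what a character-level tokenizer with 8 tokens and causal (next-token) training pairs could look like. The thread does not say which 8 symbols dot uses, so `VOCAB` below is purely an assumption, and `encode`, `decode`, and `next_token_pairs` are hypothetical helper names, not part of the released model.

```python
# Hypothetical 8-symbol vocabulary; the real one used by dot is not stated in this thread.
VOCAB = ["0", "1", "+", "-", "=", "(", ")", " "]
STOI = {ch: i for i, ch in enumerate(VOCAB)}   # character -> token id
ITOS = {i: ch for ch, i in STOI.items()}       # token id -> character


def encode(text):
    """Map each character to its token id (raises KeyError on unknown characters)."""
    return [STOI[ch] for ch in text]


def decode(ids):
    """Map token ids back to the original string."""
    return "".join(ITOS[i] for i in ids)


def next_token_pairs(ids):
    """Build (context, target) pairs: each prefix predicts the token that follows it."""
    return [(ids[:i], ids[i]) for i in range(1, len(ids))]


ids = encode("1+1=10")
for context, target in next_token_pairs(ids):
    print(decode(context), "->", ITOS[target])
```

The model still sees a one-dimensional token sequence; only the vocabulary is tiny and made of symbols rather than words or subwords.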

adamtechguy

about 23 hours ago

Yes, I am just diving into building my own AI models. I'm actually going to start learning to pretrain and fine-tune models this week. There's a ton of things to learn about AI.

Actually, that's a good idea, pretraining the AI on a specific task. Right now I am just learning how to use contexts and system prompts to guide the AI.
