random
fakerbaby
AI & ML interests
NLP, RL, VLM
Recent Activity
upvoted
an
article
25 days ago
We Got Claude to Fine-Tune an Open Source LLM
upvoted
a
paper
about 1 month ago
Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch
upvoted
a
paper
2 months ago
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning
for LLMs