Jonatan Borkowski's picture

12 20

Jonatan Borkowski PRO

j14i

·

jborkowski

AI & ML interests

None yet

Recent Activity

reacted to sergiopaniego's post with ❤️ about 7 hours ago

This super detailed tutorial by @Paulescu is pure gold 🪙 "Fine-tuning a Small Language Model for browser control with GRPO and OpenEnv" LFM2-350M (@LiquidAI) + BrowserGym (OpenEnv) + GRPO (TRL) for learning browser control 🤝 https://paulabartabajo.substack.com/p/fine-tuning-lfm2-350m-for-browser

liked a Space 2 days ago

hysts/daily-papers

reacted to sergiopaniego's post with 🚀 2 days ago

Google DeepMind releases FunctionGemma, a 240M model specialized in 🔧 tool calling, built for fine-tuning TRL has day-0 support. To celebrate, we’re sharing 2 new resources: > Colab guide to fine-tune it for 🌐 browser control with BrowserGym OpenEnv > Standalone training script > Colab notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_functiongemma_browsergym_openenv.ipynb > Training script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/browsergym_llm.py (command to run it inside the script) > More notebooks in TRL: https://huggingface.co/docs/trl/example_overview#notebooks

View all activity

Organizations

j14i 's datasets

None public yet