Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Andrey's picture
In a Training Loop πŸ”„
17 3

Andrey

Bochkov
stas-isaev's profile picture trand1k's profile picture musicakamusic's profile picture
Β·
  • E6E831728
  • AVBochkov
  • andreybochkov

AI & ML interests

None yet

Recent Activity

reacted to sergiopaniego's post with πŸ”₯ 14 days ago
New REPL environment in OpenEnv available! ✨ Used in the Recursive Language Models (RLM) paper by Alex Zhang. Ready for inference & post-training using trajectories. Handles long contexts: > Run Python code in a sandbox > Make recursive calls to LMs > Explore data programmatically > Return final result Docs: https://meta-pytorch.org/OpenEnv/environments/repl/ Inference script: https://github.com/meta-pytorch/OpenEnv/blob/main/examples/repl_oolong_simple.py
upvoted a paper 16 days ago
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
updated a model 18 days ago
Bochkov/growing-transformers-model-frozen-16-bit-baseline-monolyth-181m
View all activity

Organizations

None yet

upvoted a paper 16 days ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper β€’ 2508.14444 β€’ Published Aug 20, 2025 β€’ 40
upvoted an article 20 days ago
view article
Article

TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell

22 days ago
β€’
11
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs