FineWeb: decanting the web for the finest text data at scale 🍷 • 1.31k • Generate a curated web‑text dataset for LLM training
Tokenizer Arena 📈 • 65 • Compare and visualize tokenization of your text across models
nvidia/Aegis-AI-Content-Safety-LlamaGuard-Permissive-1.0 Text Classification • Updated Sep 22, 2025 • 1.1k • 18
nvidia/Aegis-AI-Content-Safety-LlamaGuard-Defensive-1.0 Text Classification • Updated Sep 22, 2025 • 7.42k • 28
Model Memory Utility 🚀 • 1.01k • Calculate VRAM needed to train and run Hugging Face models