view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 4 days ago • 42
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 10 days ago • 16
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling 29 days ago • 50
Robust Speech Recognition via Large-Scale Weak Supervision Paper • 2212.04356 • Published Dec 6, 2022 • 52
view article Article From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output Feb 7 • 22
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model +6 Oct 29, 2024 • 60
Quantized translategemma Collection Quickly tested with vLLM. Not fully compatible yet. • 7 items • Updated Jan 19 • 3
PII & De-Identification Collection Models for extracting PII entities and de-identifying clinical text, with support for HIPAA and GDPR compliance. • 278 items • Updated 3 days ago • 33
SAM Audio Collection The SAM Audio model licenses allow for redistribution so long as the original license files are included • 9 items • Updated Dec 25, 2025 • 4
ViDoRe Benchmark V3 Collection ViDoRe V3 is our latest benchmark, engineered to set a new industry gold standard for multi-modal, enterprise document retrieval evaluation. • 8 items • Updated Jan 14 • 20