Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • Paper • 2512.13607 • Published 10 days ago • 26
PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity • Paper • 2503.07677 • Published Mar 10 • 86
Efficient Generative Modeling with Residual Vector Quantization-Based Tokens • Paper • 2412.10208 • Published Dec 13, 2024 • 19