SINQ: Sinkhorn-Normalized Quantization for Calibration-Free
Low-Precision LLM Weights
Paper
• 2509.22944
• Published
• 80
Robot Learning: A Tutorial
Paper
• 2510.12403
• Published
• 123
UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity
MoE
Paper
• 2510.13344
• Published
• 63
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal
Generation and Understanding
Paper
• 2510.06308
• Published
• 55
Training-Free Group Relative Policy Optimization
Paper
• 2510.08191
• Published
• 45
Detect Anything via Next Point Prediction
Paper
• 2510.12798
• Published
• 50
RLP: Reinforcement as a Pretraining Objective
Paper
• 2510.01265
• Published
• 44
RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training
Paper
• 2510.06710
• Published
• 42
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language
Models
Paper
• 2510.11341
• Published
• 35
Paper
• 2510.13998
• Published
• 59
Agentic Entropy-Balanced Policy Optimization
Paper
• 2510.14545
• Published
• 106