KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model Paper • 2501.01028 • Published Jan 2, 2025 • 19
Stabilizing Long-term Multi-turn Reinforcement Learning with Gated Rewards Paper • 2508.10548 • Published Aug 14, 2025 • 1
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16, 2025 • 166
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper • 2501.12909 • Published Jan 22, 2025 • 74
Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Operation Paper • 2308.08090 • Published Aug 16, 2023
SelectIT: Selective Instruction Tuning for Large Language Models via Uncertainty-Aware Self-Reflection Paper • 2402.16705 • Published Feb 26, 2024 • 2
Improving Attributed Text Generation of Large Language Models via Preference Learning Paper • 2403.18381 • Published Mar 27, 2024
Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration Paper • 2311.08152 • Published Nov 14, 2023
Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer Paper • 2402.14488 • Published Feb 22, 2024
In-Context Learning State Vector with Inner and Momentum Optimization Paper • 2404.11225 • Published Apr 17, 2024 • 1
FunnelRAG: A Coarse-to-Fine Progressive Retrieval Paradigm for RAG Paper • 2410.10293 • Published Oct 14, 2024
MSDF: A General Open-Domain Multi-Skill Dialog Framework Paper • 2206.08626 • Published Jun 17, 2022 • 2
CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models Paper • 2412.07393 • Published Dec 10, 2024 • 2