Dongfang Li

crazyofapple

AI & ML interests

None yet

Recent Activity

liked a model 25 days ago

deepseek-ai/DeepSeek-V3.2

upvoted a paper 4 months ago

KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model

upvoted a paper 4 months ago

Stabilizing Long-term Multi-turn Reinforcement Learning with Gated Rewards

View all activity

Organizations

liked a model 25 days ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated 25 days ago • 98.6k • • 1.02k

upvoted 2 papers 4 months ago

KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model

Paper • 2501.01028 • Published Jan 2 • 19

Stabilizing Long-term Multi-turn Reinforcement Learning with Gated Rewards

Paper • 2508.10548 • Published Aug 14 • 1

liked a model 8 months ago

Qwen/Qwen2.5-7B-Instruct-1M

Text Generation • 8B • Updated Jan 29 • 33.5k • • 358

upvoted a paper 9 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 166

upvoted a paper 11 months ago

FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces

Paper • 2501.12909 • Published Jan 22 • 74

authored 10 papers 12 months ago

Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Operation

Paper • 2308.08090 • Published Aug 16, 2023

SelectIT: Selective Instruction Tuning for Large Language Models via Uncertainty-Aware Self-Reflection

Paper • 2402.16705 • Published Feb 26, 2024 • 2

Improving Attributed Text Generation of Large Language Models via Preference Learning

Paper • 2403.18381 • Published Mar 27, 2024

Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration

Paper • 2311.08152 • Published Nov 14, 2023

A Survey of Large Language Models Attribution

Paper • 2311.03731 • Published Nov 7, 2023

Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer

Paper • 2402.14488 • Published Feb 22, 2024

CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models

Paper • 2412.07393 • Published Dec 10, 2024 • 2

upvoted 3 papers about 1 year ago

In-Context Learning State Vector with Inner and Momentum Optimization

Paper • 2404.11225 • Published Apr 17, 2024 • 1

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models

Paper • 2412.07393 • Published Dec 10, 2024 • 2

liked a dataset about 1 year ago

nvidia/Daring-Anteater

Viewer • Updated Jun 17, 2024 • 99.5k • 1.72k • 27

Dongfang Li

AI & ML interests

Recent Activity

Organizations

crazyofapple's activity