Zhenghao Xu

zhenghaoxu

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training

upvoted a paper 6 days ago

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

liked a model about 2 months ago

inclusionAI/LLaDA2.0-flash

View all activity

Organizations

upvoted a paper 4 days ago

Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training

Paper • 2602.05933 • Published 5 days ago • 5

upvoted a paper 6 days ago

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

Paper • 2602.01058 • Published 9 days ago • 39

liked 2 models about 2 months ago

inclusionAI/LLaDA2.0-flash

Text Generation • 103B • Updated Dec 19, 2025 • 631 • 67

inclusionAI/LLaDA2.0-mini

Text Generation • 16B • Updated 1 day ago • 35.9k • 57

liked 2 models 3 months ago

inclusionAI/LLaDA2.0-mini-preview

Text Generation • 16B • Updated Dec 19, 2025 • 2.69k • 88

inclusionAI/LLaDA2.0-flash-preview

Text Generation • 103B • Updated Dec 19, 2025 • 27 • 68

liked a dataset 3 months ago

prometheus-eval/Preference-Collection

Viewer • Updated May 3, 2024 • 200k • 58 • 37

liked a dataset 4 months ago

nvidia/Aegis-AI-Content-Safety-Dataset-2.0

Viewer • Updated Jun 9, 2025 • 33.4k • 3.65k • 75

liked a model 4 months ago

Salesforce/FARE-8B

8B • Updated Oct 21, 2025 • 3 • 3

updated a dataset 4 months ago

zhenghaoxu/think-rm-rmr1-helpsteer3

Viewer • Updated Oct 21, 2025 • 111k • 6 • 1

published a dataset 4 months ago

zhenghaoxu/think-rm-rmr1-helpsteer3

Viewer • Updated Oct 21, 2025 • 111k • 6 • 1

liked a model 5 months ago

Qwen/Qwen3-4B-Thinking-2507

Text Generation • 4B • Updated Aug 6, 2025 • 531k • • 543

updated a dataset 5 months ago

zhenghaoxu/R2E-Gym-Lite-Truncate-Heuristic

Viewer • Updated Sep 26, 2025 • 7.49k • 117

published a dataset 5 months ago

zhenghaoxu/R2E-Gym-Lite-Truncate-Heuristic

Viewer • Updated Sep 26, 2025 • 7.49k • 117

updated a dataset 5 months ago

zhenghaoxu/R2E-Gym-Lite-Truncate-Heuristic-100

Viewer • Updated Sep 26, 2025 • 100 • 4

published a dataset 5 months ago

zhenghaoxu/R2E-Gym-Lite-Truncate-Heuristic-100

Viewer • Updated Sep 26, 2025 • 100 • 4

updated 2 datasets 5 months ago

zhenghaoxu/R2E-Gym-Lite-Truncate-7B

Viewer • Updated Sep 26, 2025 • 6.64k • 13

zhenghaoxu/R2E-Gym-Lite-Truncate-7B-Fixed

Viewer • Updated Sep 25, 2025 • 6.89k • 19

published 2 datasets 5 months ago