ymh233's picture

ymh233

ymh233

·

AI & ML interests

None yet

Recent Activity

published a model 5 days ago

ymh233/swe_traj

liked a dataset 7 days ago

neulab/agent-data-collection

updated a model 10 days ago

ymh233/Focal_pretrain-Qwen-Qwen2.5-Coder-1.5B-NTE-d512-w0.1-zw1_1_1-20251217_step_10000

View all activity

Organizations

upvoted a paper 24 days ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published 26 days ago • 149

upvoted a paper 3 months ago

RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation

Paper • 2509.16198 • Published Sep 19 • 126

upvoted 2 papers 4 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19 • 118

MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents

Paper • 2508.13186 • Published Aug 14 • 19

upvoted a paper 6 months ago

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16 • 42

upvoted 2 papers 7 months ago

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Paper • 2505.16175 • Published May 22 • 41

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21 • 53

upvoted a paper 8 months ago

AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection

Paper • 2505.07293 • Published May 12 • 28

upvoted 2 papers 9 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published Mar 24 • 31

upvoted a paper 10 months ago

Process-based Self-Rewarding Language Models

Paper • 2503.03746 • Published Mar 5 • 39

upvoted a paper 11 months ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 48