Mingyu Jin's picture

9

Mingyu Jin

JimmyNLP

AI & ML interests

None yet

Organizations

None yet

upvoted 9 papers 3 months ago

R-WoM: Retrieval-augmented World Model For Computer-use Agents

Paper • 2510.11892 • Published Oct 13, 2025 • 21

MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information

Paper • 2510.03632 • Published Oct 4, 2025 • 41

Dynamic Speculative Agent Planning

Paper • 2509.01920 • Published Sep 2, 2025 • 6

Who's Your Judge? On the Detectability of LLM-Generated Judgments

Paper • 2509.25154 • Published Sep 29, 2025 • 29

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

Paper • 2509.25541 • Published Sep 29, 2025 • 140

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 56

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 118

Multiplayer Nash Preference Optimization

Paper • 2509.23102 • Published Sep 27, 2025 • 62

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 134