5 25 5

Ziniu Li

ziniuli

http://www.liziniu.org/

liziniu

AI & ML interests

None yet

Recent Activity

upvoted a paper about 9 hours ago

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

upvoted a paper 24 days ago

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

upvoted a paper 24 days ago

Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning

View all activity

Organizations

upvoted a paper about 9 hours ago

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Paper • 2512.24617 • Published 2 days ago • 21

upvoted 2 papers 24 days ago

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Paper • 2310.10505 • Published Oct 16, 2023 • 3

Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning

Paper • 2512.06533 • Published 27 days ago • 6

upvoted 2 papers about 1 month ago

How Far Are We from Genuinely Useful Deep Research Agents?

Paper • 2512.01948 • Published Dec 1, 2025 • 54

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 279

upvoted a paper about 2 months ago

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Paper • 2511.06307 • Published Nov 9, 2025 • 51

authored 10 papers about 2 months ago

Why Transformers Need Adam: A Hessian Perspective

Paper • 2402.16788 • Published Feb 26, 2024 • 2

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Paper • 2310.10505 • Published Oct 16, 2023 • 3

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24, 2025 • 80

Bridging Formal Language with Chain-of-Thought Reasoning to Geometry Problem Solving

Paper • 2508.09099 • Published Aug 12, 2025

Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment

Paper • 2505.04113 • Published May 7, 2025

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Paper • 2509.25849 • Published Sep 30, 2025 • 47

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 221

upvoted 2 papers 2 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 221

UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning

Paper • 2510.20286 • Published Oct 23, 2025 • 23

Ziniu Li

AI & ML interests

Recent Activity

Organizations

ziniuli's activity