hua
zhihua95
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 hour ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
upvoted
a
paper
over 1 year ago
Towards Achieving Human Parity on End-to-end Simultaneous Speech
Translation via LLM Agent
Organizations
None yet