Jin Zhu

mamba413

https://mamba413.github.io/

Mamba413

AI & ML interests

reinforcement learning

Recent Activity

liked a dataset 1 day ago

fancyzhx/ag_news

authored a paper 22 days ago

Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning

upvoted a paper 22 days ago

Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning

Organizations

authored a paper 22 days ago

Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning

Paper • 2504.03784 • Published Apr 3 • 2

authored 2 papers 3 months ago

An Instrumental Variable Approach to Confounded Off-Policy Evaluation

Paper • 2212.14468 • Published Dec 29, 2022

AdaDetectGPT: Adaptive Detection of LLM-Generated Text with Statistical Guarantees

Paper • 2510.01268 • Published Sep 29 • 2