Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jin Zhu's picture
2 2 30

Jin Zhu

mamba413
callmespring's profile picture Eehan's profile picture Kyleyee's profile picture
·
https://mamba413.github.io/
  • Mamba413

AI & ML interests

reinforcement learning

Recent Activity

liked a dataset 1 day ago
fancyzhx/ag_news
authored a paper 22 days ago
Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning
upvoted a paper 22 days ago
Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning
View all activity

Organizations

Stats-powered AI's profile picture

authored a paper 22 days ago

Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning

Paper • 2504.03784 • Published Apr 3 • 2
authored 2 papers 3 months ago

An Instrumental Variable Approach to Confounded Off-Policy Evaluation

Paper • 2212.14468 • Published Dec 29, 2022

AdaDetectGPT: Adaptive Detection of LLM-Generated Text with Statistical Guarantees

Paper • 2510.01268 • Published Sep 29 • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs