arxiv:2510.01268
Jin Zhu
mamba413
AI & ML interests
reinforcement learning
Recent Activity
authored
a paper
20 days ago
Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning
upvoted
a paper
20 days ago
Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning
liked
a dataset
about 2 months ago
bookcorpus/bookcorpus