Jin Zhu's picture

2 2 29

Jin Zhu

mamba413

·

https://mamba413.github.io/

Mamba413

AI & ML interests

reinforcement learning

Recent Activity

authored a paper 21 days ago

Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning

upvoted a paper 21 days ago

Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning

liked a dataset about 2 months ago

bookcorpus/bookcorpus

View all activity

Organizations

authored a paper 21 days ago

Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning

Paper • 2504.03784 • Published Apr 3 • 2

upvoted a paper 21 days ago

Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning

Paper • 2504.03784 • Published Apr 3 • 2

liked 2 datasets about 2 months ago

bookcorpus/bookcorpus

Updated May 3, 2024 • 6.39k • 336

Salesforce/wikitext

Viewer • Updated Jan 4, 2024 • 3.71M • 915k • 540

New activity in AyoubChLin/CNN_News_Articles_2011-2022 2 months ago

Request: DOI

#1 opened 2 months ago by

liked 4 datasets 2 months ago

euirim/goodwiki

Viewer • Updated Sep 11, 2023 • 44.8k • 150 • 53

toloka/beemo

Viewer • Updated Jan 28 • 2.19k • 374 • 18

microsoft/wiki_qa

Viewer • Updated Jan 4, 2024 • 29.3k • 4.69k • 66

legacy-datasets/wikipedia

Updated Mar 11, 2024 • 32.7k • 607

upvoted a paper 3 months ago

AdaDetectGPT: Adaptive Detection of LLM-Generated Text with Statistical Guarantees

Paper • 2510.01268 • Published Sep 29 • 2

authored 2 papers 3 months ago

An Instrumental Variable Approach to Confounded Off-Policy Evaluation

Paper • 2212.14468 • Published Dec 29, 2022

AdaDetectGPT: Adaptive Detection of LLM-Generated Text with Statistical Guarantees

Paper • 2510.01268 • Published Sep 29 • 2