Yamamoto-Haruki's picture

20 7

Yamamoto-Haruki

YamamotoHaruki

·

AI & ML interests

None yet

Recent Activity

liked a dataset 12 days ago

bigai/TongSIM-Asset

upvoted a paper 23 days ago

Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows

upvoted a paper about 1 month ago

DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation

View all activity

Organizations

None yet

liked a dataset 12 days ago

bigai/TongSIM-Asset

Updated 10 days ago • 19.1k • 271

upvoted a paper 23 days ago

Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows

Paper • 2512.13168 • Published 24 days ago • 49

upvoted a paper about 1 month ago

DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation

Paper • 2511.23127 • Published Nov 28, 2025 • 43

upvoted a paper 7 months ago

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Paper • 2505.22651 • Published May 28, 2025 • 48

upvoted 16 papers 10 months ago

MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge

Paper • 2502.19870 • Published Feb 27, 2025 • 9

Adapting Automatic Speech Recognition for Accented Air Traffic Control Communications

Paper • 2502.20311 • Published Feb 27, 2025 • 6

Towards Optimal Multi-draft Speculative Decoding

Paper • 2502.18779 • Published Feb 26, 2025 • 5

FSPO: Few-Shot Preference Optimization of Synthetic Preference Data in LLMs Elicits Effective Personalization to Real Users

Paper • 2502.19312 • Published Feb 26, 2025 • 7

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

Paper • 2502.16776 • Published Feb 24, 2025 • 6

CritiQ: Mining Data Quality Criteria from Human Preferences

Paper • 2502.19279 • Published Feb 26, 2025 • 10

BIG-Bench Extra Hard

Paper • 2502.19187 • Published Feb 26, 2025 • 10

Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator

Paper • 2502.19204 • Published Feb 26, 2025 • 11

VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model

Paper • 2502.18906 • Published Feb 26, 2025 • 12

Rank1: Test-Time Compute for Reranking in Information Retrieval

Paper • 2502.18418 • Published Feb 25, 2025 • 28

Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs

Paper • 2502.19413 • Published Feb 26, 2025 • 21

Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 51

Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Paper • 2502.19328 • Published Feb 26, 2025 • 23

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

Paper • 2502.19414 • Published Feb 26, 2025 • 20

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26, 2025 • 28

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26, 2025 • 65