Fu Jinlan

Jinlan

AI & ML interests

None yet

Recent Activity

authored a paper about 23 hours ago

FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs

Paper • 2601.13836 • Published 2 days ago • 28

upvoted a paper 1 day ago

FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs

Paper • 2601.13836 • Published 2 days ago • 28

submitted a paper to Daily Papers 1 day ago

FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs

Paper • 2601.13836 • Published 2 days ago • 28

authored 14 papers 2 days ago

LLM can Achieve Self-Regulation via Hyperparameter Aware Generation

Paper • 2402.11251 • Published Feb 17, 2024 • 1

Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge

Paper • 2402.14310 • Published Feb 22, 2024

CorefDiffs: Co-referential and Differential Knowledge Flow in Document Grounded Conversations

Paper • 2210.02223 • Published Oct 5, 2022

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model

Paper • 2406.12030 • Published Jun 17, 2024

Cross-Modality Safety Alignment

Paper • 2406.15279 • Published Jun 21, 2024 • 5

FlipAttack: Jailbreak LLMs via Flipping

Paper • 2410.02832 • Published Oct 2, 2024 • 1

Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices

Paper • 2503.06063 • Published Mar 8, 2025

VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models

Paper • 2504.13122 • Published Apr 17, 2025 • 20

LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding

Paper • 2505.16983 • Published May 22, 2025 • 1

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published Oct 15, 2025 • 45

RoboOmni: Proactive Robot Manipulation in Omni-modal Context

Paper • 2510.23763 • Published Oct 27, 2025 • 54

MCM-DPO: Multifaceted Cross-Modal Direct Preference Optimization for Alt-text Generation

Paper • 2510.00647 • Published Oct 1, 2025

AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents

Paper • 2512.23343 • Published 24 days ago • 27

XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation

Paper • 2104.07412 • Published Apr 15, 2021

upvoted a paper 15 days ago

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published 18 days ago • 53

upvoted a paper 2 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 185

upvoted a collection 3 months ago

RoboOmni

Proactive Robot Manipulation in Omni-modal Context • 8 items • Updated Oct 29, 2025 • 4