8 9 11

Xinhao Li

lixinhao

https://leexinhao.github.io/

AI & ML interests

None yet

Recent Activity

updated a model about 21 hours ago

OpenGVLab/InternVideo2-Stage2_6B-224p-f4

upvoted a paper 7 days ago

Pixels, Patterns, but No Poetry: To See The World like Humans

new activity 16 days ago

OpenGVLab/VideoChat-R1_7B:How many devices are needed for the GRPO tuning as your paper mentioned?

View all activity

Organizations

updated a model about 21 hours ago

OpenGVLab/InternVideo2-Stage2_6B-224p-f4

Updated about 21 hours ago • 6

upvoted a paper 7 days ago

Pixels, Patterns, but No Poetry: To See The World like Humans

Paper • 2507.16863 • Published 9 days ago • 62

New activity in OpenGVLab/VideoChat-R1_7B 16 days ago

How many devices are needed for the GRPO tuning as your paper mentioned?

#1 opened 16 days ago by

JasonLee996

updated a dataset about 1 month ago

OpenGVLab/VideoChat-Flash-Training-Data

Viewer • Updated Jun 24 • 87k • 126k • 11

updated a dataset 2 months ago

lixinhao/some_videos

Updated May 30 • 79

upvoted a paper 2 months ago

VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?

Paper • 2505.23359 • Published May 29 • 40

published a dataset 2 months ago

lixinhao/some_videos

Updated May 30 • 79

authored 7 papers 2 months ago

Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method

Paper • 2501.00584 • Published Dec 31, 2024

Fine-grained Video-Text Retrieval: A New Benchmark and Method

Paper • 2501.00513 • Published Dec 31, 2024

VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model

Paper • 2407.06491 • Published Jul 9, 2024

updated a dataset 2 months ago

lixinhao/VideoEval

Viewer • Updated May 21 • 25.7k • 30

published a dataset 3 months ago

lixinhao/VideoEval

Viewer • Updated May 21 • 25.7k • 30

updated 3 models 3 months ago

OpenGVLab/VideoChat-Flash-Qwen2_5-7B_InternVideo2-1B

Video-Text-to-Text • 9B • Updated May 16 • 81 • 5

OpenGVLab/VideoChat-Flash-Qwen2_5-7B-1M_res224

Video-Text-to-Text • 8B • Updated May 16 • 47 • 2

OpenGVLab/InternVL_2_5_HiCo_R64

Video-Text-to-Text • 8B • Updated May 13 • 149 • 3

updated a dataset 3 months ago