Yi Wang's picture

Yi Wang

shepnerd

·

https://shepnerd.github.io/

Shepnerd

AI & ML interests

visual understanding & generation

Organizations

upvoted a paper 3 months ago

VideoChat-R1.5: Visual Test-Time Scaling to Reinforce Multimodal Reasoning by Iterative Perception

Paper • 2509.21100 • Published Sep 25, 2025 • 1

upvoted 4 papers 4 months ago

VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Paper • 2501.00574 • Published Dec 31, 2024 • 6

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

Paper • 2504.06958 • Published Apr 9, 2025 • 13

ExpVid: A Benchmark for Experiment Video Understanding & Reasoning

Paper • 2510.11606 • Published Oct 13, 2025 • 6

Learning Goal-Oriented Language-Guided Navigation with Self-Improving Demonstrations at Scale

Paper • 2509.24910 • Published Sep 29, 2025 • 4

upvoted a paper 8 months ago

VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos

Paper • 2506.10857 • Published Jun 12, 2025 • 30

upvoted a paper 10 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 306

upvoted a collection 12 months ago

InternVL2.5

Better than InternVL 2.0 • 19 items • Updated Sep 28, 2025 • 93

upvoted a paper about 1 year ago

Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

Paper • 2412.19326 • Published Dec 26, 2024 • 18

upvoted a collection almost 2 years ago

InternVideo2

InternVideo2 • 21 items • Updated Sep 28, 2025 • 24

upvoted a paper almost 2 years ago

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Paper • 2403.15377 • Published Mar 22, 2024 • 28

upvoted 2 papers over 2 years ago

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

Paper • 2307.06942 • Published Jul 13, 2023 • 23

JourneyDB: A Benchmark for Generative Image Understanding

Paper • 2307.00716 • Published Jul 3, 2023 • 19