Jiashuo Yu

awojustin

AI & ML interests

Audio-Visual Learning, Music AI, AIGC

Recent Activity

liked a dataset about 1 month ago

OpenGVLab/VRBench

authored a paper about 1 month ago

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

authored a paper about 1 month ago

VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

liked a dataset about 1 month ago

OpenGVLab/VRBench

Updated Jun 12 • 11 • 2

authored 4 papers about 1 month ago

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published Nov 20, 2024 • 35

upvoted a paper about 2 months ago

VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos

Paper • 2506.10857 • Published Jun 12 • 31

commented on a paper about 2 months ago

VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos

Paper • 2506.10857 • Published Jun 12 • 31

published a dataset about 2 months ago

OpenGVLab/VRBench

Updated Jun 12 • 11 • 2

upvoted a paper 3 months ago

Long-Term Rhythmic Video Soundtracker

Paper • 2305.01319 • Published May 2, 2023 • 1

published a dataset 4 months ago

OpenGVLab/LongVid

Preview • Updated Mar 28 • 21 • 2

updated a dataset 4 months ago

OpenGVLab/LongVid

Preview • Updated Mar 28 • 21 • 2

updated a model 8 months ago

OpenGVLab/InternVideo2-Stage2-6B-Audio

Updated Nov 27, 2024 • 2

upvoted a paper 8 months ago

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published Nov 20, 2024 • 35

authored 7 papers 12 months ago

InternChat: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language

Paper • 2305.05662 • Published May 9, 2023 • 4

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

Paper • 2212.03191 • Published Dec 6, 2022

SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Paper • 2310.20700 • Published Oct 31, 2023 • 10

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Paper • 2406.08418 • Published Jun 12, 2024 • 31

Learning Music-Dance Representations through Explicit-Implicit Rhythm Synchronization

Paper • 2207.03190 • Published Jul 7, 2022

Modality-Aware Contrastive Instance Learning with Self-Distillation for Weakly-Supervised Audio-Visual Violence Detection

Paper • 2207.05500 • Published Jul 12, 2022

MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing

Paper • 2111.12374 • Published Nov 24, 2021
