Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)
Joya Chen PRO
chenjoya
AI & ML interests
Video LLM
Recent Activity
upvoted
a
paper
2 days ago
StreamingVLM: Real-Time Understanding for Infinite Video Streams
liked
a dataset
7 days ago
ZaynZhu/Paper2Video
liked
a dataset
7 days ago
Enxin/VideoNSA-data