zhaoyuzhong's picture

3 4 1

zhaoyuzhong

callsys

·

AI & ML interests

computer vision

Recent Activity

published a model 3 days ago

callsys/GMPO-7B

upvoted a paper 4 days ago

Geometric-Mean Policy Optimization

upvoted a paper 8 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

View all activity

Organizations

None yet

published a model 3 days ago

callsys/GMPO-7B

Updated 3 days ago

upvoted a paper 4 days ago

Geometric-Mean Policy Optimization

Paper • 2507.20673 • Published 5 days ago • 26

upvoted 2 papers 8 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Paper • 2411.19108 • Published Nov 28, 2024 • 19

New activity in microsoft/kosmos-2.5 11 months ago

change image

#9 opened 11 months ago by

liked a model 11 months ago

microsoft/kosmos-2.5-chat

1B • Updated Aug 28, 2024 • 8 • 11

New activity in microsoft/kosmos-2.5-chat 11 months ago

checkpoint

#1 opened 11 months ago by

upvoted a paper about 1 year ago

DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution

Paper • 2405.16071 • Published May 25, 2024 • 2