Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
zhaoyuzhong's picture
3 4 1

zhaoyuzhong

callsys
harvardcly's profile picture
·

AI & ML interests

computer vision

Recent Activity

published a model 3 days ago
callsys/GMPO-7B
upvoted a paper 4 days ago
Geometric-Mean Policy Optimization
upvoted a paper 8 months ago
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
View all activity

Organizations

None yet

published a model 3 days ago

callsys/GMPO-7B

Updated 3 days ago
upvoted a paper 4 days ago

Geometric-Mean Policy Optimization

Paper • 2507.20673 • Published 5 days ago • 26
upvoted 2 papers 8 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Paper • 2411.19108 • Published Nov 28, 2024 • 19
New activity in microsoft/kosmos-2.5 11 months ago

change image

#9 opened 11 months ago by
callsys
liked a model 11 months ago

microsoft/kosmos-2.5-chat

1B • Updated Aug 28, 2024 • 8 • 11
New activity in microsoft/kosmos-2.5-chat 11 months ago

checkpoint

#1 opened 11 months ago by
callsys
upvoted a paper about 1 year ago

DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution

Paper • 2405.16071 • Published May 25, 2024 • 2
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs