Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Henry Hengyuan Zhao's picture
13 23 6

Henry Hengyuan Zhao

hhenryz
21world's profile picture Concyclics's profile picture
·
https://zhaohengyuan1.github.io/
  • ZHHHYuan
  • zhaohengyuan1

AI & ML interests

Multimodal Reasoning, Human-AI Interaction, GUI Automation

Recent Activity

upvoted a collection 4 days ago
NVILA
upvoted a paper 15 days ago
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
updated a dataset about 1 month ago
hhenryz/WorldGUI-Bench
View all activity

Organizations

National University of Singapore's profile picture Efficient-Large-Model's profile picture

authored a paper 5 months ago

InterFeedback: Unveiling Interactive Intelligence of Large Multimodal Models via Human Feedback

Paper • 2502.15027 • Published Feb 20 • 7
authored a paper 6 months ago

WorldGUI: Dynamic Testing for Comprehensive Desktop GUI Automation

Paper • 2502.08047 • Published Feb 12 • 28
authored 2 papers 10 months ago

Genixer: Empowering Multimodal Large Language Models as a Powerful Data Generator

Paper • 2312.06731 • Published Dec 11, 2023 • 1

LOVA3: Learning to Visual Question Answering, Asking and Assessment

Paper • 2405.14974 • Published May 23, 2024 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs