dou wenhan

douwh
AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago
Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models
new activity 6 days ago
OpenGVLab/Mono-InternVL-2B-Synthetic-Data:Improve dataset card: Add paper, project, code links, update task category & add sample usage
new activity 6 days ago
OpenGVLab/Mono-InternVL-2B-S1-3:Update model card with link to most recent paper and full citations

Organizations

OpenGVLab

authored a paper 6 days ago

Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models

Paper • 2507.12566 • Published 11 days ago • 14
authored a paper 6 months ago

Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding

Paper • 2501.07783 • Published Jan 14 • 7
authored a paper 7 months ago

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Paper • 2412.09604 • Published Dec 12, 2024 • 39