dou wenhan's picture

11 3 1

dou wenhan

douwh

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models

new activity 6 days ago

OpenGVLab/Mono-InternVL-2B-Synthetic-Data:Improve dataset card: Add paper, project, code links, update task category & add sample usage

new activity 6 days ago

OpenGVLab/Mono-InternVL-2B-S1-3:Update model card with link to most recent paper and full citations

View all activity

Organizations

authored a paper 6 days ago

Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models

Paper • 2507.12566 • Published 11 days ago • 14

authored a paper 6 months ago

Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding

Paper • 2501.07783 • Published Jan 14 • 7

authored a paper 7 months ago

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Paper • 2412.09604 • Published Dec 12, 2024 • 39