Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
OpenGVLab 's Collections
Docopilot
ZeroGUI
InternVL3
VisualPRM
Mono-InternVL
VideoChat-R1
PIIP
InternVideo2.5
VideoMAE-v2
VideoChat-Flash
InternVL2.5
InternVL2.5-MPO
InternVL2.0
InternVL1.5
InternVL1.0
V2PE
InternVL Adaptation
InternVideo2
VideoChat
VideoMamba
InternVid
OmniCorpus
All-Seeing Project
InternImage
PVT v2
InternVL Data

Docopilot

updated 8 days ago

[CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding

Upvote
1

  • OpenGVLab/Docopilot-2B

    Image-Text-to-Text • 2B • Updated 7 days ago • 27 • 7

  • OpenGVLab/Docopilot-8B

    Image-Text-to-Text • 8B • Updated 7 days ago • 20 • 3

  • OpenGVLab/Doc-750K

    Preview • Updated 5 days ago • 3.57k • 9
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs