Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
dolphinlee 's Collections
llm
Text-to-Image
audio
VLM

VLM

updated 28 days ago
Upvote
-

  • Scalable Pre-training of Large Autoregressive Image Models

    Paper • 2401.08541 • Published Jan 16, 2024 • 39

  • MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets

    Paper • 2403.03194 • Published Mar 5, 2024 • 15

  • MiCo: Multi-image Contrast for Reinforcement Visual Reasoning

    Paper • 2506.22434 • Published Jun 27 • 10
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs