Train a Unified Multimodal Data Quality Classifier with Synthetic Data Paper • 2510.15162 • Published 7 days ago • 2
UniFilter Collection A Unified Multimodal Data Quality Classifier for generating quality scores for both image-text caption data and interleaved document data • 4 items • Updated 2 days ago • 1
MiroThinker-v0.1 Collection High performance in deep research and tool use. • 7 items • Updated Sep 8 • 32
Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources Paper • 2504.00595 • Published Apr 1 • 36
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding Paper • 2412.10302 • Published Dec 13, 2024 • 18
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs Paper • 2411.14199 • Published Nov 21, 2024 • 31
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 100
MLM-Filter Model and Data Collection A collection of the proposed MLM-Filter models based on different LLM backbones. • 7 items • Updated Apr 14 • 1
Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters Paper • 2403.02677 • Published Mar 5, 2024 • 18