Image-Text-to-Text
Transformers
ONNX
Safetensors
English
idefics3
image-to-text
conversational
asnassar commited on
Commit
607718f
·
verified ·
1 Parent(s): 83c9645

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -22
README.md CHANGED
@@ -1,26 +1,31 @@
1
- ---
2
- base_model:
3
- - HuggingFaceTB/SmolVLM-256M-Instruct
4
- language:
5
- - en
6
- library_name: transformers
7
- license: cdla-permissive-2.0
8
- pipeline_tag: image-text-to-text
9
- datasets:
10
- - ds4sd/SynthCodeNet
11
- - ds4sd/SynthFormulaNet
12
- - ds4sd/SynthChartNet
13
- - HuggingFaceM4/DoclingMatix
14
- ---
15
-
16
- <div style="display: flex; align-items: center;">
17
- <img src="https://huggingface.co/ds4sd/SmolDocling-256M-preview/resolve/main/assets/SmolDocling_doctags1.png" alt="SmolDocling" style="width: 200px; height: auto; margin-right: 20px;">
18
- <div>
19
- <h3>SmolDocling-256M-preview</h3>
20
- <p>SmolDocling is a multimodal Image-Text-to-Text model designed for efficient document conversion. It retains Docling's most popular features while ensuring full compatibility with Docling through seamless support for <strong>DoclingDocuments</strong>.</p>
21
- </div>
 
 
 
 
 
 
22
  </div>
23
-
24
  This model was presented in the paper [SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion](https://huggingface.co/papers/2503.11576).
25
 
26
  ### 🚀 Features:
 
1
+ ---
2
+ base_model:
3
+ - HuggingFaceTB/SmolVLM-256M-Instruct
4
+ language:
5
+ - en
6
+ library_name: transformers
7
+ license: cdla-permissive-2.0
8
+ pipeline_tag: image-text-to-text
9
+ datasets:
10
+ - ds4sd/SynthCodeNet
11
+ - ds4sd/SynthFormulaNet
12
+ - ds4sd/SynthChartNet
13
+ - HuggingFaceM4/DoclingMatix
14
+ ---
15
+
16
+ <div style="
17
+ background-color: #f0f9ff;
18
+ border: 1px solid #bae6fd;
19
+ color: #0369a1;
20
+ padding: 12px 16px;
21
+ border-radius: 12px;
22
+ margin-bottom: 16px;
23
+ font-family: sans-serif;
24
+ ">
25
+ <strong>📢 New Release:</strong>
26
+ We’ve released <a href="https://huggingface.co/ibm-granite/granite-docling-258M" target="_blank" style="color:#0284c7; font-weight:bold; text-decoration:underline;">
27
+ granite-docling-258M</a>, the successor to <b>SmolDocling</b>. It will now receive updates and support, check it out!
28
  </div>
 
29
  This model was presented in the paper [SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion](https://huggingface.co/papers/2503.11576).
30
 
31
  ### 🚀 Features: