ds4sd/SmolDocling-256M-preview

Image-Text-to-Text


asnassar committed on 17 days ago

Commit 607718f · verified · 1 Parent(s): 83c9645

Update README.md

Files changed (1)

README.md +27 -22


@@ -1,26 +1,31 @@
----
-base_model:
-- HuggingFaceTB/SmolVLM-256M-Instruct
-language:
-- en
-library_name: transformers
-license: cdla-permissive-2.0
-pipeline_tag: image-text-to-text
-datasets:
-- ds4sd/SynthCodeNet
-- ds4sd/SynthFormulaNet
-- ds4sd/SynthChartNet
-- HuggingFaceM4/DoclingMatix
----
-<div style="display: flex; align-items: center;">
-    <img src="https://huggingface.co/ds4sd/SmolDocling-256M-preview/resolve/main/assets/SmolDocling_doctags1.png" alt="SmolDocling" style="width: 200px; height: auto; margin-right: 20px;">
-    <div>
-        <h3>SmolDocling-256M-preview</h3>
-        <p>SmolDocling is a multimodal Image-Text-to-Text model designed for efficient document conversion. It retains Docling's most popular features while ensuring full compatibility with Docling through seamless support for <strong>DoclingDocuments</strong>.</p>
-    </div>
 </div>
 This model was presented in the paper [SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion](https://huggingface.co/papers/2503.11576).
 ### 🚀 Features:

+---
+base_model:
+- HuggingFaceTB/SmolVLM-256M-Instruct
+language:
+- en
+library_name: transformers
+license: cdla-permissive-2.0
+pipeline_tag: image-text-to-text
+datasets:
+- ds4sd/SynthCodeNet
+- ds4sd/SynthFormulaNet
+- ds4sd/SynthChartNet
+- HuggingFaceM4/DoclingMatix
+---
+<div style="
+  background-color: #f0f9ff;
+  border: 1px solid #bae6fd;
+  color: #0369a1;
+  padding: 12px 16px;
+  border-radius: 12px;
+  margin-bottom: 16px;
+  font-family: sans-serif;
+">
+  <strong>📢 New Release:</strong>
+  We’ve released <a href="https://huggingface.co/ibm-granite/granite-docling-258M" target="_blank" style="color:#0284c7; font-weight:bold; text-decoration:underline;">
+    granite-docling-258M</a>, the successor to <b>SmolDocling</b>. It will now receive updates and support, check it out!
 </div>
 This model was presented in the paper [SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion](https://huggingface.co/papers/2503.11576).
 ### 🚀 Features: