Update README.md
Browse files
README.md
CHANGED
|
@@ -15,13 +15,14 @@ tags:
|
|
| 15 |
|
| 16 |
# Chandra OCR 2
|
| 17 |
|
| 18 |
-
Chandra 2 is
|
| 19 |
|
| 20 |
Try Chandra in the [free playground](https://www.datalab.to/playground), or use the [hosted API](https://www.datalab.to/) for higher accuracy and speed.
|
| 21 |
|
| 22 |
## What's New in Chandra 2
|
| 23 |
|
| 24 |
-
-
|
|
|
|
| 25 |
- Improved layout, especially on wider documents
|
| 26 |
- Significantly better image captioning
|
| 27 |
- 90+ language support with major accuracy gains
|
|
|
|
| 15 |
|
| 16 |
# Chandra OCR 2
|
| 17 |
|
| 18 |
+
Chandra 2 is a state of the art OCR model from [Datalab](https://www.datalab.to) that outputs markdown, HTML, and JSON. It is highly accurate at extracting text from images and PDFs, while preserving layout information.
|
| 19 |
|
| 20 |
Try Chandra in the [free playground](https://www.datalab.to/playground), or use the [hosted API](https://www.datalab.to/) for higher accuracy and speed.
|
| 21 |
|
| 22 |
## What's New in Chandra 2
|
| 23 |
|
| 24 |
+
- 85.9% olmocr bench score (sota), 77.8% multilingual bench score (12% improvement over Chandra 1)
|
| 25 |
+
- Significant improvements to math, tables, complex layouts
|
| 26 |
- Improved layout, especially on wider documents
|
| 27 |
- Significantly better image captioning
|
| 28 |
- 90+ language support with major accuracy gains
|