chaohuang-ai
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -1053,28 +1053,29 @@ When merging entities:
|
|
1053 |
|
1054 |
## Multimodal Document Processing (MinerU Integration)
|
1055 |
|
1056 |
-
LightRAG now supports
|
1057 |
|
1058 |
**Key Features:**
|
1059 |
-
-
|
1060 |
-
-
|
1061 |
-
-
|
1062 |
-
-
|
1063 |
-
|
|
|
1064 |
**Quick Start:**
|
1065 |
1. Install dependencies:
|
1066 |
```bash
|
1067 |
pip install "magic-pdf[full]>=1.2.2" huggingface_hub
|
1068 |
```
|
1069 |
-
2. Download MinerU model weights (
|
1070 |
-
3.
|
1071 |
```python
|
1072 |
from lightrag.mineru_parser import MineruParser
|
1073 |
content_list, md_content = MineruParser.parse_pdf('path/to/document.pdf', 'output_dir')
|
1074 |
# or for any file type:
|
1075 |
content_list, md_content = MineruParser.parse_document('path/to/file', 'auto', 'output_dir')
|
1076 |
```
|
1077 |
-
4. Query multimodal content with LightRAG
|
1078 |
|
1079 |
## Token Usage Tracking
|
1080 |
|
|
|
1053 |
|
1054 |
## Multimodal Document Processing (MinerU Integration)
|
1055 |
|
1056 |
+
LightRAG now supports comprehensive multi-modal document processing through [MinerU](https://github.com/opendatalab/MinerU) integration, enabling advanced parsing and retrieval-augmented generation (RAG) capabilities. This powerful feature allows you to handle multi-modal documents seamlessly, extracting structured content—including text, images, tables, and formulas—from various document formats for integration into your RAG pipeline.
|
1057 |
|
1058 |
**Key Features:**
|
1059 |
+
- **Multimodal Document Handling**: Process complex documents containing mixed content types (text, images, tables, formulas)
|
1060 |
+
- **Comprehensive Format Support**: Parse PDFs, images, DOC/DOCX/PPT/PPTX, and additional file types
|
1061 |
+
- **Multi-Element Extraction**: Extract and index text, images, tables, formulas, and document structure
|
1062 |
+
- **Multimodal Retrieval**: Query and retrieve diverse content types (text, images, tables, formulas) within RAG workflows
|
1063 |
+
- **Seamless Integration**: Works smoothly with LightRAG core and RAG-Anything frameworks
|
1064 |
+
|
1065 |
**Quick Start:**
|
1066 |
1. Install dependencies:
|
1067 |
```bash
|
1068 |
pip install "magic-pdf[full]>=1.2.2" huggingface_hub
|
1069 |
```
|
1070 |
+
2. Download MinerU model weights (refer to [MinerU Integration Guide](docs/mineru_integration_en.md))
|
1071 |
+
3. Process multi-modal documents using the new MineruParser or RAG-Anything's process_document_complete:
|
1072 |
```python
|
1073 |
from lightrag.mineru_parser import MineruParser
|
1074 |
content_list, md_content = MineruParser.parse_pdf('path/to/document.pdf', 'output_dir')
|
1075 |
# or for any file type:
|
1076 |
content_list, md_content = MineruParser.parse_document('path/to/file', 'auto', 'output_dir')
|
1077 |
```
|
1078 |
+
4. Query multimodal content with LightRAG refer to [docs/mineru_integration_en.md](docs/mineru_integration_en.md).
|
1079 |
|
1080 |
## Token Usage Tracking
|
1081 |
|