vlad-m-dev
/

mobilenet_v3_small_onnx_photo_doc

Model card Files Files and versions

vlad-m-dev commited on Jun 15

Commit

be9ec6f

·

verified ·

1 Parent(s): 342e4aa

Update README.md

Files changed (1) hide show

README.md +102 -3

README.md CHANGED Viewed

@@ -1,3 +1,102 @@
----
-license: mit
----

+---
+license: mit
+datasets:
+  - alfredplpl/Japanese-photos
+  - 3sara/colpali_italian_documents
+pipeline_tag: image-classification
+tags:
+  - image-classification
+  - mobile
+  - tablet
+  - quantization
+  - onnx
+  - mobilenetv3
+  - mobilenet_v3
+  - mobilenetv3_onnx
+  - document-classification
+  - photo-classification
+  - real-time
+  - lightweight
+  - efficient
+  - document
+  - photo
+  - images
+  - q8
+  - int8
+  - edge-ai
+  - ai-on-device
+  - offline
+  - privacy
+  - fast
+  - android
+  - ios
+  - gallery
+---
+# MobileNetV3 — ONNX, Quantized
+### 🔥 Lightweight mobile model for **image classification** into two categories:
+- **`document`** (scans, receipts, papers, invoices)
+- **`photo`** (regular phone photos: scenes, people, nature, etc.)
+---
+## 🟢 Overview
+- **Designed for mobile devices** (phones and tablets, Android/iOS), perfect for real-time on-device inference!
+- Architecture: **MobileNetV2**
+- Format: **ONNX** (both float32 and quantized int8 versions included)
+- Trained on balanced, real-world open-source datasets for both documents and photos.
+- Ideal for tasks like:
+  - Document detection in gallery/camera rolls
+  - Screenshot, receipt, photo, and PDF preview classification
+  - Image sorting for privacy-first offline AI assistants
+---
+## 🏷️ Model Classes
+- **0** — `document`
+- **1** — `photo`
+---
+## ⚡️ Versions
+- `mobilenet_v3_small.onnx` — Standard float32 for maximum accuracy (best for ARM/CPU)
+- `mobilenet_v3_small_quant.onnx` — Quantized int8 for even faster inference and smaller file size (best for low-power or edge devices)
+---
+## 🚀 Why this model?
+- **Ultra-small size** (~10-15MB), real-time inference (<100ms) on most phones
+- **Runs 100% offline** (privacy, no cloud required)
+- **Easy integration** with any framework, including React Native (`onnxruntime-react-native`), Android (ONNX Runtime), and iOS.
+---
+## 🗃️ Datasets
+- **Photos:** [alfredplpl/Japanese-photos](https://huggingface.co/datasets/alfredplpl/Japanese-photos)
+- **Documents:** [3sara/colpali_italian_documents](https://huggingface.co/datasets/3sara/colpali_italian_documents)
+---
+## 🤖 Author
+@vlad-m-dev
+Built for edge-ai/phone/tablet offline image classification: document vs photo
+Telegram: https://t.me/dwight_schrute_engineer
+---
+## 🛠️ Usage Example
+```python
+import onnxruntime as ort
+import numpy as np
+session = ort.InferenceSession(MODEL_PATH)
+img = np.random.randn(1, 3, 224, 224).astype(np.float32)  # Replace with your image preprocessing!
+output = session.run(None, {"input": img})
+pred_class = np.argmax(output[0])
+print(pred_class)  # 0 = document, 1 = photo```