Official GitHub Repository: meikiocr
This model is a core component of the meikiocr pipeline. For the full implementation, command-line script, and documentation, please see the official GitHub repository.
meiki.text.detect.v0.1
meiki.text.detect.v0.1 is an update to meiki.text.detect.v0 (see below):
- meiki.text.detect.v0.1 is a new state-of-the-art, open weight text detection model for video games beating text detection models like PaddleOCR
- while it is still based on D-FINE detector, it uses mobilenet v4 small as backbone instead of hgnet v2
- v0.1 comes in 2 variants: v0.1.960x544 and v0.1.320x192. unlike v0 both v0.1 variants share the same architecture, but are trained on different resolutions
- v0.1 models increase focus on video game text detection and are limited to 64 detected boxes, increasing efficency for this use case (making them less suitable for manga text detection out of the box)
- v0.1.960x544 and v0.1.320x192 have better accuracy and lower latency than small.v0 and tiny.v0 respectively
meiki.text.detect.v0
experimental text detection models with focus on low latency. trained on japanese video games and manga.
model versions:
- tiny: good for images with only few textlines (e.g. visual novels). ~30ms latency on CPU. ~3ms on GPU.
- small: better for cases with many textlines (e.g. manga). ~70ms latency on CPU. ~7ms on GPU.
fine-tune of https://github.com/Peterande/D-FINE
examples
visual novel
manga
- Downloads last month
- 4,141





