Official GitHub Repository: meikiocr

This model is a core component of the meikiocr pipeline. For the full implementation, command-line script, and documentation, please see the official GitHub repository.

meiki.text.detect.v0.1

meiki.text.detect.v0.1 is an update to meiki.text.detect.v0 (see below):

meiki.text.detect.v0.1 is a new state-of-the-art, open weight text detection model for video games beating text detection models like PaddleOCR
while it is still based on D-FINE detector, it uses mobilenet v4 small as backbone instead of hgnet v2
v0.1 comes in 2 variants: v0.1.960x544 and v0.1.320x192. unlike v0 both v0.1 variants share the same architecture, but are trained on different resolutions
v0.1 models increase focus on video game text detection and are limited to 64 detected boxes, increasing efficency for this use case (making them less suitable for manga text detection out of the box)
v0.1.960x544 and v0.1.320x192 have better accuracy and lower latency than small.v0 and tiny.v0 respectively

cpu	gpu

meiki.text.detect.v0

experimental text detection models with focus on low latency. trained on japanese video games and manga.

model versions:

tiny: good for images with only few textlines (e.g. visual novels). ~30ms latency on CPU. ~3ms on GPU.
small: better for cases with many textlines (e.g. manga). ~70ms latency on CPU. ~7ms on GPU.

fine-tune of https://github.com/Peterande/D-FINE

examples

visual novel

small	tiny

manga

small	tiny

Downloads last month: 4,141

rtr46
/

meiki.text.detect.v0

meiki.text.detect.v0.1

meiki.text.detect.v0

examples

visual novel

manga

Spaces using rtr46/meiki.text.detect.v0 2