Arctic-TILT
Arctic-TILT is a state-of-the-art, sub-billion parameter model for Document Understanding. It is designed for high efficiency and long-context processing, achieving performance on par with models 1000x its size on a single 24GB GPU.
- GitHub Repository (Code & Examples): Snowflake-Labs/arctic-tilt
- Paper: Arctic-TILT: Business Document Understanding at Sub-Billion Scale.
Model Details
- Model type: Encoder-Decoder Transformer
- Parameters: 822M
- Core Task: Document AI (Question Answering, Key Information Extraction)
How to Use
This model is optimized for use with vLLM
and requires a custom scheduler and preprocessor. The TiltPreprocessor
is available in the GitHub repository.
- Downloads last month
- 19
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
馃檵
Ask for provider support