Arctic-TILT

Arctic-TILT is a state-of-the-art, sub-billion parameter model for Document Understanding. It is designed for high efficiency and long-context processing, achieving performance on par with models 1000x its size on a single 24GB GPU.

Model Details

  • Model type: Encoder-Decoder Transformer
  • Parameters: 822M
  • Core Task: Document AI (Question Answering, Key Information Extraction)

How to Use

This model is optimized for use with vLLM and requires a custom scheduler and preprocessor. The TiltPreprocessor is available in the GitHub repository.

Downloads last month
19
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support