GLiClass ONNX Collection GLiClass models converted to ONNX format, as well as 8bit quantization • 5 items • Updated 1 day ago • 2
GLiCLass-V3 Collection Models for zero-shot text classification that are up to 50 times faster than Cross-Encoders and show the same or higher accuracy. • 7 items • Updated 9 days ago • 13
GLiNER-X Collection The Multilingual Named Entity Recognition (NER) model which is capable of identifying any entity type. • 6 items • Updated Jun 24 • 19
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 By tomaarsen • Mar 26 • 150
view article Article Announcing MamayLM, an efficient state-of-the-art Ukrainian LLM By INSAIT-Institute and 2 others • Apr 23 • 55
GLiNER-biomed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition Paper • 2504.00676 • Published Apr 1 • 4
GLiNER-BioMed Collection Collection of high-quality GLiNER models tuned for working with biomedical data • 7 items • Updated Apr 2 • 6
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others • Mar 10 • 146
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • Jan 31 • 50
view article Article Train 400x faster Static Embedding Models with Sentence Transformers By tomaarsen • Jan 15 • 199
GLiNER Collection Knowledgator GLiNER models for information extraction • 8 items • Updated Dec 9, 2024 • 12
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 10 days ago • 368
GLiNER bi-encoders Collection Bi-encoder and poly-encoder architectures of GLiNER • 5 items • Updated Sep 10, 2024 • 13
GLiClass Collection Generalist and Light-weighted Models for Zero-shot Text Classification • 13 items • Updated Sep 17, 2024 • 14
Building Efficient Universal Classifiers with Natural Language Inference Paper • 2312.17543 • Published Dec 29, 2023 • 2
GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks Paper • 2406.12925 • Published Jun 14, 2024 • 26