-
BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation
Paper • 2402.03216 • Published • 6 -
intfloat/multilingual-e5-large-instruct
Feature Extraction • 0.6B • Updated • 2.18M • • 539 -
BAAI/bge-m3
Sentence Similarity • Updated • 4.5M • • 2.24k -
BAAI/bge-multilingual-gemma2
Feature Extraction • 9B • Updated • 348k • • 189
Collections
Discover the best community collections!
Collections including paper arxiv:2402.03216
-
DocGraphLM: Documental Graph Language Model for Information Extraction
Paper • 2401.02823 • Published • 37 -
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper • 2401.02038 • Published • 66 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 189 -
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
Paper • 2309.01131 • Published • 1
-
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Paper • 2306.01116 • Published • 37 -
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Paper • 2205.14135 • Published • 13 -
RoFormer: Enhanced Transformer with Rotary Position Embedding
Paper • 2104.09864 • Published • 13 -
Language Models are Few-Shot Learners
Paper • 2005.14165 • Published • 14
-
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Paper • 2401.02954 • Published • 49 -
Qwen Technical Report
Paper • 2309.16609 • Published • 36 -
GPT-4 Technical Report
Paper • 2303.08774 • Published • 6 -
Gemini: A Family of Highly Capable Multimodal Models
Paper • 2312.11805 • Published • 46
-
BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation
Paper • 2402.03216 • Published • 6 -
intfloat/multilingual-e5-large-instruct
Feature Extraction • 0.6B • Updated • 2.18M • • 539 -
BAAI/bge-m3
Sentence Similarity • Updated • 4.5M • • 2.24k -
BAAI/bge-multilingual-gemma2
Feature Extraction • 9B • Updated • 348k • • 189
-
DocGraphLM: Documental Graph Language Model for Information Extraction
Paper • 2401.02823 • Published • 37 -
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper • 2401.02038 • Published • 66 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 189 -
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
Paper • 2309.01131 • Published • 1
-
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Paper • 2401.02954 • Published • 49 -
Qwen Technical Report
Paper • 2309.16609 • Published • 36 -
GPT-4 Technical Report
Paper • 2303.08774 • Published • 6 -
Gemini: A Family of Highly Capable Multimodal Models
Paper • 2312.11805 • Published • 46
-
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Paper • 2306.01116 • Published • 37 -
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Paper • 2205.14135 • Published • 13 -
RoFormer: Enhanced Transformer with Rotary Position Embedding
Paper • 2104.09864 • Published • 13 -
Language Models are Few-Shot Learners
Paper • 2005.14165 • Published • 14