MLM versus CLM for NLP tasks Collection Related paper: "Should We Still Pretrain Encoders with Masked Language Modeling?" • 1 item • Updated Sep 11
EuroBERT Encoding model Collection Suite of models for improved integration into RAG (for information retrieval), designed for ease-of-use and practicability in industrial context • 5 items • Updated Sep 11 • 1
EuroBERT Encoding model Collection Suite of models for improved integration into RAG (for information retrieval), designed for ease-of-use and practicability in industrial context • 5 items • Updated Sep 11 • 1
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1 • 78
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1 • 78
Abstention Reranking Collection Related paper: "Towards Trustworthy Reranking: A Simple yet Effective Abstention Mechanism" (accepted at TMLR 2024) • 3 items • Updated Apr 10
EuroBERT Encoding model Collection Suite of models for improved integration into RAG (for information retrieval), designed for ease-of-use and practicability in industrial context • 5 items • Updated Sep 11 • 1
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19 • 41
EuroBERT Encoding model Collection Suite of models for improved integration into RAG (for information retrieval), designed for ease-of-use and practicability in industrial context • 5 items • Updated Sep 11 • 1