denis-gordeev/reranker_dialog_items_biencoder_rubert-tiny-turbo-6 Sentence Similarity • 0.0B • Updated Dec 19, 2024 • 17 • 2
SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators Paper • 2502.06394 • Published Feb 10 • 90
denis-gordeev/reranker_dialog_items_biencoder_rubert-tiny-turbo-7 Sentence Similarity • 0.0B • Updated Dec 23, 2024 • 17
denis-gordeev/reranker_dialog_items_biencoder_rubert-tiny-turbo-6 Sentence Similarity • 0.0B • Updated Dec 19, 2024 • 17 • 2
denis-gordeev/reranker_dialog_items_biencoder_rubert-tiny-turbo-5 Sentence Similarity • 0.0B • Updated Dec 19, 2024 • 6
denis-gordeev/reranker_dialog_items_biencoder_rubert-tiny-turbo-4 Sentence Similarity • 0.0B • Updated Dec 19, 2024 • 5
denis-gordeev/reranker_dialog_items_biencoder_rubert-tiny-turbo-3 Sentence Similarity • 0.0B • Updated Dec 19, 2024 • 18
denis-gordeev/reranker_dialog_items_crossencoder_rubert-tiny-turbo Text Ranking • 0.0B • Updated Apr 9 • 218
Reasoning benchmarks Collection Various benchmarks for reasoning capabilities of LLMs • 1 item • Updated Oct 4, 2024
Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos Paper • 2410.02763 • Published Oct 3, 2024 • 7
PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation Paper • 2409.06820 • Published Sep 10, 2024 • 69