Article Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance • about 1 month ago • 82
How to Train Your LLM Web Agent: A Statistical Diagnosis Paper • 2507.04103 • Published Jul 5, 2025 • 51
StarVector SVG Datasets (SVG-Bench) Collection • Datasets for training and evaluating SVG generation models • 11 items • Updated Jan 12, 2025 • 21
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published Feb 3, 2025 • 39
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content Paper • 2406.11811 • Published Jun 17, 2024 • 16
StarCoder 2 and The Stack v2: The Next Generation Paper • 2402.19173 • Published Feb 29, 2024 • 152
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders Paper • 2404.05961 • Published Apr 9, 2024 • 66
Open LLM Leaderboard • 13.8k • Track, rank and evaluate open LLMs and chatbots