Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation Paper β’ 2504.17025 β’ Published Apr 23, 2025 β’ 17
PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines Paper β’ 2504.14738 β’ Published Apr 20, 2025 β’ 5
Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction Paper β’ 2504.15266 β’ Published Apr 21, 2025 β’ 6
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging Paper β’ 2504.10642 β’ Published Apr 14, 2025 β’ 2
EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models Paper β’ 2504.15133 β’ Published Apr 21, 2025 β’ 26
BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation Paper β’ 2504.14538 β’ Published Apr 20, 2025 β’ 30
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning Paper β’ 2504.17192 β’ Published Apr 24, 2025 β’ 124
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. β’ 43 items β’ Updated Mar 2 β’ 717
π§ Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community β’ 24 items β’ Updated May 19, 2025 β’ 188
Big-Math Collection This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers β’ 4 items β’ Updated Apr 16, 2025 β’ 7
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data Paper β’ 2309.11235 β’ Published Sep 20, 2023 β’ 15
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. β’ 100 items β’ Updated Mar 2 β’ 577