Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 249
💜 Kotlin ML Pack Collection A collection of datasets, fine-tuned models and benchmarks to train your models for perfect Kotlin code generation. • 9 items • Updated Jun 11, 2024 • 25
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 176
TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar Paper • 2510.14972 • Published Oct 16, 2025 • 34
Granite 2.0 Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated Nov 17, 2025 • 202
H-Net Collection The family of hierarchical networks (H-Nets) from https://arxiv.org/abs/2507.07955 • 8 items • Updated Jul 11, 2025 • 20
Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights Paper • 2506.16406 • Published Jun 19, 2025 • 130
view article Article RegMix: Data Mixture as Regression for Language Model Pre-training Jul 11, 2024 • 15
Spectrum: Targeted Training on Signal to Noise Ratio Paper • 2406.06623 • Published Jun 7, 2024 • 15
Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure Paper • 2506.12278 • Published Jun 13, 2025 • 16
view article Article StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation +7 Apr 29, 2024 • 79
LocAgent: Graph-Guided LLM Agents for Code Localization Paper • 2503.09089 • Published Mar 12, 2025 • 13
UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function Paper • 2410.21438 • Published Oct 28, 2024 • 2