Noob's picture

Noob

noobmldude

·

AI & ML interests

Explainable AI

Recent Activity

liked a model 8 days ago

MiniMaxAI/MiniMax-M2.1

upvoted a paper 9 days ago

Shadow-FT: Tuning Instruct via Base

upvoted a collection 9 days ago

View all activity

Organizations

upvoted a paper 9 days ago

Shadow-FT: Tuning Instruct via Base

Paper • 2505.12716 • Published May 19, 2025 • 4

upvoted a collection 9 days ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 249

upvoted a collection 18 days ago

💜 Kotlin ML Pack

A collection of datasets, fine-tuned models and benchmarks to train your models for perfect Kotlin code generation. • 9 items • Updated Jun 11, 2024 • 25

upvoted an article 23 days ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

+5

Sep 11, 2025

•

176

upvoted a paper about 1 month ago

TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar

Paper • 2510.14972 • Published Oct 16, 2025 • 34

upvoted a collection 4 months ago

Granite 2.0 Code Models

A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated Nov 17, 2025 • 202

upvoted a collection 6 months ago

H-Net

The family of hierarchical networks (H-Nets) from https://arxiv.org/abs/2507.07955 • 8 items • Updated Jul 11, 2025 • 20

upvoted 2 articles 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025

•

741

Article

Welcome Gemma 2 - Google’s new open LLM

+4

Jun 27, 2024

•

132

upvoted a paper 6 months ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19, 2025 • 130

upvoted an article 6 months ago

Article

RegMix: Data Mixture as Regression for Language Model Pre-training

Jul 11, 2024

•

15

upvoted a paper 6 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 15

upvoted an article 6 months ago

Article

Selective fine-tuning of Language Models with Spectrum

Sep 3, 2024

•

36

upvoted 2 papers 7 months ago

Magistral

Paper • 2506.10910 • Published Jun 12, 2025 • 66

Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure

Paper • 2506.12278 • Published Jun 13, 2025 • 16

upvoted an article 8 months ago

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

+7

Apr 29, 2024

•

79

upvoted 2 papers 9 months ago

LocAgent: Graph-Guided LLM Agents for Code Localization

Paper • 2503.09089 • Published Mar 12, 2025 • 13

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20, 2025 • 63

upvoted 2 papers 10 months ago

2BP: 2-Stage Backpropagation

Paper • 2405.18047 • Published May 28, 2024 • 26

UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function

Paper • 2410.21438 • Published Oct 28, 2024 • 2