Stephen Oates PRO

soates

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

Menlo/Lucy-128k

liked a model about 1 month ago

chandar-lab/NeoBERT

upvoted a paper about 2 months ago

Large Language Models are Locally Linear Mappings

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

Large Language Models are Locally Linear Mappings

Paper • 2505.24293 • Published May 30 • 15

upvoted a paper 2 months ago

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Paper • 2505.11711 • Published May 16 • 10

upvoted an article 2 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

and 6 others •

May 21

• 196

upvoted an article 3 months ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

•

Apr 25

• 292

upvoted a paper 3 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 132

upvoted an article 3 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

•

Apr 18

• 40

upvoted a collection 5 months ago

Gemma 3

Collection

All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 50 items • Updated 4 days ago • 73

upvoted 2 articles 6 months ago

Article

Open-R1: Update #1

and 7 others •

Feb 2

• 305

Article

Open-R1: a fully open reproduction of DeepSeek-R1

and 2 others •

Jan 28

• 877

upvoted a collection 6 months ago

EvaByte

Collection

3 items • Updated Jan 21 • 4

upvoted an article 7 months ago

Article

Mastering Tensor Dimensions in Transformers

•

Jan 12

• 79

upvoted a paper 7 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 372

upvoted a paper 10 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 89

upvoted an article 10 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

and 5 others •

Sep 18, 2024

• 261

upvoted an article 11 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

•

Aug 19, 2024

• 77

upvoted an article 12 months ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

and 2 others •

Aug 14, 2024

• 68

upvoted an article about 1 year ago

Article

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 633

upvoted a paper about 1 year ago

AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation

Paper • 2404.12753 • Published Apr 19, 2024 • 44

Stephen Oates PRO

AI & ML interests

Recent Activity

Organizations

soates's activity

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Tiny Agents: a MCP-powered agent in 50 lines of code

Gotchas in Tokenizer Behavior Every Developer Should Know

Open-R1: Update #1

Open-R1: a fully open reproduction of DeepSeek-R1

Mastering Tensor Dimensions in Transformers

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

A failed experiment: Infini-Attention, and why we should keep trying?

Uncensor any LLM with abliteration