Edward Beeching's picture

Edward Beeching PRO

edbeeching

·

https://edbeeching.github.io/

edbeeching

AI & ML interests

None yet

Organizations

upvoted an article 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025

•

747

upvoted a paper 10 months ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10, 2025 • 47

upvoted an article 10 months ago

Article

Open R1: Update #3

Mar 11, 2025

•

296

upvoted an article over 1 year ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

+6

Jul 11, 2024

•

124

upvoted a paper about 2 years ago

A General Theoretical Paradigm to Understand Learning from Human Preferences

Paper • 2310.12036 • Published Oct 18, 2023 • 19

upvoted a collection about 2 years ago

Reward models on the hub

UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF. • 18 items • Updated Apr 13, 2024 • 25

upvoted 2 papers about 2 years ago

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Paper • 2311.10702 • Published Nov 17, 2023 • 19

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 123

upvoted a paper over 2 years ago

The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only

Paper • 2306.01116 • Published Jun 1, 2023 • 42