Edward Beeching's picture

Edward Beeching PRO

edbeeching

HuggingFaceH4

·

https://edbeeching.github.io/

edbeeching

AI & ML interests

None yet

Recent Activity

updated a bucket 4 days ago

edbeeching/trackio-trace-capacity-bucket

updated a Space 4 days ago

edbeeching/trackio-trace-capacity

published a bucket 4 days ago

edbeeching/trackio-trace-capacity-bucket

View all activity

Organizations

published an article 2 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

+7

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 147

published an article 10 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 773

published an article about 1 year ago

Article

Open R1: How to use OlympicCoder locally for coding

+3

burtenshaw, reach-vb, lewtun, edbeeching, yagilb

•

Mar 20, 2025

• 63

published an article about 1 year ago

Article

Open R1: Update #3

open-r1

•

Mar 11, 2025

• 297

published an article over 1 year ago

Article

Open-R1: Update #1

open-r1

•

Feb 2, 2025

• 305

published an article almost 2 years ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

+6

yfleureau, liyongsea, edbeeching, lewtun, benlipkin, romansoletskyi, vwxyzjn, kashif

•

Jul 11, 2024

• 128

published an article about 2 years ago

Article

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

+2

qgallouedec, edbeeching, ClementRomac, thomwolf

•

Apr 22, 2024

• 81

published an article about 2 years ago

Article

Vision Language Models Explained

merve, edbeeching

•

Apr 11, 2024

• 531

published an article over 2 years ago

Article

Constitutional AI with Open LLMs

+5

vwxyzjn, lewtun, edbeeching, lvwerra, osanseviero, kashif, thomwolf

•

Feb 1, 2024

• 17

published an article over 2 years ago

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

+3

kashif, edbeeching, lewtun, lvwerra, osanseviero

•

Jan 18, 2024

• 83

published an article almost 3 years ago

Article

Can foundation models label data like humans?

+7

nazneen, natolambert, sheonhan, wangjean, OsvaldN97, edbeeching, lewtun, slippylolo, thomwolf

•

Jun 12, 2023

• 1

published an article about 3 years ago

Article

Creating a Coding Assistant with StarCoder

+7

lewtun, natolambert, nazneen, edbeeching, teven, sheonhan, philschmid, lvwerra, srush

•

May 9, 2023

• 2

published an article about 3 years ago

Article

Creating a Coding Assistant with StarCoder

+7

lewtun, natolambert, nazneen, edbeeching, teven, sheonhan, philschmid, lvwerra, srush

•

May 9, 2023

• 2

published an article about 3 years ago

Article

StackLLaMA: A hands-on guide to train LLaMA with RLHF

+5

edbeeching, kashif, ybelkada, lewtun, lvwerra, nazneen, natolambert

•

Apr 5, 2023

• 48

published an article about 3 years ago

Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

+4

edbeeching, ybelkada, lvwerra, smangrul, lewtun, kashif

•

Mar 9, 2023

• 72

published an article over 3 years ago

Article

Train your first Decision Transformer

edbeeching, ThomasSimonini

•

Sep 8, 2022

• 15

published an article about 4 years ago

Article

Introducing Decision Transformers on Hugging Face 🤗

edbeeching, ThomasSimonini

•

Mar 28, 2022

• 10

published an article about 4 years ago

Article

Introducing Decision Transformers on Hugging Face 🤗

edbeeching, ThomasSimonini

•

Mar 28, 2022

• 10