Article: Illustrating Reinforcement Learning from Human Feedback (RLHF) (Dec 9, 2022)
Collection: LLM Reasoning Papers, papers to improve the reasoning capabilities of LLMs (45 items, updated Feb 18)