Ryan Koo's picture

3 5

Ryan Koo

rngusry

·

https://kooryan.netlify.app

kooryan

AI & ML interests

NLP, RLHF, Alignment

Recent Activity

updated a model about 1 month ago

rngusry/Llama3.2-3b-Instruct-MATH-orm

published a model about 1 month ago

rngusry/Llama3.2-3b-Instruct-MATH-orm

authored a paper 6 months ago

Decoding the End-to-end Writing Trajectory in Scholarly Manuscripts

View all activity

Organizations

Papers 5

arxiv:2504.16272

arxiv:2401.14698

arxiv:2309.17012

arxiv:2305.09857

models 5

rngusry/Llama3.2-3b-Instruct-MATH-orm

Feature Extraction • 3B • Updated Sep 16 • 1

rngusry/llama-3.2-3b-ultrafeedback-rm

rngusry/llama-3.1-1b-ultrafeedback-rm

rngusry/llama3.2-1b-instruct-hh-sft

Text Generation • 1B • Updated Jan 22

rngusry/qwen2.5-hh-rm

datasets 3

rngusry/UltraFeedback-honesty-preferences

Viewer • Updated Aug 3, 2024 • 251k • 10 • 1

rngusry/UltraFeedback-instruction_following-preferences

Viewer • Updated Jul 25, 2024 • 297k • 32

rngusry/UltraFeedback-truthfulness-preferences

Viewer • Updated Jul 25, 2024 • 217k • 11 • 1