
Seungone Kim PRO

seungone
https://seungonekim.github.io/
  • seungonekim
  • SeungoneKim

AI & ML interests

Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment

Recent Activity

authored a paper 2 days ago
RefineBench: Evaluating Refinement Capability of Language Models via Checklists
upvoted a paper 2 days ago
RefineBench: Evaluating Refinement Capability of Language Models via Checklists
commented on a paper 2 days ago
RefineBench: Evaluating Refinement Capability of Language Models via Checklists

Organizations

NeuLab @ LTI/CMU, HAE-RAE, Mixture of Rewards, CMU-LTI, KAIST AI, Human_Eval_RLHF, prometheus-vision, prometheus-eval, MPA human eval, AI at Meta, multilingual-reward-bench, Agora, 11777-S25 Project, cot_encyclopedia_human_eval, RefineBench, Carnegie Mellon University

Papers 33

arxiv:2511.22173
arxiv:2506.01789
arxiv:2505.22202
arxiv:2505.16409

Spaces 2

My Argilla

Pinned • Running • Apr 12

Test3

Runtime error • Apr 12

Models 1

seungone/skywork-reward-replicate

Text Classification • 8B • Updated Dec 11, 2024 • 5

Datasets 5

seungone/ablation1_math_gpt4o_mini

Viewer • Updated Nov 25, 2024 • 5.56k • 37

seungone/ablation3_math_llama3.1_8b_instruct

Viewer • Updated Nov 25, 2024 • 24.8k • 38

seungone/ablation2_math_llama3.1_8b_instruct

Viewer • Updated Nov 25, 2024 • 5.99k • 27

seungone/ablation1_code_gpt4o_mini

Viewer • Updated Nov 25, 2024 • 10k • 37

seungone/final-math-claude3.5_sonnet-10000

Viewer • Updated Sep 16, 2024 • 10k • 33 • 1