arxiv:2511.22173
Seungone Kim
seungone
AI & ML interests
Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment
Recent Activity
authored a paper 2 days ago
RefineBench: Evaluating Refinement Capability of Language Models via Checklists
upvoted a paper 2 days ago
RefineBench: Evaluating Refinement Capability of Language Models via Checklists
commented on a paper 2 days ago
RefineBench: Evaluating Refinement Capability of Language Models via Checklists