Jiwoo Hong's picture

Jiwoo Hong

JW17

·

https://jiwooya1000.github.io/

AI & ML interests

NLP, LLM, and any related topics

Recent Activity

upvoted a paper about 1 month ago

Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought

updated a model about 1 month ago

JW17/L31-8B-It-ICRM-3Epoch-v0.1

published a model about 1 month ago

JW17/L31-8B-It-ICRM-3Epoch-v0.1

View all activity

Organizations

authored 2 papers 5 months ago

AlphaPO -- Reward shape matters for LLM alignment

Paper • 2501.03884 • Published Jan 7 • 2

Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning

Paper • 2504.03380 • Published Apr 4

authored a paper 6 months ago

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Paper • 2505.11855 • Published May 17 • 10

authored 2 papers 12 months ago

Stable Language Model Pre-training by Reducing Embedding Variability

Paper • 2409.07787 • Published Sep 12, 2024

Cross-lingual Transfer of Reward Models in Multilingual Alignment

Paper • 2410.18027 • Published Oct 23, 2024

authored 2 papers over 1 year ago

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Paper • 2406.06424 • Published Jun 10, 2024 • 16

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 69