Chris (Yuhao) Liu

chrisliu298

https://chrisliu298.ai/

AI & ML interests

Alignment

Recent Activity

upvoted a paper about 2 months ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

new activity 2 months ago

Skywork/Skywork-Reward-V2-Llama-3.1-8B-40M:Expected output

new activity 2 months ago

Skywork/Skywork-Reward-V2-Llama-3.1-8B:About system prompt

authored 2 papers 4 months ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 54

Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs

Paper • 2506.19290 • Published Jun 24 • 52

authored 2 papers 5 months ago

Skywork Open Reasoner 1 Technical Report

Paper • 2505.22312 • Published May 28 • 54

GUARD: Generation-time LLM Unlearning via Adaptive Restriction and Detection

Paper • 2505.13312 • Published May 19

authored 2 papers 9 months ago

Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization

Paper • 2412.18279 • Published Dec 24, 2024

LLM Unlearning via Loss Adjustment with Only Forget Data

Paper • 2410.11143 • Published Oct 14, 2024

authored a paper about 1 year ago

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs

Paper • 2410.18451 • Published Oct 24, 2024 • 20

authored 2 papers over 1 year ago

Large Language Model Unlearning via Embedding-Corrupted Prompts

Paper • 2406.07933 • Published Jun 12, 2024 • 9

Understanding the Role of Optimization in Double Descent

Paper • 2312.03951 • Published Dec 6, 2023