Xiangyu

xixy

https://xixy.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a collection 5 days ago

OpenReasoning-Nemotron

upvoted a paper about 1 month ago

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy

new activity about 2 months ago

a-m-team/AM-DeepSeek-R1-0528-Distilled: Now that's what you call China speed!

Organizations

None yet

authored a paper 2 months ago

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

Paper • 2505.17652 • Published May 23 • 6

authored a paper 5 months ago

SampleMix: A Sample-wise Pre-training Data Mixing Strategy by Coordinating Data Quality and Diversity

Paper • 2503.01506 • Published Mar 3 • 9