Jujie He's picture

6 2

Jujie He

leafzs

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 months ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

updated a model 3 months ago

Skywork/Skywork-o1-Open-Llama-3.1-8B

updated a model 3 months ago

Skywork/Skywork-o1-Open-PRM-Qwen-2.5-7B

View all activity

Organizations

upvoted a paper 3 months ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11 • 46

updated 3 models 3 months ago

Skywork/Skywork-o1-Open-Llama-3.1-8B

Text Generation • 8B • Updated Aug 29 • 439 • • 114

Skywork/Skywork-o1-Open-PRM-Qwen-2.5-7B

Text Classification • Updated Aug 29 • 247 • 51

Skywork/Skywork-o1-Open-PRM-Qwen-2.5-1.5B

Text Classification • Updated Aug 29 • 3.39k • 33

upvoted 2 papers 5 months ago

Skywork-R1V3 Technical Report

Paper • 2507.06167 • Published Jul 8 • 72

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 56

upvoted a paper 6 months ago

Skywork Open Reasoner 1 Technical Report

Paper • 2505.22312 • Published May 28 • 54

upvoted a collection 7 months ago

Skywork-OR1

Skywork Open Reasoner 1 • 11 items • Updated May 29 • 31

updated a dataset 8 months ago

Skywork/Skywork-OR1-RL-Data

Viewer • Updated May 29 • 119k • 1.94k • 57

published a dataset 8 months ago

Skywork/Skywork-OR1-RL-Data

Viewer • Updated May 29 • 119k • 1.94k • 57

upvoted a paper 8 months ago

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Paper • 2504.05599 • Published Apr 8 • 85

liked 2 models 11 months ago

deepseek-ai/DeepSeek-R1-Zero

Text Generation • 685B • Updated Mar 27 • 5.16k • 937

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 1.21M • • 12.9k

authored a paper over 1 year ago

Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On

Paper • 2407.08348 • Published Jul 11, 2024 • 52