Jiacheng Zhu

JiachengZhu

https://jiachengzhuml.github.io/

AI & ML interests

machine learning, statistical machine learning, foundation model

Recent Activity

authored a paper about 2 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 81

upvoted a paper 2 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 81

liked a model 2 months ago

Qwen/Qwen3-0.6B

Text Generation • 0.8B • Updated Jul 26, 2025 • 7.97M • 959

authored a paper 3 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 271

upvoted a paper 3 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 271

liked a dataset 5 months ago

ingoziegler/CRAFT-BioQA

Viewer • Updated Dec 8, 2025 • 30.6k • 88 • 4

upvoted a paper 5 months ago

Complex Logical Instruction Generation

Paper • 2508.09125 • Published Aug 12, 2025 • 40

liked a model 5 months ago

dominguesm/xlm-roberta-base-lora-language-detection

Updated May 2, 2024 • 12.8k • 2

upvoted an article 5 months ago

From GRPO to DAPO and GSPO: What, Why, and How

Article • Aug 9, 2025

liked a dataset 7 months ago

yiqingliang/lisa-problems-dataset

Viewer • Updated Apr 9, 2025 • 1.33k • 7 • 1

upvoted a paper 7 months ago

MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning

Paper • 2505.24871 • Published May 30, 2025 • 23

liked a model 12 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 424k • 12.9k

upvoted a paper over 1 year ago

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Paper • 2410.02740 • Published Oct 3, 2024 • 54

liked a Space over 1 year ago

Merging Competition

💻

Display a static leaderboard for a competition

liked a dataset over 1 year ago

bigscience/P3

Viewer • Updated Mar 4, 2024 • 122M • 19.5k • 230

authored a paper over 1 year ago

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Paper • 2404.12241 • Published Apr 18, 2024 • 13

upvoted 2 papers almost 2 years ago

Interpolation for Robust Learning: Data Augmentation on Geodesics

Paper • 2302.02092 • Published Feb 4, 2023 • 1

Asymmetry in Low-Rank Adapters of Foundation Models

Paper • 2402.16842 • Published Feb 26, 2024 • 2

authored a paper almost 2 years ago

Asymmetry in Low-Rank Adapters of Foundation Models

Paper • 2402.16842 • Published Feb 26, 2024 • 2

liked a model about 2 years ago

CultriX/MistralTrix-v1

Text Generation • 9B • Updated Jan 27, 2024 • 752 • 110
