Zengzhi Wang

SinclairWang

https://tinyurl.com/zengzhi-homepage

AI & ML interests

Data Engineering for Generative AI

Recent Activity

upvoted 2 papers 12 days ago

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published 14 days ago • 69

olmOCR 2: Unit Test Rewards for Document OCR

Paper • 2510.19817 • Published 13 days ago • 11

upvoted a paper 14 days ago

FineVision: Open Data Is All You Need

Paper • 2510.17269 • Published 15 days ago • 61

upvoted an article 14 days ago

SmolLM3: smol, multilingual, long-context reasoner

Article • Published Jul 8 • 710

upvoted a paper about 1 month ago

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22 • 100

upvoted a paper 3 months ago

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning

Paper • 2508.10433 • Published Aug 14 • 143

upvoted 4 collections 3 months ago

ProX General Models

Base models trained on ProX-curated data. • 16 items • Updated Oct 10, 2024 • 1

ProX Math Models

Base models trained on ProX-curated openwebmath-pro. • 5 items • Updated Oct 10, 2024 • 1

ProX Refining Models

Adapted small language models used to generate data-refining programs. • 5 items • Updated Oct 10, 2024 • 5

Qwen3

84 items • Updated Aug 6 • 1.39k

upvoted a paper 3 months ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22 • 63

upvoted 2 papers 4 months ago

Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12 • 35

ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention

Paper • 2507.01004 • Published Jul 1 • 10

upvoted 4 collections 4 months ago

OctoThinker-Llama-1B Family

What makes a base language model suitable for RL? Through controlled experiments, we identify key factors then leverage them to scale up mid-training. • 6 items • Updated Jul 6 • 2

OctoThinker-Llama-3B Family

What makes a base language model suitable for RL? Through controlled experiments, we identify key factors then leverage them to scale up mid-training. • 6 items • Updated Jul 6 • 2

OctoThinker-Llama-8B Family

What makes a base language model suitable for RL? Through controlled experiments, we identify key factors then leverage them to scale up mid-training. • 3 items • Updated Jul 6 • 3

Mid-training Analysis Checkpoints (Llama-3.2-3B)

What makes a base language model suitable for RL? Through controlled experiments, we identify key factors then leverage them to scale up mid-training. • 10 items • Updated Jul 7 • 1

upvoted a paper 4 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25 • 47

upvoted a paper 5 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

upvoted a paper 6 months ago

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Paper • 2505.15612 • Published May 21 • 34