Chew Kok Wah

chewkokwah

AI & ML interests

Open Domain Question Answering

Recent Activity

liked a dataset 5 days ago

MegaScience/TextbookReasoning

upvoted a paper 6 days ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

liked a dataset 6 days ago

futurehouse/hle-gold-bio-chem

Organizations

chewkokwah's activity

upvoted a paper 6 days ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published 7 days ago • 46

upvoted an article 7 days ago

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

Article • and 3 others • 11 days ago • 47

upvoted a paper 7 days ago

OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique

Paper • 2507.09075 • Published 17 days ago • 13

upvoted an article 11 days ago

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

Article • and 5 others • 13 days ago • 50

upvoted 2 papers 19 days ago

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy

Paper • 2506.13284 • Published Jun 16 • 24

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22 • 33

upvoted an article 21 days ago

SmolLM3: smol, multilingual, long-context reasoner

Article • and 22 others • 21 days ago • 589

upvoted a paper 21 days ago

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Paper • 2506.18898 • Published Jun 23 • 32

upvoted an article about 2 months ago

The 4 Things Qwen-3's Chat Template Teaches Us

Article • Apr 30 • 60

upvoted a paper 2 months ago

Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation

Paper • 2505.00612 • Published May 1 • 9

upvoted an article 2 months ago

The Transformers Library: standardizing model definitions

Article • and 3 others • May 15 • 116

upvoted 2 papers 3 months ago

AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset

Paper • 2504.16891 • Published Apr 23 • 24

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Paper • 2504.15521 • Published Apr 22 • 64

upvoted a paper 4 months ago

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 34

upvoted an article 4 months ago

Visualize and understand GPU memory in PyTorch

Article • Dec 24, 2024 • 234

upvoted 2 collections 5 months ago

Light-R1

Collection • Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond • 7 items • Updated Mar 13 • 12

TinyR1

Collection • 2 items • Updated Apr 21 • 3

upvoted an article 5 months ago

DualPipe could be better without the Dual

Article • Feb 28 • 17

upvoted a collection 5 months ago

DeepSeek-R1-Distill Quantized

Collection • 18 items • Updated Feb 7 • 16

upvoted a paper 5 months ago

SIFT: Grounding LLM Reasoning in Contexts via Stickers

Paper • 2502.14922 • Published Feb 19 • 32
