6 18 14

Jiawei Liu

ganler

https://jw-liu.xyz/

AI & ML interests

Simplifying the making of great software.

Recent Activity

upvoted a paper 21 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

upvoted an article about 1 month ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

published a dataset 3 months ago

purpcode/ctxdistill-verified-ablation-Qwen2.5-14B-Instruct-1M-73k

View all activity

Organizations

upvoted a paper 21 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published 25 days ago • 34

upvoted an article about 1 month ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 243

upvoted 2 papers 8 months ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 83

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25 • 75

upvoted an article 9 months ago

Article

Blazing-Fast Code Editing via Multi-Layer Speculation

and 3 others •

Feb 15

• 17

upvoted a paper about 1 year ago

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 24

upvoted an article about 1 year ago

Article

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

•

Oct 24, 2024

• 13

upvoted a paper over 1 year ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 64

upvoted 2 articles over 1 year ago

Article

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

Jun 18, 2024

• 52

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

Apr 29, 2024

• 79

upvoted 2 papers over 1 year ago

XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts

Paper • 2404.15247 • Published Apr 23, 2024 • 3

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 149

upvoted 3 papers almost 2 years ago

NeuRI: Diversifying DNN Generation via Inductive Rule Inference

Paper • 2302.02261 • Published Feb 4, 2023 • 3

Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models

Paper • 2312.04724 • Published Dec 7, 2023 • 21

Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation

Paper • 2305.01210 • Published May 2, 2023 • 3

upvoted a collection almost 2 years ago

ise-uiuc's Papers

Collection

7 items • Updated Mar 31, 2024 • 7

upvoted 2 papers almost 2 years ago

Magicoder: Source Code Is All You Need

Paper • 2312.02120 • Published Dec 4, 2023 • 81

Relax: Composable Abstractions for End-to-End Dynamic Machine Learning

Paper • 2311.02103 • Published Nov 1, 2023 • 22

Jiawei Liu

AI & ML interests

Recent Activity

Organizations

ganler's activity

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Blazing-Fast Code Editing via Multi-Layer Speculation

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation