1 9 6

Jeff Gao

jeff-gao

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

liked a model 2 months ago

ASLP-lab/Easy-Turn

liked a model 3 months ago

inclusionAI/Rubicon-Preview

View all activity

Organizations

None yet

upvoted a paper 16 days ago

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Paper • 2511.06307 • Published 18 days ago • 50

liked a model 2 months ago

ASLP-lab/Easy-Turn

Updated Oct 11 • 25 • 13

liked a model 3 months ago

inclusionAI/Rubicon-Preview

Text Generation • 31B • Updated Aug 19 • 65 • 23

upvoted a paper 4 months ago

Evaluating, Synthesizing, and Enhancing for Customer Support Conversation

Paper • 2508.04423 • Published Aug 6 • 9

upvoted a paper 5 months ago

EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection

Paper • 2506.09827 • Published Jun 11 • 20

liked a model 8 months ago

docling-project/SmolDocling-256M-preview

Image-Text-to-Text • 0.3B • Updated Sep 17 • 199k • 1.6k

published a model 9 months ago

jeff-gao/Qwen2.5-1.5B-Open-R1-GRPO

Updated Feb 25

updated a model 9 months ago

jeff-gao/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • 2B • Updated Feb 24

published a model 9 months ago

jeff-gao/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • 2B • Updated Feb 24

liked a model about 1 year ago

jinaai/reader-lm-1.5b

Text Generation • 2B • Updated Jan 17 • 604 • • 607

upvoted a paper about 1 year ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 83

liked a dataset over 1 year ago

facebook/covost2

Updated Jan 18, 2024 • 375 • 43

upvoted 3 papers over 1 year ago

Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On

Paper • 2407.08348 • Published Jul 11, 2024 • 52

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 89

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11, 2024 • 91

upvoted 2 papers almost 2 years ago

OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1, 2024 • 85

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Paper • 2402.08093 • Published Feb 12, 2024 • 62

liked a model almost 2 years ago

SeaLLMs/SeaLLM-13B-Chat

Updated Feb 2, 2024 • 64

New activity in microsoft/phi-1_5 about 2 years ago

Inference time is much longer than reported

🤯 1

#25 opened about 2 years ago by

jeff-gao

Inference time is much longer than reported

🤯 1

#25 opened about 2 years ago by

jeff-gao

Jeff Gao

AI & ML interests

Recent Activity

Organizations

jeff-gao's activity

Inference time is much longer than reported

Inference time is much longer than reported