Lee Jung Bang

bangbang

AI & ML interests

NLP, CHATBOT, RL

Organizations

None yet

Recent Activity

liked a dataset about 2 hours ago

Anthropic/AnthropicInterviewer

Viewer • Updated 7 days ago • 1.25k • 9.07k • 287

liked 2 datasets about 23 hours ago

openai/gdpval

Viewer • Updated Sep 25 • 220 • 28.6k • 372

OpenMed/Medical-Reasoning-SFT-GPT-OSS-120B

Viewer • Updated 4 days ago • 200k • 669 • 102

liked a model about 23 hours ago

upstage/Solar-Open-100B

Updated 1 day ago • 48

liked a dataset 4 days ago

HuggingFaceFW/fineweb

Viewer • Updated Jul 11 • 52.5B • 193k • 2.52k

liked 2 models 4 days ago

Tongyi-MAI/Z-Image-Turbo

Text-to-Image • Updated 7 days ago • 286k • 2.76k

microsoft/VibeVoice-Realtime-0.5B

Text-to-Speech • 1B • Updated 4 days ago • 143k • 863

upvoted a paper 5 days ago

Lessons from the Trenches on Reproducible Evaluation of Language Models

Paper • 2405.14782 • Published May 23, 2024 • 1

liked a Space 5 days ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

Evaluate multilingual models using FineTasks

upvoted a paper 5 days ago

OLMES: A Standard for Language Model Evaluations

Paper • 2406.08446 • Published Jun 12, 2024 • 3

liked a Space 5 days ago

The Ultra-Scale Playbook

🌌

3.57k

The ultimate guide to training LLM on large GPU Clusters

upvoted 4 papers 5 days ago

Does your data spark joy? Performance gains from domain upsampling at the end of training

Paper • 2406.03476 • Published Jun 5, 2024 • 4

liked a dataset 5 days ago

LLM360/MegaMath

Viewer • Updated Apr 9 • 217M • 25.9k • 107

upvoted 4 papers 5 days ago

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 35

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 250

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 138

To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 45