
Jean Kaddour

JeanKaddour

AI & ML interests

None yet

Recent Activity

commented on a paper 2 months ago

What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity

Paper • 2511.15593 • Published Nov 19, 2025 • 58

upvoted a paper 2 months ago

What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity

Paper • 2511.15593 • Published Nov 19, 2025 • 58

commented on a paper 2 months ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published Nov 17, 2025 • 136

upvoted a paper 2 months ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published Nov 17, 2025 • 136

upvoted a paper 7 months ago

From KMMLU-Redux to KMMLU-Pro: A Professional Korean Benchmark Suite for LLM Evaluation

Paper • 2507.08924 • Published Jul 11, 2025 • 18

authored a paper 8 months ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30, 2025 • 74

upvoted a paper 8 months ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30, 2025 • 74

upvoted a paper 9 months ago

Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information

Paper • 2505.06046 • Published May 9, 2025 • 15

upvoted 3 papers about 1 year ago

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24, 2025 • 77

LLMs Do Not Think Step-by-step In Implicit Reasoning

Paper • 2411.15862 • Published Nov 24, 2024 • 9

GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models

Paper • 2411.05830 • Published Nov 5, 2024 • 21

upvoted a paper over 1 year ago

Attention Is All You Need But You Don't Need All Of It For Inference of Large Language Models

Paper • 2407.15516 • Published Jul 22, 2024 • 1

liked a dataset over 1 year ago

bigcode/bigcodebench

Viewer • Updated Apr 30, 2025 • 5.7k • 16.4k • 76

upvoted a paper over 1 year ago

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22, 2024 • 48

authored a paper over 1 year ago

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22, 2024 • 48

liked a dataset over 1 year ago

andersonbcdefg/minipile-simlm

Viewer • Updated Feb 17, 2024 • 1M • 6 • 1

upvoted a collection over 1 year ago

Reasoning

Collection

8 items • Updated Jun 20, 2025 • 2

authored a paper over 1 year ago

Are We Done with MMLU?

Paper • 2406.04127 • Published Jun 6, 2024 • 39

updated a collection over 1 year ago

Benchmarks

Collection

1 item • Updated Jun 13, 2024
