Quentin Gallouédec's picture

Quentin Gallouédec PRO

qgallouedec

·

AI & ML interests

None yet

Recent Activity

upvoted an article about 12 hours ago

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

published a dataset about 22 hours ago

trackio/documentation_dataset

upvoted a paper 1 day ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

View all activity

Organizations

upvoted an article about 12 hours ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

By

and 4 others •

1 day ago

• 69

upvoted 2 papers 1 day ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 84

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 176

upvoted a paper 5 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 6 days ago • 239

upvoted a collection 9 days ago

Gemma 3n

4 items • Updated 20 days ago • 199

upvoted a paper 16 days ago

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Paper • 2507.06261 • Published 23 days ago • 57

upvoted an article 21 days ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

By

and 1 other •

21 days ago

• 615

upvoted 2 articles 22 days ago

Article

SmolLM3: smol, multilingual, long-context reasoner

By

and 22 others •

22 days ago

• 596

Article

Mixture of Experts Explained

By

and 5 others •

Dec 11, 2023

• 780

upvoted 2 collections about 1 month ago

Qwen3

76 items • Updated 5 days ago • 959

🤖 Agents

21 items • Updated Dec 31, 2024 • 162

upvoted a collection about 2 months ago

Llama Nemotron

Open, Production-ready Enterprise Models • 9 items • Updated 5 days ago • 62

upvoted a changelog about 2 months ago

Changelog

Connect Your MCP Client to the Hugging Face Hub

Jun 6

• 104

upvoted 2 articles about 2 months ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

By

and 5 others •

Jun 3

• 78

Article

🐯 Liger GRPO meets TRL

By

and 5 others •

May 25

• 47

upvoted 3 papers 2 months ago

Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning

Paper • 2504.11354 • Published Apr 15 • 6

INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning

Paper • 2505.07291 • Published May 12 • 14

Layer Normalization

Paper • 1607.06450 • Published Jul 21, 2016 • 3

upvoted 2 articles 3 months ago

Article

Vision Language Models (Better, Faster, Stronger)

By

and 4 others •

May 12

• 491

Article

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

By

and 6 others •

May 11

• 74