Quentin Tardif's picture

Quentin Tardif

ntnq

·

AI & ML interests

None yet

Recent Activity

upvoted a collection 1 day ago

Open Coding Agents

upvoted a collection 5 days ago

upvoted a paper 6 days ago

Ministral 3

View all activity

Organizations

upvoted a collection 1 day ago

Open Coding Agents

11 items • Updated about 17 hours ago • 38

upvoted a collection 5 days ago

Qwen3-ASR

4 items • Updated 5 days ago • 42

upvoted a paper 6 days ago

Ministral 3

Paper • 2601.08584 • Published 21 days ago • 51

upvoted a collection 7 days ago

Trinity-Large

5 items • Updated 6 days ago • 37

upvoted an article 8 days ago

Article

🪄 Interpreto: A Unified Toolkit for Interpretability of Transformer Models

14 days ago

•

37

upvoted a paper 8 days ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published 11 days ago • 172

upvoted a paper 26 days ago

Scaling Laws for Code: Every Programming Language Matters

Paper • 2512.13472 • Published Dec 15, 2025 • 13

upvoted 2 articles about 2 months ago

Article

Saving Memory Using Padding-Free Transformer Layers during Finetuning

Jun 11, 2024

•

21

Article

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

Dec 15, 2025

•

106

upvoted 2 articles 2 months ago

Article

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

Dec 4, 2025

•

63

Article

Continuous batching from first principles

+1

Nov 25, 2025

•

316

upvoted a collection 2 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 9 items • Updated Dec 23, 2025 • 163

upvoted 2 papers 3 months ago

Fantastic Pretraining Optimizers and Where to Find Them

Paper • 2509.02046 • Published Sep 2, 2025 • 14

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12, 2025 • 97

upvoted 2 articles 3 months ago

Article

What makes good reasoning data

Oct 30, 2025

•

44

Article

On the Shifting Global Compute Landscape

Oct 29, 2025

•

58

upvoted a paper 4 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 506

upvoted a paper 5 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195

upvoted a collection 5 months ago

Apertus LLM

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1, 2025 • 325

upvoted a collection 6 months ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 413