
Andrew Reed

andrewrreed

https://www.andrewreed.com

AI & ML interests

Applied ML, Practical AI, Inference & Deployment, LLMs, Multi-modal Models, RAG

Recent Activity

upvoted an article 2 months ago

We Got Claude to Fine-Tune an Open Source LLM

liked a Space 2 months ago

OpenEvals/evaluation-guidebook

upvoted a paper 2 months ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models



upvoted an article 2 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025 • 588

liked a Space 2 months ago

Evaluation Guidebook

📝

267

Display benchmark evaluation data for LLMs

upvoted a paper 2 months ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 129

upvoted a changelog 5 months ago

Changelog

Introducing HF Jobs: Run scalable compute jobs on Hugging Face

Jul 30, 2025 • 201

updated a Space 6 months ago

Leaderboard Yourbench Andrewrreed Nationalgrid-specs-for-electrical-installations-2024

🏆

Display leaderboard and analyze samples

published a Space 6 months ago

Leaderboard Yourbench Andrewrreed Nationalgrid-specs-for-electrical-installations-2024

🏆

Display leaderboard and analyze samples

published a dataset 6 months ago

andrewrreed/nationalgrid-specs-for-electrical-installations-2024

Viewer • Updated Aug 5, 2025 • 113 • 11

updated a dataset 6 months ago

andrewrreed/nationalgrid-specs-for-electrical-installations-2024

Viewer • Updated Aug 5, 2025 • 113 • 11

liked a Space 6 months ago

YourBench

🚀

Generate custom evaluations from your data easily!

upvoted an article 6 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5, 2025 • 510

upvoted a collection 6 months ago

gpt-oss

Collection

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 413

upvoted an article 6 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Jul 29, 2025 • 212

liked a model 7 months ago

Qwen/Qwen3-235B-A22B-Thinking-2507

Text Generation • 235B • Updated Aug 17, 2025 • 35.8k • 396

upvoted a paper 7 months ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 261

upvoted 2 articles 7 months ago

Article

Building the Hugging Face MCP Server

Jul 10, 2025

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025 • 754

updated a collection 8 months ago

Eval Leaderboards

Collection

27 items • Updated Jun 17, 2025 • 3

upvoted a paper 8 months ago

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Paper • 2506.11763 • Published Jun 13, 2025 • 74

updated a collection 8 months ago

Awesome Spaces

Collection

29 items • Updated Jun 12, 2025 • 4

liked a Space 8 months ago

Consilium MCP Server

🏢

126

Multi-AI Expert Consensus Platform
