Distributed Leaderboard
Display model evaluation scores on a leaderboard
None defined yet.
Welcome to our hackathon!
Whether you’re a tooled up ML engineer, a classicist NLP dev, or an AGI pilled vibe coder, this hackathon is going to be hard work! We’re going to take the latest and greatest coding agents and use them to level up open source AI. After all, why use December to relax and spend time with loved ones, when you can solve AI for all humanity? Jokes aside, this hackathon is not about learning skills from zero or breaking things down in their simplest components. It’s about collaborating, shipping, and making a difference for the open source community.
Over four weeks, we're using coding agents to level up the open source AI ecosystem:
Every contribution earns XP. Top contributors make the leaderboard. Winners get prizes!
Here's the schedule:
| Date | Event | Link |
|---|---|---|
| Dec 2 (Mon) | Week 1 Quest Released | Evaluate a Hub Model |
| Dec 4 (Wed) | Livestream 1 | TBA |
| Dec 9 (Mon) | Week 2 Quest Released | Publish a Hub Dataset |
| Dec 11 (Wed) | Livestream 2 | TBA |
| Dec 16 (Mon) | Week 3 Quest Released | Supervised Fine-Tuning |
| Dec 18 (Wed) | Livestream 3 | TBA |
| Dec 23 (Mon) | Week 4 Community Sprint | TBA |
| Dec 31 (Tue) | Hackathon Ends | TBA |
Join hf-skills on Hugging Face. This is where your contributions will be tracked and updated on the leaderboard.
Use whatever coding agent you prefer:
claude in your terminalcodex CLIgemini in your terminalThe skills in this repo work with any agent that can read markdown instructions and run Python scripts. To install the skills, follow the instructions in the README.
Most quests require a Hugging Face token with write access:
# mac/linux
curl -LsSf https://hf.co/cli/install.sh | bash
# windows
powershell -ExecutionPolicy ByPass -c "irm https://hf.co/cli/install.ps1 | iex"
# Login (creates/stores your token)
hf auth login
This will set your HF_TOKEN environment variable.
git clone https://github.com/huggingface/skills.git
cd skills
Point your coding agent at the relevant configuration. Check the README for instructions on how to use the skills with your coding agent.
Week 1 is live! Head to 02_evaluate-hub-model.md to start evaluating models and climb the leaderboard.
Each quest has three tiers:
| Tier | What it takes | XP |
|---|---|---|
| 🐢 | Complete the basics | 50-75 XP |
| 🐕 | Go deeper with more features | 100-125 XP |
| 🦁 | Ship something impressive | 200-225 XP |
You can complete multiple tiers, and you can complete the same quest multiple times with different models/datasets/spaces.
To join the Hackathon, join the organization on the hub and setup your coding agent.
Ready? Let's ship some AI. 🚀