Titova Ksenia

Currently, our leaderboard doesn't support automatically running models from the HF Hub through our benchmark, but we're working on it! In the meantime, you can send us a request with the model name, revision, and precision, and we'll run the LLM-as-a-judge evaluation for your model and update the leaderboard!
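To make the three requested fields concrete, here is a minimal sketch of how a request could be assembled and checked before sending it. The leaderboard does not publish a request schema, so the class name `EvalRequest`, the field names, the default values, the list of accepted precisions, and the example model id are all assumptions made for illustration.

```python
import json
from dataclasses import dataclass, asdict

# Assumption: illustrative precision labels; the leaderboard may accept others.
ALLOWED_PRECISIONS = {"float16", "bfloat16", "float32"}


@dataclass
class EvalRequest:
    """Hypothetical container for the three fields the leaderboard asks for."""

    model: str                  # HF Hub repo id in "owner/name" form
    revision: str = "main"      # branch, tag, or commit hash on the Hub
    precision: str = "float16"  # numeric precision to evaluate the model in

    def __post_init__(self) -> None:
        # A Hub repo id for a user or organization model contains exactly one slash.
        if self.model.count("/") != 1 or not all(self.model.split("/")):
            raise ValueError(f"expected 'owner/name', got {self.model!r}")
        if not self.revision:
            raise ValueError("revision must not be empty")
        if self.precision not in ALLOWED_PRECISIONS:
            raise ValueError(f"unsupported precision: {self.precision!r}")


def to_json(request: EvalRequest) -> str:
    """Serialize the request so it can be pasted into a message or issue."""
    return json.dumps(asdict(request), indent=2, sort_keys=True)


if __name__ == "__main__":
    # "my-org/my-model" is a placeholder, not a real model.
    print(to_json(EvalRequest(model="my-org/my-model", precision="bfloat16")))
```

Pinning `revision` to a specific commit hash rather than `main` keeps the evaluated weights unambiguous if the repository changes later.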

Additionally, you can use our methodology to evaluate models on another open benchmark using the code available in the repository.