EvalEval Coalition

community
https://evalevalai.com/
evaluatingevals
evaleval

AI & ML interests

We’re building a research coalition on evaluating evaluations (EvalEval)! Hosted by Hugging Face, University of Edinburgh, and EleutherAI.

Recent Activity

evijit  updated a dataset about 1 hour ago
evaleval/every_eval_score_ever
kevinlwei  updated a Space 1 day ago
evaleval/README
kevinlwei  published a Space 1 day ago
evaleval/README

Yacine Jernite, Alina Leidinger, Margaret Mitchell, Leshem Choshen, Irene Solaiman, Ali El Filali, Joseph [open/acc] Pollack, Felix Friedrich, Mowafak Allaham, Prajna Soni, Jennifer Mickel, Usman Gohar, Shubham Singh, Avijit Ghosh, Anshuman Suri, Canyu Chen, Kevin Wei, Aurélien-Morgan CLAUDON, Levent Sagun, Monojit, wave, Amita Shukla, Jan Batzner, Andrew Tran

evaleval's collections (1)

Resources: Bias, Stereotypes, and Representational Harms
Collected resources for this category that have a dataset, model, or demo on Hugging Face, or a paper on arXiv (linked through Hugging Face)
  • Sleeping
    14

    BiasDetection

    🐠

    Analyze bias and toxicity in language models

  • Runtime error
    16

    StableBias

    📖

  • McGill-NLP/stereoset

    Viewer • Updated Jan 23, 2024 • 4.23k • 2.76k • 25
  • nyu-mll/crows_pairs

    Updated Jan 18, 2024 • 545 • 10