Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
passing2961 's Collections
Multi-Turn Evaluation Benchmarks
Thanos
Stark
Ultron
DialogCC

Multi-Turn Evaluation Benchmarks

updated 25 days ago

A collection of benchmarks for evaluating LMs or VLMs under multi-turn interaction

Upvote
-

  • passing2961/MultiVerse

    Viewer • Updated Nov 1 • 647 • 136 • 1

  • passing2961/photochat_plus

    Viewer • Updated Dec 3, 2024 • 968 • 61 • 4

  • RefineBench/RefineBench

    Viewer • Updated 4 days ago • 1k • 710 • 4
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs