DyCodeEval

updated Jun 27

DyCodeEval (ICML 2025) enables dynamic benchmarking for code LLMs. This collection features dynamic HumanEval and MBPP sets generated with Claude 3.5.

  • CodeKaleidoscope/Dynamic_HumanEvalZero

    Viewer • Updated Jun 24 • 15.7k • 24 • 3

  • CodeKaleidoscope/Dynamic_MBPP_sanitized

    Viewer • Updated Jun 24 • 15.8k • 35 • 3

  • Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination

    Paper • 2503.04149 • Published Mar 6 • 6