-
nuprl/MultiPL-E
Viewer • Updated • 12.7k • 36.4k • 58 -
openai/openai_humaneval
Viewer • Updated • 164 • 86.5k • 342 -
1.45k
Big Code Models Leaderboard
📈Submit code models for evaluation and view leaderboard
-
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
Paper • 2402.14261 • Published • 11
Shaun
drgitt
AI & ML interests
None yet
Organizations
None yet
codegen_eval
-
nuprl/MultiPL-E
Viewer • Updated • 12.7k • 36.4k • 58 -
openai/openai_humaneval
Viewer • Updated • 164 • 86.5k • 342 -
Running1.45k1.45k
Big Code Models Leaderboard
📈Submit code models for evaluation and view leaderboard
-
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
Paper • 2402.14261 • Published • 11
Interesting LLMs
datasets
0
None public yet