OpenEvals

community

AI & ML interests

LLM evaluation

Recent Activity

SaylorTwift updated a Space 31 minutes ago

OpenEvals/open_benchmark_index

clefourrier updated a Space 14 days ago

OpenEvals/InferenceProviderTesting

SaylorTwift updated a Space 15 days ago

OpenEvals/evals

View all activity

Articles

Gaia2 and ARE: Empowering the community to study agents

OpenEvals 's collections 5