Reliable and Efficient Amortized Model-Based Evaluation Datasets and Models for the REEval project stair-lab/reeval Viewer • Updated Jun 21 • 5.69M • 176 • 1 stair-lab/reeval-difficulty-for-helm Viewer • Updated Mar 18 • 217k • 15
Gathering Context for Decision Support with LLMs stair-lab/bosd_initial_dataset Viewer • Updated Jan 7 • 568 • 3
Dynamics of Learning Datasets and Models for the CodeInsights Projects stair-lab/code_insights_jsons Preview • Updated Dec 11, 2024 • 1 stair-lab/code_insights_csv Viewer • Updated Apr 16 • 3.07M • 5 • 1 stair-lab/code_insights_matrices Preview • Updated Dec 12, 2024 • 1 stair-lab/code-insights-llm_simulator Text Generation • 8B • Updated Sep 8, 2024
Nonmyopic Bayesian Optimization in Dynamic Cost Settings Datasets and Models for the Nonmyopic BO project stair-lab/semi_synthetic_protein_2p12_gemma_7b Viewer • Updated Dec 18, 2024 • 12.3k • 9 stair-lab/proteinea_fluorescence-embedding Viewer • Updated Dec 18, 2024 • 188k • 110
Finetuning and Comprehensive Evaluation of Vietnamese LLM stair-lab/MATH_vi Viewer • Updated Sep 1, 2024 • 25k • 19 • 1 stair-lab/VSMEC Viewer • Updated Sep 1, 2024 • 6.24k • 3 stair-lab/ViHSD Viewer • Updated Sep 1, 2024 • 30.7k • 6 stair-lab/VSFC Viewer • Updated Sep 1, 2024 • 14.6k • 5
Cultural Alignment akhilayerukola/NormAd Viewer • Updated Oct 25, 2024 • 2.63k • 127 • 1 ura-hcmut/ECLeKTic Preview • Updated Jun 5 • 25 • 1 ToxicityPrompts/PolyGuardPrompts Viewer • Updated Jun 23 • 29.3k • 167 SALT-NLP/CultureBank Viewer • Updated Apr 24, 2024 • 23k • 222 • 15
Reliable and Efficient Amortized Model-Based Evaluation Datasets and Models for the REEval project stair-lab/reeval Viewer • Updated Jun 21 • 5.69M • 176 • 1 stair-lab/reeval-difficulty-for-helm Viewer • Updated Mar 18 • 217k • 15
Nonmyopic Bayesian Optimization in Dynamic Cost Settings Datasets and Models for the Nonmyopic BO project stair-lab/semi_synthetic_protein_2p12_gemma_7b Viewer • Updated Dec 18, 2024 • 12.3k • 9 stair-lab/proteinea_fluorescence-embedding Viewer • Updated Dec 18, 2024 • 188k • 110
Gathering Context for Decision Support with LLMs stair-lab/bosd_initial_dataset Viewer • Updated Jan 7 • 568 • 3
Finetuning and Comprehensive Evaluation of Vietnamese LLM stair-lab/MATH_vi Viewer • Updated Sep 1, 2024 • 25k • 19 • 1 stair-lab/VSMEC Viewer • Updated Sep 1, 2024 • 6.24k • 3 stair-lab/ViHSD Viewer • Updated Sep 1, 2024 • 30.7k • 6 stair-lab/VSFC Viewer • Updated Sep 1, 2024 • 14.6k • 5
Dynamics of Learning Datasets and Models for the CodeInsights Projects stair-lab/code_insights_jsons Preview • Updated Dec 11, 2024 • 1 stair-lab/code_insights_csv Viewer • Updated Apr 16 • 3.07M • 5 • 1 stair-lab/code_insights_matrices Preview • Updated Dec 12, 2024 • 1 stair-lab/code-insights-llm_simulator Text Generation • 8B • Updated Sep 8, 2024
Cultural Alignment akhilayerukola/NormAd Viewer • Updated Oct 25, 2024 • 2.63k • 127 • 1 ura-hcmut/ECLeKTic Preview • Updated Jun 5 • 25 • 1 ToxicityPrompts/PolyGuardPrompts Viewer • Updated Jun 23 • 29.3k • 167 SALT-NLP/CultureBank Viewer • Updated Apr 24, 2024 • 23k • 222 • 15