arxiv:2502.13791
Sandro Pezzelle
sandropezzelle
AI & ML interests
None yet
Recent Activity
liked
a dataset
12 days ago
MBZUAI/ViMUL-Bench
authored
a paper
5 months ago
The LAMBADA dataset: Word prediction requiring a broad discourse context
authored
a paper
5 months ago
LLMs instead of Human Judges? A Large Scale Empirical Study across 20
NLP Evaluation Tasks