DETR-R101-DC5 object detector, finetuned on PaintSkills Dataset for visual reasoning skill evaluation of text-to-image generation models.
Please check https://github.com/j-min/DallEval/tree/main/paintskills for the instruction for running skill evaluation with the DETR model.
- Paper: DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)
 - Authors: Jaemin Cho, Abhay Zala, Mohit Bansal
 
@inproceedings{Cho2023DallEval,
  title         = {DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models},
  author        = {Jaemin Cho and Abhay Zala and Mohit Bansal},
  year          = {2023},
  booktitle     = {ICCV},
}