A newer version of the Gradio SDK is available: 5.43.1
5.43.1
Example script of using FlashAttention for inference coming soon.