Screenspot evaluation

#4
by jfwang213 - opened

May I ask if you have run the Screenspot evaluation dataset? The results we obtained are not very satisfactory.

In this version, no specific optimizations have been applied to the GUI tasks.

Sign up or log in to comment