did you first Quntize to QAT and then mlx , what kind of approch you did , the reason i am asking cause if there is no QAT , then it will lose alot of accuracy in int 4
· Sign up or log in to comment