Please specify the model quantization

#1
by Wasim0606 - opened

Did you first train the model with QAT (quantization-aware training) and then convert it to MLX? What approach did you use? I am asking because, without QAT, the model will likely lose a lot of accuracy at int4.
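
To illustrate the concern, here is a rough toy sketch of plain round-to-nearest quantization without any QAT. It is not the method used for this model and not MLX's actual scheme. The symmetric per-group scaling, the group size of 64, the Gaussian weights, and the helper names `quantize_rtn` and `mean_abs_error` are all my own assumptions. It only shows that the rounding error at 4 bits is much larger than at 8 bits, which is the gap QAT is meant to compensate for.

```python
import random


def quantize_rtn(weights, bits, group_size=64):
    """Symmetric round-to-nearest quantization per group, then dequantize.

    Hypothetical helper for illustration only; group_size=64 is an assumption.
    """
    qmax = 2 ** (bits - 1) - 1  # 7 for int4, 127 for int8
    out = []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        # One scale per group; fall back to 1.0 if the group is all zeros.
        scale = max(abs(w) for w in group) / qmax or 1.0
        for w in group:
            # Round to the nearest integer level and clamp to the int range.
            q = max(-qmax - 1, min(qmax, round(w / scale)))
            out.append(q * scale)
    return out


def mean_abs_error(a, b):
    """Average absolute difference between two equal-length lists."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


random.seed(0)
# Toy "weights": small Gaussian values, not taken from any real model.
weights = [random.gauss(0.0, 0.02) for _ in range(4096)]

err4 = mean_abs_error(weights, quantize_rtn(weights, bits=4))
err8 = mean_abs_error(weights, quantize_rtn(weights, bits=8))
print(f"int4 mean abs error: {err4:.6f}")
print(f"int8 mean abs error: {err8:.6f}")
```

Weight error is only a proxy for accuracy loss, so the real impact would still need to be measured on a benchmark.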
