Do you have any plans to release a 4-bit DWQ quant of GLM 4.6?
Have a look at https://github.com/ml-explore/mlx-lm/pull/536#issuecomment-3382233556 for the work Awni is doing on this.