Please make MOE models

#6 by Narutoouz - opened

You guys are making the best models for local inference; I love the GLM4-32B and 9B models. I can't wait to see what a wonderful MoE you could make. It would also be great if it had both thinking and non-thinking modes like the Qwen3 series of models.
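
For reference, the thinking/non-thinking toggle requested here is the one Qwen3 exposes through its chat template. Below is a minimal sketch of that flow, assuming the standard `transformers` usage described in the Qwen3 model cards; the model id and prompt are placeholders.

```python
# Sketch of the Qwen3-style thinking toggle via the chat template
# (assumption: standard transformers flow as documented for Qwen3; model id is a placeholder).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen3-8B"  # placeholder; any Qwen3 checkpoint with the toggle works
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

messages = [{"role": "user", "content": "Summarize mixture-of-experts in two sentences."}]
prompt = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=False,  # True -> emit a reasoning block first; False -> answer directly
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(output_ids[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```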

We will make it. Although I don't know the open-source release timing yet, we will try our best, and we will disclose information once there is new progress.

I agree. Would love to see a smaller one similar to Qwen3 30B-A3B and a bigger one. Native multimodality and more attention heads for better instruction following over a larger context would be nice to see as well.

Yes, a GLM 30B A3B would be great.

I vote for 70~80B-A8~9B; A3B is fast but too weak in practice.

Thank you, looking forward to it.

GLM already surprised me with the 32B and 9B models, which matched or outdid SOTA models on similar tasks. Their MoE model will be the same. Just like the Mistral 24B multimodal model, if it is also multimodal, it will be the cherry on the cake for the open-source community!

They will release a 100B A10B MoE model

The 100B A10B MoE model (in the PR) is not the final one; it is only a test name. Please wait for our final checkpoint, and the size may change (it is approximate).

Most people have 32 GB RAM, so I would like to see a MoE with around 30-40B total parameters.
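
As a rough sanity check on that sizing, here is a back-of-the-envelope estimate. The numbers are my own assumptions, not from the thread: about 4.5 bits per weight for a Q4-style quant plus roughly 20% overhead for KV cache and runtime buffers.

```python
# Back-of-the-envelope RAM estimate for a locally hosted, quantized MoE.
# Assumptions (not from the thread): ~4.5 bits/weight for a Q4-style quant,
# plus ~20% overhead for KV cache, activations, and runtime buffers.

def approx_ram_gb(total_params_b: float, bits_per_weight: float = 4.5, overhead: float = 0.20) -> float:
    weights_gb = total_params_b * bits_per_weight / 8  # billions of params * bytes/param ~= GB
    return weights_gb * (1 + overhead)

for size_b in (30, 40):
    print(f"~{size_b}B total params -> ~{approx_ram_gb(size_b):.0f} GB RAM")
# Prints roughly 20 GB for 30B and 27 GB for 40B, i.e. both fit within 32 GB with room for the OS.
```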

This PR comes from the ModelScope community, not us. Please wait for our update.

Please have an AWQ version as well.
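
For context, an AWQ checkpoint is typically served with an engine that supports the format, such as vLLM. The sketch below uses a hypothetical repo id (no such AWQ release is implied by this thread); only the `quantization="awq"` option is standard vLLM usage.

```python
# Hypothetical serving sketch for an AWQ quant with vLLM.
# The model id below is a placeholder -- no official GLM AWQ repo is implied.
from vllm import LLM, SamplingParams

llm = LLM(model="zai-org/GLM-MoE-AWQ", quantization="awq")  # placeholder repo id
params = SamplingParams(temperature=0.7, max_tokens=256)
outputs = llm.generate(["Explain mixture-of-experts routing in one paragraph."], params)
print(outputs[0].outputs[0].text)
```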
