Xoron-Dev-MultiMoe-GGUF?

#1
by Rebis - opened

Hi,
Is a GGUF version planned for the near future?
Thank you in advance.

Yes, I would love to add full GGUF support. However, the model cannot support it yet because of its architectural complexity. GGUF engines and tools like llama.cpp, LM Studio, or Ollama would not fully support my architecture or the modifications I have made.
Can I ask why you chose GGUF? Is it for quantization, or for general model use with a server, API, or Ollama?

I want to use it with Ollama. Quantization is not a priority since the model seems rather small. I would also like to know if you have considered using Qwen3-VL with mergekit. I saw that it is now supported.
