fastllm model for Qwen-7B-Chat-int8 (Qwen-7B-Chat quantized to int8 in the fastllm weight format).
GitHub address: https://github.com/ztxz16/fastllm
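
A minimal usage sketch with fastllm's Python bindings (`fastllm_pytools`). The weight file name `qwen-7b-chat-int8.flm` is an assumption for illustration; use the actual `.flm` file from this repository.

```python
# Minimal sketch: load the int8 fastllm weights and run a single chat turn.
# Assumes the fastllm Python package (fastllm_pytools) has been built and installed
# per the instructions in the GitHub repo above.
from fastllm_pytools import llm

# "qwen-7b-chat-int8.flm" is a hypothetical file name; replace it with the
# .flm file actually shipped in this repository.
model = llm.model("qwen-7b-chat-int8.flm")

# Generate a single-turn response from the chat model.
print(model.response("Hello, please introduce yourself."))
```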