Mengzhao Chen

ChenMnZ

https://chenmnz.github.io/

AI & ML interests

model compression

Recent Activity

upvoted a paper 1 day ago

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

upvoted a paper 8 days ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

upvoted a paper 26 days ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Organizations

None yet

ChenMnZ's models (129)

ChenMnZ/Mistral-Large-Instruct-2407-EfficientQAT-w2g64-GPTQ

10B • Updated Aug 6, 2024 • 3 • 25

ChenMnZ/Llama-3-70b-EfficientQAT-w4g128-BitBLAS

Text Generation • 37B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128-BitBLAS

Text Generation • 20B • Updated Jul 22, 2024

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128-BitBLAS

Text Generation • 5B • Updated Jul 22, 2024 • 16

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64-BitBLAS

Text Generation • 3B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128-BitBLAS

Text Generation • 3B • Updated Jul 22, 2024

ChenMnZ/Llama-3-8b-EfficientQAT-w4g128-BitBLAS

Text Generation • 5B • Updated Jul 22, 2024

ChenMnZ/Llama-3-8b-EfficientQAT-w2g64-BitBLAS

Text Generation • 3B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-8b-EfficientQAT-w2g128-BitBLAS

Text Generation • 3B • Updated Jul 22, 2024

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128-BitBLAS

Text Generation • 37B • Updated Jul 22, 2024 • 3

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g64-BitBLAS

Text Generation • 21B • Updated Jul 22, 2024 • 10

ChenMnZ/Llama-3-8b-EfficientQAT-w2g128-GPTQ

Text Generation • 2B • Updated Jul 22, 2024

ChenMnZ/Llama-3-70b-EfficientQAT-w2g64-BitBLAS

Text Generation • 21B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128-GPTQ

Text Generation • 2B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64-GPTQ

Text Generation • 2B • Updated Jul 22, 2024

ChenMnZ/Llama-3-70b-EfficientQAT-w2g128-BitBLAS

Text Generation • 20B • Updated Jul 22, 2024

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128-GPTQ

Text Generation • 2B • Updated Jul 22, 2024 • 34 • 1

ChenMnZ/Llama-3-8b-EfficientQAT-w4g128-GPTQ

Text Generation • 2B • Updated Jul 22, 2024 • 2 • 1

ChenMnZ/Llama-3-8b-EfficientQAT-w2g64-GPTQ

Text Generation • 2B • Updated Jul 22, 2024 • 3

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128

Text Generation • 11B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128-GPTQ

Text Generation • 11B • Updated Jul 22, 2024

ChenMnZ/Llama-2-7b-EfficientQAT-w4g128-BitBLAS

Text Generation • 4B • Updated Jul 22, 2024 • 56

ChenMnZ/Llama-2-7b-EfficientQAT-w2g64-BitBLAS

Text Generation • 2B • Updated Jul 22, 2024 • 4

ChenMnZ/Llama-2-7b-EfficientQAT-w2g128-BitBLAS

Text Generation • 2B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g64-GPTQ

Text Generation • 8B • Updated Jul 22, 2024

ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-BitBLAS

Text Generation • 36B • Updated Jul 22, 2024

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128-GPTQ

Text Generation • 7B • Updated Jul 22, 2024 • 11

ChenMnZ/Llama-3-70b-EfficientQAT-w4g128-GPTQ

Text Generation • 11B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128

Text Generation • 2B • Updated Jul 22, 2024 • 5

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w3g128

Text Generation • 2B • Updated Jul 22, 2024 • 1