Michael Goin
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
updated a collection 1 day ago: vLLM Kernels
updated a dataset 16 days ago: mgoin/mlperf-inference-llama3.1-8b-data