Ofir Zafrir
ofirzaf
AI & ML interests
Sparsity, Qunatization, Model Compression
Recent Activity
upvoted
an
article
30 days ago
Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models
published
an
article
about 1 month ago
Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models
liked
a model
about 1 month ago
OpenVINO/Qwen3-pruned-6L-from-0.6B-int8-ov