Ilyas Moutawwakil

IlyasMoutawwakil

AI & ML interests

Optimization, LLMs, Hardware, Backends, ..

Recent Activity

posted an update about 20 hours ago

🚀 Optimum: The Last v1 Release 🚀

Optimum v1.27 marks the final major release in the v1 series. As we close this chapter, we're laying the groundwork for a more modular and community-driven future:
- Optimum v2: A lightweight core package for porting Transformers, Diffusers, or Sentence-Transformers to specialized AI hardware/software/accelerators.
- Optimum‑ONNX: A dedicated package where the ONNX/ONNX Runtime ecosystem lives and evolves, faster-moving and decoupled from the Optimum core.

🎯 Why this matters:
- A clearer governance path for ONNX, fostering stronger community collaboration and an improved developer experience.
- Faster innovation in a more modular, open-source environment.

💡 What this means:
- More transparency, broader participation, and faster development driven by the community and key actors in the ONNX ecosystem (PyTorch, Microsoft, Joshua Lochner 👀, ...)
- A cleaner, more maintainable core Optimum, focused on extending HF libraries to specialized AI hardware/software/accelerator tooling and used by our partners (Intel Corporation, Amazon Web Services (AWS), AMD, NVIDIA, FuriosaAI, ...)

🛠️ Major updates I worked on in this release:
✅ Added support for Transformers v4.53 and SmolLM3 in ONNX/ONNX Runtime.
✅ Solved batched inference/generation for all supported decoder model architectures (LLMs).

✨ Big shoutout to @echarlaix for leading the refactoring work that cleanly separated ONNX exporter logic and enabled the creation of Optimum‑ONNX.

📝 Release Notes: https://lnkd.in/gXtE_qji
📦 Optimum: https://lnkd.in/ecAezNT6
🎁 Optimum-ONNX: https://lnkd.in/gzjyAjSi

#Optimum #ONNX #OpenSource #HuggingFace #Transformers #Diffusers

upvoted an article 3 days ago

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

updated a model 4 days ago

optimum-internal-testing/tiny-random-falcon-alibi-True

New activity in onnx/export 3 months ago

onnx-export-fix

#13 opened 3 months ago by

New activity in optimum-intel/benchmark-openvino 10 months ago

Benchmark results for model trl-internal-testing/tiny-random-LlamaForCausalLM and task text-generation

#1 opened 10 months ago by

IlyasMoutawwakil

Benchmark results for model trl-internal-testing/tiny-random-LlamaForCausalLM and task text-generation

#2 opened 10 months ago by

IlyasMoutawwakil

New activity in optimum/auto-benchmark 10 months ago

Please ask what the problem is:

#1 opened over 1 year ago by

New activity in optimum/llm-perf-leaderboard 12 months ago

add t4 to leaderboard

#30 opened 12 months ago by

New activity in optimum/llm-perf-leaderboard about 1 year ago

Why eliminate the Nvidia 4090 benchmark

#28 opened about 1 year ago by

MarxistLeninist

New activity in optimum-intel/fastrag-e2e about 1 year ago

upgrade to recent gradio

#1 opened about 1 year ago by

New activity in IlyasMoutawwakil/tiny-random-LlavaForConditionalGeneration over 1 year ago

Does not contain `<image>` token.

#1 opened over 1 year ago by

New activity in optimum/llm-perf-leaderboard over 1 year ago

How to request performance evaluation with 'llm-perf-backend'?

#27 opened over 1 year ago by

How often do you update the leaderboard?

#22 opened over 1 year ago by

Cache the Results?

#16 opened almost 2 years ago by

What are differences between Max Allocated Memory, Max Reserved Memory and Max Used Memory?

#21 opened over 1 year ago by

How to download the table as csv file?

#18 opened over 1 year ago by

Where to download the leaderboard?

#26 opened over 1 year ago by

What is the testing prompt?

#23 opened over 1 year ago by

🚩 Report: Not working

#24 opened over 1 year ago by

I have added an application to Open LLM Leaderboard and see my scores on it, but how do I use the LLM Performance Leaderboard?

#25 opened over 1 year ago by

Mention models with latencies that are too high

#6 opened about 2 years ago by

Bug report, application fails to start

#15 opened almost 2 years ago by

Accuracy differences for models with quantization

#12 opened almost 2 years ago by