Mohd Kaif

kaifahmad

https://github.com/KaifAhmad1

AI & ML interests

Open Source AI

Recent Activity

liked a model about 1 month ago

moonshotai/Kimi-K2-Thinking

upvoted a collection about 1 month ago

Agent training frameworks

upvoted a paper about 1 month ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

upvoted a collection about 1 month ago

Agent training frameworks

1 item • Updated Apr 30 • 1

upvoted a paper about 1 month ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24 • 80

upvoted 2 collections about 2 months ago

VideoPrism

VideoPrism is a foundational video encoder that enables state-of-the-art performance on a large variety of video understanding tasks. • 5 items • Updated Jul 16 • 12

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated Jul 11 • 357

upvoted an article about 2 months ago

Supercharge your OCR Pipelines with Open Models

Article • Oct 21 • 275

upvoted an article 2 months ago

Public AI on Hugging Face Inference Providers 🔥

Article • Sep 17 • 22

upvoted a collection 3 months ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated 6 days ago • 159

upvoted a collection 4 months ago

Health AI Developer Foundations (HAI-DEF)

Groups models released for use in health AI by Google. Read more about HAI-DEF at https://developers.google.com/health-ai-developer-foundations • 15 items • Updated Jul 10 • 118

upvoted 3 articles 4 months ago

Welcome GPT OSS, the new open-source model family from OpenAI!

Article • Aug 5 • 509

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Article • Apr 19, 2024 • 188

Evaluating Audio Reasoning with Big Bench Audio

Article • Dec 20, 2024 • 26

upvoted an article 6 months ago

Featherless AI on Hugging Face Inference Providers 🔥

Article • Jun 12 • 48

upvoted an article 7 months ago

Vision Language Models (Better, faster, stronger)

Article • May 12 • 568

upvoted a collection 8 months ago

Describe Anything

Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 6 days ago • 60

upvoted an article 8 months ago

State of open video generation models in Diffusers

Article • Jan 27 • 63

upvoted an article 9 months ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • Mar 12 • 474

upvoted 2 articles 10 months ago

SigLIP 2: A better multilingual vision language encoder

Article • Feb 21 • 193

Welcome to Inference Providers on the Hub 🔥

Article • Jan 28 • 490

upvoted 2 articles about 1 year ago

Visually Multilingual: Introducing mcdse-2b

Article • Oct 27, 2024 • 41

Deploying Speech-to-Speech on Hugging Face

Article • Oct 22, 2024 • 45