
Alara Dirik

adirik

alaradirik

AI & ML interests

None yet

Recent Activity

liked a Space about 18 hours ago

HumanAIGC/OutfitAnyone

liked a Space about 18 hours ago

WeShopAI/WeShopAI-Virtual-Try-On

liked a Space about 18 hours ago

yisol/IDM-VTON


Organizations

upvoted 3 articles 8 days ago

Article

FineVideo: behind the scenes

and 5 others • Sep 23, 2024 • 34

Article

CinePile 2.0 - making stronger datasets with adversarial refinement

and 3 others • Oct 23, 2024 • 18

Article

TimeScope: How Long Can Your Video Large Multimodal Model Go?

and 3 others • 9 days ago • 30

upvoted an article 10 days ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

and 2 others • May 14, 2024 • 263

upvoted 4 collections about 2 months ago

upvoted a paper about 2 months ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published Jun 3 • 58

upvoted an article 2 months ago

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others • May 12 • 491

upvoted a collection 3 months ago

D-FINE

Collection

State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated May 5 • 55

upvoted 2 articles 5 months ago

Article

SigLIP 2: A better multilingual vision language encoder

and 2 others • Feb 21 • 174

Article

FastRTC: The Real-Time Communication Library for Python

and 1 other • Feb 25 • 171

upvoted 4 articles 6 months ago

Article

Build awesome datasets for video generation

Feb 12 • 34

Article

Open-source DeepResearch – Freeing our search agents

and 4 others • Feb 4 • 1.28k

Article

🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker!

Jan 29 • 19

Article

Welcome to Inference Providers on the Hub 🔥

and 6 others • Jan 28 • 485

upvoted a paper 6 months ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 160

upvoted an article 6 months ago

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

Jan 20 • 70

upvoted an article 7 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15 • 199
