Vikas Kumar's picture

36 53

Vikas Kumar

vikas

·

vikasiitkgp

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

SmolLM3: smol, multilingual, long-context reasoner

upvoted an article 26 days ago

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

liked a model about 2 months ago

Qwen/Qwen3-Embedding-0.6B-GGUF

View all activity

Organizations

upvoted an article 4 days ago

Article

SmolLM3: smol, multilingual, long-context reasoner

By

and 22 others •

22 days ago

• 591

upvoted an article 26 days ago

Article

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

By

and 1 other •

29 days ago

• 105

upvoted a collection 2 months ago

Deepseek Papers

Deepseek papers collection • 24 items • Updated 9 days ago • 264

upvoted an article 2 months ago

Article

The Transformers Library: standardizing model definitions

By

and 3 others •

May 15

• 116

upvoted a paper 5 months ago

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 36

upvoted an article 6 months ago

Article

1 Billion Classifications

By

•

Feb 13

• 43

upvoted an article 7 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

By

•

Jan 15

• 197

upvoted an article 11 months ago

Article

The 5 Most Under-Rated Tools on Hugging Face

By

•

Aug 22, 2024

• 90

upvoted 2 papers 12 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 162

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12, 2024 • 70

upvoted an article 12 months ago

Article

Finetuning PaliGemma with AutoTrain

By

•

Jul 25, 2024

• 11

upvoted a collection 12 months ago

Gemma 2 2B Release

The 2.6B parameter version of Gemma 2. • 6 items • Updated 20 days ago • 81

upvoted an article 12 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By

•

Jul 29, 2024

• 350

upvoted an article about 1 year ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

By

and 7 others •

Jul 23, 2024

• 236

upvoted a collection about 1 year ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 231

upvoted 3 articles about 1 year ago

Article

SmolLM - blazingly fast and remarkably powerful

By

and 2 others •

Jul 16, 2024

• 401

Article

The Rise of Agentic Data Generation

By

•

Jul 15, 2024

• 83

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By

•

Jul 5, 2024

• 281

upvoted a collection about 1 year ago

Florence

9 items • Updated May 1 • 172

upvoted an article about 1 year ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

By

and 2 others •

Jun 24, 2024

• 199