mltrials

AI & ML interests

None yet

Recent Activity

upvoted an article 24 days ago

We Got Claude to Fine-Tune an Open Source LLM

upvoted a collection 3 months ago

Granite Docling

liked a model 3 months ago

OmniDimen/OmniDimen-4B-Emotion-GGUF-q4_K_M

Organizations

None yet

upvoted an article 24 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

26 days ago • 547

upvoted a collection 3 months ago

Granite Docling

Collection

5 items • Updated Nov 17 • 60

liked a model 3 months ago

OmniDimen/OmniDimen-4B-Emotion-GGUF-q4_K_M

Text Generation • 4B • Updated Sep 19 • 335 • 5

upvoted an article 3 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4 • 267

upvoted 2 papers 4 months ago

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19 • 48

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 129

liked a model 6 months ago

osmosis-ai/Osmosis-Apply-1.7B

Text Generation • 2B • Updated Jul 3 • 71 • 91

upvoted an article 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8 • 740

upvoted a paper 6 months ago

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15 • 63

updated a model 7 months ago

mltrials/opt-350m-lora

Updated Jun 9

published a model 7 months ago

mltrials/opt-350m-lora

Updated Jun 9

liked a model 10 months ago

meta-llama/Meta-Llama-3-8B-Instruct

Text Generation • 8B • Updated Jun 18 • 1.55M • 4.34k

liked a model 11 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 517k • 12.9k

upvoted 3 papers 12 months ago

upvoted 4 articles 12 months ago

Article

🌁#81: Key AI Concepts to Follow in 2025

Dec 23, 2024

Article

Fine-tune ModernBERT for text classification using synthetic data

Dec 30, 2024

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Jan 2

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

Jan 3
