martin's picture

martin

martintomov

·

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

LiquidAI/LFM2.5-VL-450M

liked a dataset 2 days ago

allenai/WildDet3D-Data

liked a model 6 days ago

tiiuae/Falcon-Perception

View all activity

Organizations

upvoted a collection 23 days ago

MolmoPoint

MolmoPoint models • 3 items • Updated 23 days ago • 11

upvoted a collection about 1 month ago

Qwen3.5

21 items • Updated Mar 9 • 1.48k

upvoted a paper 6 months ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 130

upvoted an article 12 months ago

Article

How to Build an MCP Server with Gradio

Apr 30, 2025

•

202

upvoted a collection about 1 year ago

JARVIS-VLA-v1

Vision-Language-Action Models in Minecraft. • 4 items • Updated Mar 22, 2025 • 11

upvoted an article about 1 year ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

+5

Feb 20, 2025

•

336

upvoted a collection about 1 year ago

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7, 2025 • 67

upvoted an article about 1 year ago

Article

Open-source DeepResearch – Freeing our search agents

+3

Feb 4, 2025

•

1.32k

upvoted a collection about 1 year ago

Cosmos

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/nvidia-cosmos-2 • 14 items • Updated 4 days ago • 301

upvoted a collection over 1 year ago

[MASK] is All You Need

Code, dataset, and pretrained model • 6 items • Updated Feb 6, 2025 • 9

upvoted a paper over 1 year ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 135

upvoted a collection over 1 year ago

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 30 days ago • 152

upvoted a paper over 1 year ago

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published Nov 22, 2024 • 61

upvoted a collection over 1 year ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated Mar 2 • 711

upvoted 6 papers over 1 year ago

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published Nov 14, 2024 • 80

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Paper • 2411.07232 • Published Nov 11, 2024 • 68

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 154

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Paper • 2411.04928 • Published Nov 7, 2024 • 56

Adding Conditional Control to Text-to-Image Diffusion Models

Paper • 2302.05543 • Published Feb 10, 2023 • 58

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 115