1 42 44

Igor Gromov

Transformator

AI & ML interests

None yet

Recent Activity

upvoted a collection 2 days ago

The Bestiary

liked a model 2 days ago

p-e-w/gemma-3-12b-it-heretic

upvoted a paper 7 days ago

TiDAR: Think in Diffusion, Talk in Autoregression

View all activity

Organizations

None yet

upvoted a collection 2 days ago

The Bestiary

Collection

Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated 4 days ago • 54

upvoted a paper 7 days ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published 8 days ago • 92

upvoted a collection 8 days ago

MDGA

Collection

Make Diffusion Great Again. The resource list for Super Data Learners, Quokka, and OpenMoE 2. • 16 items • Updated 16 days ago • 6

upvoted a paper 8 days ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published 15 days ago • 116

upvoted a paper 12 days ago

Contamination Detection for VLMs using Multi-Modal Semantic Perturbation

Paper • 2511.03774 • Published 14 days ago • 12

upvoted a paper 20 days ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published 21 days ago • 44

upvoted a paper about 1 month ago

Building a Foundational Guardrail for General Agentic Systems via Synthetic Data

Paper • 2510.09781 • Published Oct 10 • 26

upvoted a collection 7 months ago

Health AI Developer Foundations (HAI-DEF)

Collection

Groups models released for use in health AI by Google. Read more about HAI-DEF at https://developers.google.com/health-ai-developer-foundations • 15 items • Updated Jul 10 • 110

upvoted 2 papers 8 months ago

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20 • 95

Aligning Multimodal LLM with Human Preference: A Survey

Paper • 2503.14504 • Published Mar 18 • 26

upvoted an article 8 months ago

Article

Vision Language Models Explained

Apr 11, 2024

•

492

upvoted 3 papers 8 months ago

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published Mar 10 • 88

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 101

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

Paper • 2502.14296 • Published Feb 20 • 45

upvoted an article 8 months ago

Article

Open Source Developers Guide to the EU AI Act

Dec 2, 2024

•

upvoted 5 papers 9 months ago

IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval

Paper • 2503.04644 • Published Mar 6 • 21

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published Mar 4 • 18

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published Mar 3 • 29

ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents

Paper • 2502.18017 • Published Feb 25 • 21

SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering?

Paper • 2502.13233 • Published Feb 18 • 15

Igor Gromov

AI & ML interests

Recent Activity

Organizations

Transformator's activity

Vision Language Models Explained

Open Source Developers Guide to the EU AI Act