The Markovian Thinker Collection Reformulating the RL of reasoning LLMs through the Markovian Thinking paradigm. • 7 items • Updated Oct 9 • 11
Qwen3 Omni Demo ⚡ Interact with a multimodal chatbot using text, audio, images, or video • 186
Lost in Embeddings: Information Loss in Vision-Language Models Paper • 2509.11986 • Published Sep 15 • 27
OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling Paper • 2509.12201 • Published Sep 15 • 103
Article • Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face • Jul 29 • 199
Watch, Listen, Understand, Mislead: Tri-modal Adversarial Attacks on Short Videos for Content Appropriateness Evaluation Paper • 2507.11968 • Published Jul 16
MIMIC: Multimodal Islamophobic Meme Identification and Classification Paper • 2412.00681 • Published Dec 1, 2024