Step-Audio-R1 Collection Step-Audio-R1 is the first audio language model to successfully unlock test-time compute scaling. • 3 items • Updated 9 days ago • 11
Kandinsky 5.0 Image Lite Collection Kandinsky 5.0 Image Lite is a 6B DiT-based model that generates and edits HD images from English and Russian text prompts with high visual quality. • 4 items • Updated 5 days ago • 12
KVAE 1.0 Collection KVAE 1.0 tokenizers are for images (KVAE-2D-1.0) and video (KVAE-3D-1.0) are distributed under MIT license (commercial use is possible). • 2 items • Updated 5 days ago • 5
Kandinsky 5.0 Video Lite Collection Kandinsky 5.0 Video Lite is a lightweight 2B model that generates up to 10-second SD videos from English and Russian prompts with high visual quality. • 9 items • Updated 5 days ago • 7
SenseNova-SI Collection Scaling Spatial Intelligence with Multimodal Foundation Models • 4 items • Updated 12 days ago • 10
view article Article Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms 10 days ago • 25
DR Tulu Collection Models and data associated with DR Tulu, http://allenai-web/papers/drtulu • 5 items • Updated 5 days ago • 28
MXFP4 Hybrid GGUF Collection MXFP4 hybrid GGUF models getting... well.. Getting some interesting results. • 11 items • Updated 13 days ago • 3
MiroThinker-v1.0 Collection Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling • 7 items • Updated 11 days ago • 39
Jan-v2-VL Collection Jan-v2-VL: an 8B VLM focused on reliable, many-step task execution. • 6 items • Updated 17 days ago • 36