RynnVLA-002: A Unified Vision-Language-Action and World Model Paper β’ 2511.17502 β’ Published 12 days ago β’ 24
Parrot: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs Paper β’ 2511.17220 β’ Published 12 days ago β’ 15
Running on Zero Featured 295 Depth Anything 3 π’ 295 Generate depth maps from images using GPU acceleration
Benchmarking Diversity in Image Generation via Attribute-Conditional Human Evaluation Paper β’ 2511.10547 β’ Published 20 days ago β’ 4
UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist Paper β’ 2511.08521 β’ Published 22 days ago β’ 37
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models Paper β’ 2511.10629 β’ Published 20 days ago β’ 117
Depth Anything 3: Recovering the Visual Space from Any Views Paper β’ 2511.10647 β’ Published 20 days ago β’ 90
Running on Zero MCP Featured 32 Qwen3 VL HF Demo π₯ 32 object detection, visual grounding, keypoint detection
Kimi Linear: An Expressive, Efficient Attention Architecture Paper β’ 2510.26692 β’ Published Oct 30 β’ 113
Runtime error Featured 46 WithAnyone Demo π 46 WithAnyone is capable of generating high-quality, controllab
WithAnyone: Towards Controllable and ID Consistent Image Generation Paper β’ 2510.14975 β’ Published Oct 16 β’ 83
huihui-ai/Qwen2.5-VL-7B-Instruct-abliterated Image-Text-to-Text β’ 8B β’ Updated 25 days ago β’ 1.08k β’ 33
Running 6 Huihui Ai Mistral Small 24B Instruct 2501 Abliterated π» 6 Generate text using a large language model
huihui-ai/Phi-4-multimodal-instruct-abliterated Automatic Speech Recognition β’ 6B β’ Updated Mar 3 β’ 133 β’ 25
huihui-ai/Huihui-Qwen3-VL-30B-A3B-Instruct-abliterated Image-Text-to-Text β’ 31B β’ Updated Nov 1 β’ 25.1k β’ 66