daisuke
dai
AI & ML interests
dai incl. AI.
so Contender.
Recent Activity
reacted
to
prithivMLmods's
post
with π
8 days ago
The demo of Qwen3-VL-30B-A3B-Instruct, the next-generation and powerful vision-language model in the Qwen series, delivers comprehensive upgrades across the board β including superior text understanding and generation, deeper visual perception and reasoning, extended context length, enhanced spatial and video dynamics comprehension, and stronger agent interaction capabilities. π€π₯
β‘ Space / App: https://huggingface.co/spaces/prithivMLmods/Qwen3-VL-HF-Demo
The modelβs demo supports a wide range of tasks, including;
Image Inference, Video Inference, PDF Inference, Image Captioning (VLA), GIF Inference.
β‘ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations-67c9982ea04b39f0608badb0
Thanks for granting the blazing-fast Zero GPU access, @merve π
β‘ Other Pages
> Github: https://github.com/prithivsakthiur/qwen3-vl-hf-demo
> Multimodal VLMs July'25 : https://huggingface.co/collections/prithivMLmods/multimodal-vlms-until-july25-688312e6b840e1e156f13027
> VL caption β < Sep 15 β25 : https://huggingface.co/collections/prithivMLmods/vl-caption-sep-15-25-68c7f6d737985c63c13e2391
> Multimodal VLMs - Aug'25 : https://huggingface.co/collections/prithivMLmods/multimodal-vlms-aug25-68a56aac39fe8084f3c168bd
To know more about it, visit the app page or the respective model page!!
liked
a model
19 days ago
distil-labs/Llama-3_2-gitara-3B
liked
a dataset
21 days ago
nvidia/Nemotron-Personas-Japan