Image-to-Text
Transformers
Safetensors
qwen2_vl
text-generation-inference

An end-to-end multimodal LLM for Scene Graph Generation (SGG), which was introduced in [Compile Scene Graphs with Reinforcement Learning](https://huggingface.co/papers/2504.13617

Downloads last month
4
Safetensors
Model size
8.29B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for JosephZ/R1-SGG-Zero-7B

Base model

Qwen/Qwen2-VL-7B
Finetuned
(379)
this model

Dataset used to train JosephZ/R1-SGG-Zero-7B