Text Generation
Transformers
PyTorch
llava

Model Card for LISA-RS

We fine-tune LISA reasoning segmentation model with dataset from AVS training dataset from AVS-Bench.

For more information on usage, please refer to the LISA-AVS Github repository here.

Citation

@inproceedings{tan2025searchtta,
  title     = {Search-TTA: A Multimodal Test-Time Adaptation Framework for Visual Search in the Wild},
  author    = {Derek Ming Siang Tan, Shailesh, Boyang Liu, Alok Raj, Qi Xuan Ang, Weiheng Dai, Tanishq Duhan, Jimmy Chiun, Yuhong Cao, Florian Shkurti, Guillaume Sartoretti},
  booktitle = {Conference on Robot Learning},
  year      = {2025},
  url       = {https://arxiv.org/abs/2505.11350}
}

@article{lai2023lisa,
  title={LISA: Reasoning Segmentation via Large Language Model},
  author={Lai, Xin and Tian, Zhuotao and Chen, Yukang and Li, Yanwei and Yuan, Yuhui and Liu, Shu and Jia, Jiaya},
  journal={arXiv preprint arXiv:2308.00692},
  year={2023}
}
Downloads last month
13
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Space using derektan95/LISA-AVS 1

Collection including derektan95/LISA-AVS