variante/llava-1.5-7b-llara-D-RT2-Style-VIMA-80k Image-Text-to-Text • 7B • Updated Aug 28, 2024 • 2
LLaRA Collection • Models released with LLaRA: Supercharging Robot Learning Data for Vision-Language Policy • 7 items • Updated Aug 28, 2024 • 1
Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5 Image-Text-to-Text • 4B • Updated Feb 3 • 735 • 56
variante/llava-1.5-7b-llara-D-inBC-VIMA-80k Image-Text-to-Text • 7B • Updated Jul 13, 2024 • 3 • 1
variante/llava-1.5-7b-llara-D-inBC-Aux-D-VIMA-80k Image-Text-to-Text • 7B • Updated Jul 13, 2024 • 3 • 1
variante/llava-1.5-7b-llara-D-inBC-Aux-B-VIMA-80k Image-Text-to-Text • 7B • Updated Jul 15, 2024 • 9 • 2
Theia: Distilling Diverse Vision Foundation Models for Robot Learning Paper • 2407.20179 • Published Jul 29, 2024 • 47
theaiinstitute/theia-tiny-patch16-224-cddsv Feature Extraction • 16.2M • Updated Jul 30, 2024 • 2.4k • 4
theaiinstitute/theia-base-patch16-224-cdiv Feature Extraction • 0.1B • Updated Jul 30, 2024 • 4.14k • 8
theaiinstitute/theia-small-patch16-224-cdiv Feature Extraction • 36.7M • Updated Jul 30, 2024 • 27 • 3