Instructions to use Salesforce/blip-image-captioning-base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Salesforce/blip-image-captioning-base with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "image-to-text" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("image-to-text", model="Salesforce/blip-image-captioning-base")# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("Salesforce/blip-image-captioning-base") model = AutoModelForImageTextToText.from_pretrained("Salesforce/blip-image-captioning-base") - Notebooks
- Google Colab
- Kaggle
Is finetuning possible for this model?
Is finetuning supported for this model? If so can people give me some pointers?
I found this post: https://discuss.huggingface.co/t/finetune-blip-on-customer-dataset-20893/28446
but this is about Salesforce/blip-vqa-base instead of this model Salesforce/blip-image-captioning-large
Is finetuning supported for this model? If so can people give me some pointers?
I found this post: https://discuss.huggingface.co/t/finetune-blip-on-customer-dataset-20893/28446
but this is about Salesforce/blip-vqa-base instead of this model Salesforce/blip-image-captioning-large
Hello!
Have you solved this problem? I hope to get some solutions from you and look forward to your reply.
Good luck to you!
Hi !
Please have a look at this section: https://github.com/NielsRogge/Transformers-Tutorials/tree/master/BLIP-2 from @nielsr as a starting point - good luck !