HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text
Transformers
Safetensors
English
idefics3
image-to-text
multimodal
vision
conversational
Community
25
Potential Inconsistencies Model and Datasets License

#25 opened about 2 months ago by
yueyangchen

Question regarding license

#24 opened 3 months ago by
ir2718

حامد

#23 opened 4 months ago by
psyhamed

Text only generation for Idefics3

3
#22 opened 9 months ago by
FreekCool

Can this model support video understanding and multi-image understanding?

#21 opened 9 months ago by
xJohn

Considerable speed loss after LoRA fine-tuning

14
#14 opened 11 months ago by
ayyylemao

Releasing base model and combined SFT dataset

3
#13 opened 11 months ago by
SS12444

How to use history prompts on the same image?

3
#12 opened 11 months ago by
MotiHa

Image encoding / rescaling Question

1
#11 opened 11 months ago by
ayyylemao

pretraining datasets

1
#8 opened 11 months ago by
yasserTII

gpu requirement

1
#7 opened 12 months ago by
mdeniz1

Support for Llama.cpp

#5 opened 12 months ago by
chibop

Any Idea When This Will Be Supported in TGI?

2
#3 opened 12 months ago by
pr1me

AssertionError: Padding_idx must be within num_embeddings

5
#2 opened 12 months ago by
ZeevRispler

Transformer Issue ?

5
#1 opened 12 months ago by
jgsmcmahon