Request: DOI
#54 opened 8 months ago by hamad-83

Can't load model with SentenceTransformers 3.0.1 AttributeError: 'LatentAttentionConfig' object has no attribute '_attn_implementation_internal'
9 comments
#50 opened about 1 year ago by jswarner85

Can we get a simple example please using "huggingfaceembeddings" from the Langchain library or whatever class is the correct one?
2 reactions
#48 opened about 1 year ago by ctranslate2-4you

Update feature for NVEmbedConfig class
1 comment
#45 opened over 1 year ago by lukelv

Batch_size
#44 opened over 1 year ago by lukelv

replicate experimental results on the MTEB dataset
1 comment
#42 opened over 1 year ago by lzq2021

Code trying to download model from huggingface instead of using Locally Downloaded Model
4 comments
#41 opened over 1 year ago by sharedJackpot

Model Loading Error
3 comments
#40 opened over 1 year ago by kcsham

Supporting Flash Attention 2.0
#39 opened over 1 year ago by Cdemir

'MistralModel' object has no attribute 'encode'
1 comment
#38 opened over 1 year ago by dadada

Has the tokenizer of the base model(Mistral-7B-v0.1) been retrained?
#37 opened over 1 year ago by LH0521

How did you trained your LatentAttentionLayer?
1 comment
#36 opened over 1 year ago by juneonetwothree

Why do we need to hardcode self._attn_implementation = "eager"
1 comment
#35 opened over 1 year ago by shantanuagarwal

Error to load model with HuggingFace API
1 comment
#34 opened over 1 year ago by deleted

Regarding max seq length
1 comment
#33 opened over 1 year ago by sandeep456

How to fine-tune this model?
23 reactions
#32 opened over 1 year ago by caochengchen

error with module datasets
2 comments
#31 opened over 1 year ago by claraadam

Distant resource does not have a Content-Length
#30 opened over 1 year ago by caochengchen

Best instructions for clustering and semantic similarity
2 comments
#29 opened over 1 year ago by rmilliere

Dataloader multiprocessing error
1 comment
#28 opened over 1 year ago by Atsunori

Fixing "KeyError: 'NVEmbedConfig'"
10 comments
#27 opened over 1 year ago by Th3l

Error using multi-gpu support
5 comments
#26 opened over 1 year ago by bobwhiterabbit

Access to model nvidia/NV-Embed-v1 is restricted. You must be authenticated to access it
6 comments
#25 opened over 1 year ago by yijiu

Matryoshka Embedding
1 comment
#24 opened over 1 year ago by XingyanZhang

nvidia/NV-Embed-v1 is not the path to a directory containing a file named config.json.
3 comments
#23 opened over 1 year ago by XuehangCang

Finetuning guidelines
5 reactions, 1 comment
#21 opened over 1 year ago by mali404

How much VRAM is needed to run this model? Like for the bare minimum length etc?
3 comments
#20 opened over 1 year ago by smpa239

Ollama Version
1 reaction, 1 comment
#19 opened over 1 year ago by yangwang825

Weights are in FP16 (loaded in FP32) but paper mentions BF16
6 reactions
#17 opened over 1 year ago by AdrienC

ONNX version
1 comment
#16 opened over 1 year ago by michaelfeil

Sentence Transformer compatibility
1 reaction, 4 comments
#15 opened over 1 year ago by michaelfeil

Please provide a 8bit quantified version
4 reactions
#14 opened over 1 year ago by fukai

How to use for AutoModelForSequenceClassification?
5 reactions
#13 opened over 1 year ago by deshwalmahesh

Possible to implement `_no_split_modules` attribute?
1 comment
#12 opened over 1 year ago by ronnybehrens

missing citation
3 comments
#11 opened over 1 year ago by SeanLee97

Multi-Lingual?
2 comments
#10 opened over 1 year ago by dejanseo

Getting "KeyError" when loading model
1 reaction, 5 comments
#8 opened over 1 year ago by tsakaiba

TypeError: MistralDecoderLayer.forward() got an unexpected keyword argument 'is_causal'
1 reaction, 3 comments
#7 opened over 1 year ago by yxzwayne

Is this model active?
1 comment
#5 opened over 1 year ago by gsnic

Sharing training data & reproducing training
6 reactions, 2 comments
#4 opened over 1 year ago by xhluca