Can Llama-3.1- Nemotron-40B-Instruct be released as well?
π
							
						1
				#24 opened 10 months ago
		by
		
				
							
						tdh111
	
What is the context size this model was trained on?
								2
#23 opened 10 months ago
		by
		
				
							
						treehugg3
	
Modified llama.cpp to generate GGUFs for Llama-3_1-Nemotron-51
β€οΈ
							π₯
							
						2
				#22 opened 11 months ago
		by
		
				
							
						ymcki
	
Documentation about the linear attention used in some layers of this model?
#21 opened 11 months ago
		by
		
				
							
						ymcki
	
Comparison to the 70B model?
π
							
						1
				
									1
	#20 opened 12 months ago
		by
		
				
							
						AIGUYCONTENT
	
Update README.md
#11 opened about 1 year ago
		by
		
				
							
						Vlad748283847
	
vLLM compatible?
π
							
						5
				
								3
#10 opened about 1 year ago
		by
		
				
							
						nickandbro
	
AttributeError: 'DeciLMConfig'
								3
#9 opened about 1 year ago
		by
		
				
							
						bluenevus
	
fp8 / int8 inference - use bitsandbytes or awq
π
							
						2
				#8 opened about 1 year ago
		by
		
				
							
						dtanow
	
GGUF possible ?
π
							β€οΈ
							
						4
				
								2
#5 opened about 1 year ago
		by
		
				
							
						gopi87
	
fine-tuning
#1 opened about 1 year ago
		by
		
				
							
						kzmaker