Llama3-8B-1.58 A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! HF1BitLLM/Llama3-8B-1.58-100B-tokens Text Generation • 3B • Updated Sep 19, 2024 • 1.08k • 193 HF1BitLLM/Llama3-8B-1.58-Linear-10B-tokens Text Generation • 3B • Updated Sep 18, 2024 • 83 • 10 HF1BitLLM/Llama3-8B-1.58-Sigmoid-k100-10B-tokens Text Generation • 3B • Updated Sep 18, 2024 • 1 • 9
Llama3-8B-1.58 A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! HF1BitLLM/Llama3-8B-1.58-100B-tokens Text Generation • 3B • Updated Sep 19, 2024 • 1.08k • 193 HF1BitLLM/Llama3-8B-1.58-Linear-10B-tokens Text Generation • 3B • Updated Sep 18, 2024 • 83 • 10 HF1BitLLM/Llama3-8B-1.58-Sigmoid-k100-10B-tokens Text Generation • 3B • Updated Sep 18, 2024 • 1 • 9