prince-canuma commited on
Commit
a3d944f
·
verified ·
1 Parent(s): ac26528

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +35 -0
README.md ADDED
@@ -0,0 +1,35 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: other
5
+ tags:
6
+ - facebook
7
+ - nvidia
8
+ - meta
9
+ - pytorch
10
+ - llama
11
+ - llama-3
12
+ - mlx
13
+ pipeline_tag: text-generation
14
+ license_name: llama3
15
+ license_link: LICENSE
16
+ ---
17
+
18
+ # mlx-community/Llama3-ChatQA-1.5-8B-8bit
19
+ This model was converted to MLX format from [`mlx-community/Llama3-ChatQA-1.5-8B`]() using mlx-lm version **0.12.0**.
20
+
21
+ Model added by [Prince Canuma](https://twitter.com/Prince_Canuma).
22
+
23
+ Refer to the [original model card](https://huggingface.co/nvidia/Llama3-ChatQA-1.5-8B) for more details on the model.
24
+ ## Use with mlx
25
+
26
+ ```bash
27
+ pip install mlx-lm
28
+ ```
29
+
30
+ ```python
31
+ from mlx_lm import load, generate
32
+
33
+ model, tokenizer = load("mlx-community/Llama3-ChatQA-1.5-8B-8bit")
34
+ response = generate(model, tokenizer, prompt="hello", verbose=True)
35
+ ```