mattshumer
/

mistral-8x7b-chat

Text Generation

text-generation-inference

Model card Files Files and versions

mattshumer commited on Dec 10, 2023

Commit

9f0ab4a

·

1 Parent(s): fe94c77

Create README.md

Files changed (1) hide show

README.md +27 -0

README.md ADDED Viewed

	@@ -0,0 +1,27 @@

+A very capable chat model built on top of the new Mistral MoE model, trained on the SlimOrca dataset for 1 epoch, using QLoRA.
+Inference:
+```
+import torch
+from transformers import AutoModelForCausalLM, AutoTokenizer
+model = AutoModelForCausalLM.from_pretrained("mattshumer/mistral-8x7b-chat", low_cpu_mem_usage=True, device_map="auto", trust_remote_code=True)
+tok = AutoTokenizer.from_pretrained("mattshumer/mistral-8x7b-chat")
+x = tok.encode(PROMPT_GOES_HERE, return_tensors="pt").cuda()
+x = model.generate(x, max_new_tokens=512).cpu()
+print(tok.batch_decode(x))
+```
+Prompt Template:
+```
+<|im_start|>system
+You are an AI assistant.<|im_end|>
+<|im_start|>user
+Hi, how are you?<|im_end|>
+<|im_start|>assistant
+I'm doing well, thanks for asking!<|im_end|>
+<|im_start|>user
+Write me a poem about AI.<|im_end|>
+```
+Trained w/ Axolotl on 6x H100s for nine hours.