Update README.md
README.md CHANGED

@@ -89,8 +89,14 @@ pip install flash-attn
 ```
 Then load the model in transformers:
 ```python
->>> from transformers import AutoTokenizer
+>>> from transformers import AutoModelForCausalLM, AutoTokenizer
 >>> tokenizer = AutoTokenizer.from_pretrained("MediaTek-Research/Breeze-7B-32k-Instruct-v1_0/")
+>>> model = AutoModelForCausalLM.from_pretrained(
+    "MediaTek-Research/Breeze-7B-Instruct-v0_1",
+    device_map="auto",
+    torch_dtype=torch.bfloat16,
+    attn_implementation="flash_attention_2"
+)
 >>> chat = [
 ... {"role": "user", "content": "你好,請問你可以完成什麼任務?"},
 ... {"role": "assistant", "content": "你好,我可以幫助您解決各種問題、提供資訊和協助您完成許多不同的任務。例如:回答技術問題、提供建議、翻譯文字、尋找資料或協助您安排行程等。請告訴我如何能幫助您。"},
@@ -105,6 +111,7 @@ Then load the model in transformers:
 ```
 
 
+
 ## Citation
 
 ```
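Two things to note about the added snippet: it uses `torch.bfloat16` without importing `torch`, and it loads the model from `MediaTek-Research/Breeze-7B-Instruct-v0_1` while the tokenizer comes from `MediaTek-Research/Breeze-7B-32k-Instruct-v1_0`. Below is a minimal, self-contained sketch of the updated example as it would actually run, assuming the 32k checkpoint is the intended ID for both, and assuming the standard transformers chat flow (`apply_chat_template` plus `generate`), which the diff's truncated context does not show:

```python
# Sketch only: assumes the 32k checkpoint for both tokenizer and model,
# and adds the `import torch` that the diff omits.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "MediaTek-Research/Breeze-7B-32k-Instruct-v1_0"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",                        # requires the `accelerate` package
    torch_dtype=torch.bfloat16,
    attn_implementation="flash_attention_2",  # requires `flash-attn` (installed above)
)

chat = [
    # "Hello, what tasks can you help me with?"
    {"role": "user", "content": "你好,請問你可以完成什麼任務?"},
]

# Render the chat with the model's built-in template and generate a reply.
input_ids = tokenizer.apply_chat_template(
    chat, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(input_ids, max_new_tokens=128)  # token budget is illustrative
print(tokenizer.decode(output[0], skip_special_tokens=True))
```

The generation step is not part of this commit; it is the generic transformers pattern and may differ from what the full README shows beyond the diff context.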