### Prerequisites

To use Zamba2-7B-instruct, install `transformers`:

`pip install transformers`

To install the dependencies necessary to run the Mamba2 kernels, install `mamba-ssm` from source (due to compatibility issues with PyTorch) as well as `causal-conv1d`:

1. `git clone https://github.com/state-spaces/mamba.git`
2. `cd mamba && git checkout v2.1.0 && pip install .`
3. `pip install causal-conv1d`
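
After the steps above, a quick sanity check is to confirm that both kernel packages are importable. This sketch uses only the standard library and the installed module names (`mamba_ssm`, `causal_conv1d`):

```python
import importlib.util

# Importable module names corresponding to the packages installed above.
for module in ("mamba_ssm", "causal_conv1d"):
    found = importlib.util.find_spec(module) is not None
    print(f"{module}: {'found' if found else 'MISSING'}")
```

If either module prints `MISSING`, revisit the corresponding install step before loading the model.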
You can run the model without using the optimized Mamba2 kernels, but it is **not** recommended as it will result in significantly higher latency and memory usage.
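
If you do fall back to the unoptimized path, it is better to do so explicitly than to rely on silent degradation. A minimal sketch, assuming the Hugging Face implementation exposes a `use_mamba_kernels` config flag (the flag name is an assumption here; verify it against your installed `transformers` version):

```python
import importlib.util

def mamba_kernels_available() -> bool:
    """True only if both optional kernel packages are importable."""
    return all(
        importlib.util.find_spec(m) is not None
        for m in ("mamba_ssm", "causal_conv1d")
    )

# `use_mamba_kernels` is a hypothetical flag name -- check the model config
# of your installed `transformers` version before relying on it.
load_kwargs = {} if mamba_kernels_available() else {"use_mamba_kernels": False}
print(load_kwargs)
# e.g. model = AutoModelForCausalLM.from_pretrained(
#     "Zyphra/Zamba2-7B-instruct", **load_kwargs)
```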

### Inference