How to use Tiiny/ReluLLaMA-7B-PowerInfer-GGUF with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("Tiiny/ReluLLaMA-7B-PowerInfer-GGUF", dtype="auto")