Trying to 'Use this model', getting: ' The requested model 'HuggingFaceH4/zephyr-7b-beta' is not supported by any provider you have enabled'

#74
by OnurKerimoglu - opened

client.chat.completions.create returns:
BadRequestError: Error code: 400 - {'error': {'message': "The requested model 'HuggingFaceH4/zephyr-7b-beta' is not supported by any provider you have enabled.", 'type': 'invalid_request_error', 'param': 'model', 'code': 'model_not_supported'}}

Steps to reproduce:

  • Click on 'Use this model'
  • Choose Google Colab
  • scroll down to the 'Remote Inference via Inference Providers' section
  • Set the os.environ['HF_TOKEN'] with valid token
  • Run the cell below as-is, contents of which (as of writing this):
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://router.huggingface.co/v1",
    api_key=os.environ["HF_TOKEN"],
)

completion = client.chat.completions.create(
    model="HuggingFaceH4/zephyr-7b-beta",
    messages=[
        {
            "role": "user",
            "content": "What is the capital of France?"
        }
    ],
)

print(completion.choices[0].message)

I am having the exact same issue here

Sign up or log in to comment