How to accelerate inference speed

#22

by tobywang - opened Nov 7, 2023

Nov 7, 2023

Are there any frameworks that can accelerate inference for this model?

Jan 7, 2024

Feb 1, 2024

Hello, does vLLM work for you? I tried vLLM, but the generation quality degraded and the model simply output repetitive words.
