Usage of GPU by abliterated model

by hdev27n - opened Jun 8

Jun 8

Hey!
Thanks to huihui-ai for creating and sharing all the abliterated models! :)

Beginner's question:
When I use the original gemma-3-27b, it runs on my GPU and is really fast.
When I use gemma-3-27b-it-abliterated, it runs almost entirely on my CPU and - guess what - it slows down a lot.
Is there a way to speed up the abliterated model by making it use the GPU?

I use Open WebUI with Ollama.

hughte

Jun 28

Came here to see if anyone else was experiencing this as well. It's nearly 4x the size of the OG gemma3:27b model, hence why it gets offloaded.
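
One way to confirm how much of a loaded model actually landed on the GPU is to ask the Ollama server directly. The sketch below is a minimal, hedged example: it assumes a local Ollama server on the default port 11434 and that its `/api/ps` endpoint reports `size` and `size_vram` in bytes for each loaded model. The helper names (`gpu_fraction`, `running_models`, `report`) are made up for illustration.

```python
import json
from urllib.request import urlopen


def gpu_fraction(model: dict) -> float:
    """Return the share of a loaded model that sits in VRAM (0.0 to 1.0)."""
    total = model.get("size", 0)
    in_vram = model.get("size_vram", 0)
    return in_vram / total if total else 0.0


def running_models(base_url: str = "http://localhost:11434") -> list:
    """Ask a local Ollama server which models are currently loaded.

    Assumption: the server is reachable at base_url and exposes /api/ps.
    """
    with urlopen(f"{base_url}/api/ps") as response:
        return json.load(response).get("models", [])


def report(models: list) -> list:
    """Format one summary line per loaded model."""
    return [f"{m.get('name', '?')}: {gpu_fraction(m):.0%} on GPU" for m in models]


# Usage, with the model already loaded in Ollama:
#   for line in report(running_models()):
#       print(line)
```

A value well below 100% means part of the model was offloaded to system RAM and the CPU. Running `ollama ps` in a terminal should show a similar CPU/GPU split.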

huihui-ai

Owner Jun 28

gemma3-abliterated:27b corresponds to gemma3-abliterated:27b-q8_0

Q4_K_M was not uploaded because its quality was not good enough.
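
To see why the quantization level matters for GPU offloading, here is a rough back-of-the-envelope sketch. The bits-per-weight figures are approximations (Q8_0 works out to 8.5 bits because each block of 32 int8 weights carries one fp16 scale; the Q4_K_M value is an assumed average), and the 24 GB card and 2 GB overhead are hypothetical numbers, not anyone's actual setup. It counts only the weights, not the context cache or any vision components.

```python
# Approximate bits per weight for a few GGUF quantization types.
# Q8_0: 34 bytes per block of 32 weights = 8.5 bits per weight.
# Q4_K_M: rough average, an assumption rather than a measured value.
BITS_PER_WEIGHT = {"f16": 16.0, "q8_0": 8.5, "q4_k_m": 4.85}


def estimated_weights_gb(params_billion: float, quant: str) -> float:
    """Estimate the size of the weights alone, in gigabytes (10^9 bytes)."""
    total_bits = params_billion * 1e9 * BITS_PER_WEIGHT[quant]
    return total_bits / 8 / 1e9


def fits_in_vram(params_billion: float, quant: str, vram_gb: float,
                 overhead_gb: float = 2.0) -> bool:
    """Check whether weights plus a hypothetical fixed overhead fit in VRAM."""
    return estimated_weights_gb(params_billion, quant) + overhead_gb <= vram_gb


if __name__ == "__main__":
    for quant in ("q8_0", "q4_k_m"):
        size = estimated_weights_gb(27, quant)
        print(f"27B at {quant}: ~{size:.1f} GB, fits in 24 GB: "
              f"{fits_in_vram(27, quant, 24)}")
```

Under these assumptions, a 27B model at q8_0 needs roughly 29 GB for the weights alone, so on a card with less VRAM than that Ollama has to place some layers on the CPU.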
