Model Loading Error
I'm running your code in Colab but getting this error: Error loading models: .to is not supported for 4-bit or 8-bit bitsandbytes models. Please use the model as it is, since the model has already been set to the correct devices and casted to the correct dtype.
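For anyone searching for this error: here is a minimal sketch of the pattern that triggers it, assuming a bitsandbytes NF4 load via transformers. This is not the thread's actual script; the checkpoint name is taken from the log posted below, and the rest is illustrative.

```python
# Minimal sketch of the failure mode (not the thread's actual script).
# Assumes transformers, bitsandbytes, and accelerate are installed.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

nf4_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "unsloth/Meta-Llama-3.1-8B-bnb-4bit",  # checkpoint from the log below
    quantization_config=nf4_config,
    device_map="auto",  # accelerate already places the quantized weights
)

# This line is the culprit: transformers raises
# ".to is not supported for 4-bit or 8-bit bitsandbytes models."
model.to("cuda")
```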
Thanks for the report. I wonder if the library's specifications have changed...
I don't know the direct cause, so I've tried removing the suspicious parts for now.
@John6666
The error still seems to be occurring. Could you please check?
Running on cuda
Loading in NF4
Loading CLIP
Loading VLM's custom vision model
Loading tokenizer
Loading LLM: unsloth/Meta-Llama-3.1-8B-bnb-4bit
Error loading models: .to is not supported for 4-bit or 8-bit bitsandbytes models. Please use the model as it is, since the model has already been set to the correct devices and casted to the correct dtype.
It seems that accelerate and bitsandbytes were conflicting, probably because of the accelerate version upgrade in Colab. I think the change I just made will fix this.
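For anyone hitting the same thing, the general shape of such a fix is to skip the explicit device move when the model is quantized. A minimal sketch, assuming transformers still sets the is_loaded_in_4bit / is_loaded_in_8bit flags on bitsandbytes-loaded models:

```python
# Sketch of the workaround: only move unquantized models. A
# bitsandbytes-quantized model is already on the right device,
# so calling .to() on it raises the error above.
# is_loaded_in_4bit / is_loaded_in_8bit are flags transformers sets
# on bnb-loaded models; verify they exist in your version.
is_quantized = getattr(model, "is_loaded_in_4bit", False) or getattr(
    model, "is_loaded_in_8bit", False
)
if not is_quantized:
    model = model.to("cuda")
```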
https://github.com/OpenBMB/MiniCPM-o/issues/379
Thank you, it works properly now!