RuntimeError: Expected all tensors to be on the same device
#3 opened by SekiroRong
Bravo for your great work! However, do you have any idea what causes this RuntimeError during inference after loading your Qint4 model (using load_quantized_hi3_m2)?
This is probably related to the official inference code and should not be related to the model itself.
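For anyone hitting the same error, one way to narrow it down is to list which device each parameter and buffer lives on before calling the inference code. The snippet below is only a rough sketch: `summarize_devices` and `check_single_device` are hypothetical helper names, not part of this repo or the official code, and it assumes the loaded object behaves like a `torch.nn.Module` (it exposes `named_parameters()` and `named_buffers()`).

```python
from collections import defaultdict


def summarize_devices(module):
    """Group parameter and buffer names by the device they live on.

    Assumption: `module` behaves like a torch.nn.Module, i.e. it exposes
    named_parameters() and named_buffers(), each yielding (name, tensor)
    pairs where every tensor has a .device attribute.
    """
    by_device = defaultdict(list)
    for name, tensor in module.named_parameters():
        by_device[str(tensor.device)].append(name)
    for name, tensor in module.named_buffers():
        by_device[str(tensor.device)].append(name)
    return dict(by_device)


def check_single_device(module):
    """Raise a readable error if the module is split across devices."""
    by_device = summarize_devices(module)
    if len(by_device) > 1:
        details = ", ".join(
            f"{dev}: {len(names)} tensors"
            for dev, names in sorted(by_device.items())
        )
        raise RuntimeError(f"module is split across devices ({details})")
    return by_device
```

If the report shows more than one device, or the inputs built by the inference script sit on a different device than the weights, that points to the calling code rather than the quantized checkpoint.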
It works, thank you!
wikeeyang changed discussion status to closed