Wrong values using webgpu on q8

#18

by alien79 - opened Sep 21

Sep 21

I've tried the q8 version and I've seen that when usingwebgpu device, result is different than wasm
this doesn't happens with fp32

is it a known limitation? what's the problem?
(I didn't tested other quantized versions)

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment