Commit 8dc66f1
Parent(s): 67778f7
update int8 onnx model and readme
Signed-off-by: dujun <[email protected]>
- README.md +4 -2
- model.onnx +2 -2
README.md
CHANGED
@@ -50,12 +50,14 @@ This is an INT8 ONNX model quantized with [Intel® Neural Compressor](https://gi
 
 The original fp32 model comes from the fine-tuned model [Intel/bert-base-uncased-mrpc](https://huggingface.co/Intel/bert-base-uncased-mrpc).
 
+The calibration dataloader is the eval dataloader. The calibration sampling size is 100.
+
 #### Test result
 
 | |INT8|FP32|
 |---|:---:|:---:|
-| **Accuracy (eval-f1)** |0.
-| **Model size (MB)** |
+| **Accuracy (eval-f1)** |0.9021|0.9042|
+| **Model size (MB)** |236|418|
 
 
 #### Load ONNX model:
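The added README line says calibration uses the eval dataloader with a sampling size of 100. As a rough illustration of what a sampling size means in static post-training quantization, the sketch below reads only the first 100 samples of a dataset, tracks the observed activation range, and derives an affine INT8 scale and zero point from it. This is not Intel Neural Compressor's implementation; `calibrate_range`, `int8_scale_zero_point`, and the synthetic `eval_samples` are hypothetical names and data introduced only for this example.

```python
from itertools import islice


def calibrate_range(samples, sampling_size=100):
    """Return (min, max) over the first `sampling_size` samples only."""
    lo, hi = float("inf"), float("-inf")
    for activations in islice(samples, sampling_size):
        lo = min(lo, min(activations))
        hi = max(hi, max(activations))
    return lo, hi


def int8_scale_zero_point(lo, hi):
    """Affine mapping of [lo, hi] onto the signed INT8 range [-128, 127]."""
    # The representable range must contain zero.
    lo, hi = min(lo, 0.0), max(hi, 0.0)
    # Fall back to 1.0 when the observed range is empty (all zeros).
    scale = (hi - lo) / 255.0 or 1.0
    zero_point = round(-128 - lo / scale)
    return scale, zero_point


# Hypothetical stand-in for an eval set: 300 samples of two activations each.
eval_samples = [[i * 0.01 - 1.0, i * 0.01] for i in range(300)]

lo, hi = calibrate_range(eval_samples, sampling_size=100)
scale, zero_point = int8_scale_zero_point(lo, hi)
print(f"range=({lo:.2f}, {hi:.2f}) scale={scale:.6f} zero_point={zero_point}")
```

Samples past the first 100 never influence the range, which is why the choice of calibration data and sampling size can affect the accuracy of the quantized model.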
model.onnx
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:95d79d0083517ba6280f566add4624c6533eddf5e2d156d3bec64d0d049af637
+size 247160325
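As a quick sanity check on the numbers in this commit, the sketch below compares the byte count in the new Git LFS pointer with the "Model size (MB)" row of the README table, and works out the size ratio and F1 drop between INT8 and FP32. All values are copied from the diff above; the variable and function names are illustrative, and reading "MB" as 1024 × 1024 bytes is an assumption.

```python
# Values copied from the diff in this commit.
LFS_SIZE_BYTES = 247160325         # "size" line of the new model.onnx pointer
INT8_MB, FP32_MB = 236, 418        # "Model size (MB)" row
INT8_F1, FP32_F1 = 0.9021, 0.9042  # "Accuracy (eval-f1)" row


def bytes_to_mib(n_bytes):
    """Convert bytes to mebibytes (assumes the README's MB means 1024 * 1024 bytes)."""
    return n_bytes / (1024 * 1024)


size_mib = bytes_to_mib(LFS_SIZE_BYTES)
size_ratio = INT8_MB / FP32_MB          # INT8 size as a fraction of FP32 size
f1_drop = FP32_F1 - INT8_F1             # absolute F1 difference
relative_f1_drop = f1_drop / FP32_F1    # F1 difference relative to FP32

print(f"LFS pointer size: {size_mib:.2f} MiB")
print(f"INT8 / FP32 size: {size_ratio:.1%}")
print(f"F1 drop: {f1_drop:.4f} absolute, {relative_f1_drop:.2%} relative")
```

Under that assumption the pointer size comes to about 235.7 MiB, which rounds to the 236 reported in the table. The INT8 model is roughly 56% of the FP32 size, with an F1 difference of 0.0021.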