Collection of State-of-the-art FP8 Block Quantized Models
NM Testing
company
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
482
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Static-Asym-e2e
1B
•
Updated
•
143
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Asym-e2e
1B
•
Updated
•
139
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-e2e
0.4B
•
Updated
•
149
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16_channel-e2e
0.4B
•
Updated
•
154
nm-testing/TinyLlama-1.1B-Chat-v1.0-w4a16-sym-awq-e2e
0.3B
•
Updated
•
146
nm-testing/TinyLlama-1.1B-Chat-v1.0-w4a16-asym-awq-e2e
0.3B
•
Updated
•
153
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-e2e
0.3B
•
Updated
•
166
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16_channel-e2e
0.3B
•
Updated
•
176
nm-testing/TinyLlama-1.1B-Chat-v1.0-actorder-weight-e2e
0.3B
•
Updated
•
162
nm-testing/TinyLlama-1.1B-Chat-v1.0-actorder-group-e2e
0.3B
•
Updated
•
314