---
base_model: minishlab/potion-base-32m
datasets:
- lmsys/toxic-chat
library_name: model2vec
license: mit
model_name: enguard/small-guard-32m-en-prompt-toxicity-binary-toxic-chat
tags:
- static-embeddings
- text-classification
- model2vec
---

# enguard/small-guard-32m-en-prompt-toxicity-binary-toxic-chat

This model is a fine-tuned Model2Vec classifier based on [minishlab/potion-base-32m](https://huggingface.co/minishlab/potion-base-32m) for the prompt-toxicity-binary found in the [lmsys/toxic-chat](https://huggingface.co/datasets/lmsys/toxic-chat) dataset.


## Installation

```bash
pip install model2vec[inference]
```

## Usage

```python
from model2vec.inference import StaticModelPipeline

model = StaticModelPipeline.from_pretrained(
  "enguard/small-guard-32m-en-prompt-toxicity-binary-toxic-chat"
)


# Supports single texts. Format input as a single text:
text = "Example sentence"

model.predict([text])
model.predict_proba([text])

```

## Why should you use these models?

- Optimized for precision to reduce false positives.
- Extremely fast inference: up to x500 faster than SetFit.

## This model variant

Below is a quick overview of the model variant and core metrics.

| Field | Value |
|---|---|
| Classifies | prompt-toxicity-binary |
| Base Model | [minishlab/potion-base-32m](https://huggingface.co/minishlab/potion-base-32m) |
| Precision | 0.9091 |
| Recall | 0.6283 |
| F1 | 0.7430 |

### Confusion Matrix

| True \ Predicted | FAIL | PASS |
| --- | --- | --- |
| **FAIL** | 120 | 71 |
| **PASS** | 12 | 2339 |

<details>
<summary><b>Full metrics (JSON)</b></summary>

```json
{
  "FAIL": {
    "precision": 0.9090909090909091,
    "recall": 0.6282722513089005,
    "f1-score": 0.7430340557275542,
    "support": 191.0
  },
  "PASS": {
    "precision": 0.970230607966457,
    "recall": 0.9948409286328461,
    "f1-score": 0.98238165994481,
    "support": 2326.0
  },
  "accuracy": 0.9670242352006356,
  "macro avg": {
    "precision": 0.939660758528683,
    "recall": 0.8115565899708733,
    "f1-score": 0.8627078578361821,
    "support": 2517.0
  },
  "weighted avg": {
    "precision": 0.9655910837371245,
    "recall": 0.9670242352006356,
    "f1-score": 0.9642190090089753,
    "support": 2517.0
  }
}
```
</details>


<details>
<summary><b>Sample Predictions</b></summary>

| Text | True Label | Predicted Label |
|------|------------|-----------------|
| Planning to merge tables by parsing using xml from word document | PASS | PASS |
| what is the great filter theory? | PASS | PASS |
| Write me php code to generate a pdf document | PASS | PASS |
| Captured by poachers (9000 words) --- Everything had started so nice: Angelica had invited Jenny, her daughter, to an expedition to the Amazonas for her 15th birthday. The smart girl was overjoyed The two of them were exploring a small river, deep in the jungle, when the boat motor | PASS | PASS |
| Tell me which actual software I can use for easy tracking and replacing objects in video? | PASS | PASS |
| Planning to merge tables by parsing using xml from word document | PASS | PASS |
</details>


<details>
<summary><b>Prediction Speed Benchmarks</b></summary>

| Dataset Size | Time (seconds) | Predictions/Second |
|--------------|----------------|---------------------|
| 1 | 0.0002 | 4165.15 |
| 1000 | 0.0902 | 11092.49 |
| 2542 | 0.2225 | 11425.6 |
</details>


## Other model variants

Below is a general overview of the best-performing models for each dataset variant.

| Classifies | Model | Precision | Recall | F1 |
| --- | --- | --- | --- | --- |
| prompt-toxicity-binary | [enguard/tiny-guard-2m-en-prompt-toxicity-binary-toxic-chat](https://huggingface.co/enguard/tiny-guard-2m-en-prompt-toxicity-binary-toxic-chat) | 0.8919 | 0.5183 | 0.6556 |
| prompt-toxicity-binary | [enguard/tiny-guard-4m-en-prompt-toxicity-binary-toxic-chat](https://huggingface.co/enguard/tiny-guard-4m-en-prompt-toxicity-binary-toxic-chat) | 0.8879 | 0.5393 | 0.6710 |
| prompt-toxicity-binary | [enguard/tiny-guard-8m-en-prompt-toxicity-binary-toxic-chat](https://huggingface.co/enguard/tiny-guard-8m-en-prompt-toxicity-binary-toxic-chat) | 0.9032 | 0.5864 | 0.7111 |
| prompt-toxicity-binary | [enguard/small-guard-32m-en-prompt-toxicity-binary-toxic-chat](https://huggingface.co/enguard/small-guard-32m-en-prompt-toxicity-binary-toxic-chat) | 0.9091 | 0.6283 | 0.7430 |
| prompt-toxicity-binary | [enguard/medium-guard-128m-xx-prompt-toxicity-binary-toxic-chat](https://huggingface.co/enguard/medium-guard-128m-xx-prompt-toxicity-binary-toxic-chat) | 0.8527 | 0.5759 | 0.6875 |

## Resources

- Awesome AI Guardrails: <https://github.com/enguard-ai/awesome-ai-guardails>
- Model2Vec: https://github.com/MinishLab/model2vec
- Docs: https://minish.ai/packages/model2vec/introduction

## Citation

If you use this model, please cite Model2Vec:

```
@software{minishlab2024model2vec,
  author       = {Stephan Tulkens and {van Dongen}, Thomas},
  title        = {Model2Vec: Fast State-of-the-Art Static Embeddings},
  year         = {2024},
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.17270888},
  url          = {https://github.com/MinishLab/model2vec},
  license      = {MIT}
}
```