mohanrj/mybully-hatebert-manual-hitl

abusive-language

mohanrj committed on Oct 3

Commit 3030d0c · verified · 1 Parent(s): a71aaea

Create README.md

Files changed (1)

README.md +43 -0

README.md ADDED

	@@ -0,0 +1,43 @@

+---
+language:
+- ms
+tags:
+- hate-speech
+- abusive-language
+- malay
+- classification
+license: mit
+datasets:
+- mohanrj/MYBully
+metrics:
+- accuracy
+- f1
+base_model:
+- mesolitica/roberta-base-bahasa-cased
+---
+# MYBully-HateBERT (Manual + HITL)
+## Model Overview
+This model is **MYBully-HateBERT**, fine-tuned on the MYBully dataset using both **manual and human-in-the-loop (HITL) annotations**.
+It is intended to capture more nuanced hateful and offensive patterns than the baseline trained on manual annotations only.
+## Intended Use
+- Detecting hate speech and abusive language in Bahasa Malaysia tweets.
+## Training Data
+- **Dataset:** MYBully (Bahasa Malaysia tweets).
+- **Annotation:** Manual + HITL
+## Model Details
+- **Base model:** mesolitica/roberta-base-bahasa-cased
+- **Fine-tuning:** Binary classification head
+- **Labels:** Hate Speech (1), Non-Hate Speech (0)
+## Performance
+| Metric | Value |
+|--------|-------|
+| Accuracy | 0.85 |
+| Precision | 0.75 |
+| Recall | 0.86 |
+| F1 | 0.81 |
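
As a quick sanity check on the Performance table, the sketch below recomputes F1 from the reported precision and recall. It assumes the table reports binary (positive-class) metrics, so that F1 is the harmonic mean of precision and recall, and that each value was rounded to two decimals; neither assumption is stated in the README. The helper name `f1_from` is illustrative only.

```python
def f1_from(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (the standard binary F1)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values as reported in the Performance table (two-decimal rounding assumed).
reported_precision = 0.75
reported_recall = 0.86
reported_f1 = 0.81

# F1 computed directly from the rounded precision and recall.
f1_point = f1_from(reported_precision, reported_recall)

# Assumption: the unrounded precision and recall lie within +/-0.005 of the
# reported values. F1 increases with both inputs, so the corners of that box
# bound the F1 values compatible with the table.
half_step = 0.005
f1_low = f1_from(reported_precision - half_step, reported_recall - half_step)
f1_high = f1_from(reported_precision + half_step, reported_recall + half_step)

print(f"F1 from rounded P/R: {f1_point:.4f}")
print(f"Compatible F1 range: {f1_low:.4f} to {f1_high:.4f}")
print(f"Reported F1:         {reported_f1:.2f}")
```

The rounded inputs give an F1 of about 0.801, while the compatible range extends just past 0.805, so the reported 0.81 is plausible under rounding alone. If the table instead uses macro or weighted averaging, this identity does not apply and the check is not meaningful.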