incidelen's picture
Update README.md
911ddad verified
metadata
datasets:
  - maydogan/Turkish_SentimentAnalysis_TRSAv1
language:
  - tr
metrics:
  - accuracy
  - precision
  - recall
  - f1
base_model:
  - dbmdz/bert-base-turkish-128k-cased
pipeline_tag: text-classification
tags:
  - Turkish Sentiment Analysis

🇹🇷 BERTurk for Turkish Sentiment Analysis

This model is a fine-tuned version of BERTurk 128k on the TRSAv1 dataset, a labeled collection of Turkish e-commerce reviews categorized into positive, neutral, and negative sentiments. For more details about the dataset, methodology, and experiments, you can refer to the corresponding research paper.


How to Use

You can use the model directly with 🤗 Transformers:

from transformers import pipeline

classifier = pipeline("text-classification", model="incidelen/bert-base-turkish-sentiment-analysis-128k-cased")
result = classifier("Ürün çok kaliteli, paketleme harikaydı. Kesinlikle tavsiye ederim!")
print(result)

Citation

If you use this model in your research or application, please cite the following paper:

@article{incidelen15sentiment,
  title={Sentiment Analysis in Turkish Using Language Models: A Comparative Study},
  author={{\.I}ncidelen, Mert and Aydo{\u{g}}an, Murat},
  journal={European Journal of Technique (EJT)},
  volume={15},
  number={1},
  pages={68--74},
  publisher={Hibetullah KILI{\c{C}}}
}

Dataset Overview

The TRSAv1 dataset includes 150,000 Turkish product reviews from e-commerce platforms. It is balanced across three sentiment classes:

Sentiment Count
Negative 50,000
Neutral 50,000
Positive 50,000
TOTAL 150,000

Evaluation Results

Overall Performance

Accuracy (%) Precision (%) Recall (%) F1 Score (%)
83.68 83.69 83.68 83.66

Class-wise Performance

Sentiment Precision (%) Recall (%) F1 Score (%)
Negative 88.35 85.20 86.74
Neutral 77.01 76.45 76.73
Positive 85.70 89.38 87.50

Acknowledgments

Special thanks to maydogan for their contributions and support.