---
tags:
  - sentence-transformers
  - text-retrieval
  - sentence-similarity
  - feature-extraction
  - semantic-search
  - amharic
  - text-embedding-inference
  - transformers

pipeline_tag: sentence-similarity
library_name: sentence-transformers
license: mit
metrics:
  - cosine_accuracy
model-index:
  - name: SentenceTransformer
    results:
      - task:
          type: triplet
          name: Triplet
        dataset:
          name: TestTripletEvaluator
          type: TestTripletEvaluator
        metrics:
          - type: cosine_accuracy
            value: 0.875
            name: Cosine Accuracy
---

# SentenceTransformer Fine-Tuned for Amharic Retrieval

This model is a [sentence-transformers](https://www.sbert.net) model fine-tuned on Amharic question-answering (QA) triplets. It maps sentences and paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

## Model Details

- **Model Type:** Sentence Transformer
- **Base Model:** `sentence-transformers/paraphrase-xlm-r-multilingual-v1`
- **Training Loss:** Triplet loss wrapped in a Matryoshka loss
- **Language:** Amharic
- **Maximum Sequence Length:** 128 tokens
- **Output Dimensionality:** 768 dimensions
- **Similarity Function:** Cosine Similarity
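
The sequence length and output dimensionality above can be checked directly on a loaded model; a quick sanity check, using the model id from the Usage section below:

```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("abdulmunimjemal/xlm-r-retrieval-am-v5")
print(model.max_seq_length)                      # 128
print(model.get_sentence_embedding_dimension())  # 768
```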

## Training Overview

- **Training Data:** Custom Amharic QA triplets (each anchor paired with a positive and a negative example)
- **Training Strategy:**  
  The model was fine-tuned with a triplet loss wrapped in a Matryoshka loss, with evaluation performed using a `TripletEvaluator` (see the sketch after the hyperparameter list below).
- **Hyperparameters:**  
  - Epochs: 3  
  - Batch Size: 16  
  - Learning Rate: 1e-6  
  - Warmup Ratio: 0.08  
  - Weight Decay: 0.05
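
Below is a minimal sketch of this setup using the `sentence-transformers` v3 trainer API. The dataset contents and the Matryoshka dimension list are illustrative assumptions, not the card's confirmed training script:

```python
from datasets import Dataset
from sentence_transformers import (
    SentenceTransformer,
    SentenceTransformerTrainer,
    SentenceTransformerTrainingArguments,
)
from sentence_transformers.losses import MatryoshkaLoss, TripletLoss

model = SentenceTransformer("sentence-transformers/paraphrase-xlm-r-multilingual-v1")

# Hypothetical triplet dataset with the standard (anchor, positive, negative) columns.
train_dataset = Dataset.from_dict({
    "anchor": ["αˆ°αˆ›α‹­ αˆαŠ• αŠ α‹­αŠα‰΅ α‰€αˆˆαˆ αŠα‹?"],
    "positive": ["αˆ°αˆ›α‹­ αˆ°αˆ›α‹«α‹Š α‰€αˆˆαˆ αŠ αˆˆα‹α’"],
    "negative": ["αŠ₯αŠ” ምሳ αŠ₯αŠ•αŒ€αˆ« α‰ αˆ‹αˆα’"],
})

# Triplet loss wrapped in a Matryoshka loss, so truncated embedding prefixes
# (e.g. the first 256 dimensions) remain useful. The dimension list is a common
# default, assumed here rather than taken from the original run.
loss = MatryoshkaLoss(model, TripletLoss(model), matryoshka_dims=[768, 512, 256, 128, 64])

# Hyperparameters from the card above.
args = SentenceTransformerTrainingArguments(
    output_dir="xlm-r-retrieval-am",
    num_train_epochs=3,
    per_device_train_batch_size=16,
    learning_rate=1e-6,
    warmup_ratio=0.08,
    weight_decay=0.05,
)

trainer = SentenceTransformerTrainer(model=model, args=args, train_dataset=train_dataset, loss=loss)
trainer.train()
```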

## Evaluation

The model was evaluated on a held-out test set using cosine similarity as the metric:

| Metric              | Value  |
|---------------------|--------|
| **Cosine Accuracy** | 0.875  |
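
The score can be reproduced with the same evaluator class; a minimal sketch, assuming `model` is loaded as in the Usage section and with placeholder triplets standing in for the real held-out set:

```python
from sentence_transformers.evaluation import TripletEvaluator

evaluator = TripletEvaluator(
    anchors=["αˆ°αˆ›α‹­ αˆαŠ• αŠ α‹­αŠα‰΅ α‰€αˆˆαˆ αŠα‹?"],  # queries
    positives=["αˆ°αˆ›α‹­ αˆ°αˆ›α‹«α‹Š α‰€αˆˆαˆ αŠ αˆˆα‹α’"],   # matching answers
    negatives=["αŠ₯αŠ” ምሳ αŠ₯αŠ•αŒ€αˆ« α‰ αˆ‹αˆα’"],      # non-matching answers
    name="TestTripletEvaluator",
)
results = evaluator(model)
print(results)  # e.g. {"TestTripletEvaluator_cosine_accuracy": 0.875}
```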

## Usage

To use the model in your own project:

1. **Install Sentence Transformers:**

   ```bash
   pip install -U sentence-transformers
   ```

2. **Load the Model:**

   ```python
   from sentence_transformers import SentenceTransformer

   model = SentenceTransformer("abdulmunimjemal/xlm-r-retrieval-am-v5")
   sentences = [
       "αˆ°αˆ›α‹­ αˆαŠ• αŠ α‹­αŠα‰΅ α‰€αˆˆαˆ αŠα‹?",   # "What color is the sky?"
       "αˆ°αˆ›α‹­ αˆ°αˆ›α‹«α‹Š α‰€αˆˆαˆ αŠ αˆˆα‹α’",     # "The sky has a sky-blue color."
       "αŠ₯αŠ” ምሳ αŠ₯αŠ•αŒ€αˆ« α‰ αˆ‹αˆα’",        # "I ate injera for lunch."
       "α‰£αˆ•αˆ­ αˆαŠ• αŠ α‹­αŠα‰΅ α‰€αˆˆαˆ αŠα‹?",   # "What color is the sea?"
       "αŠ α‹¨αˆ­ α‰ αˆα‹΅αˆ­ α‹™αˆͺα‹« α‹«αˆˆ αŠα‹α’"    # "Air is all around the earth."
   ]
   embeddings = model.encode(sentences)
   print(embeddings.shape)  # Expected output: (5, 768)
   ```

3. **Compute Similarity:**

   ```python
   from sklearn.metrics.pairwise import cosine_similarity
   similarities = cosine_similarity(embeddings, embeddings)
   print(similarities.shape)  # Expected output: (5, 5)
   ```
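
   Alternatively, models loaded with Sentence Transformers 3.x expose a built-in similarity helper, so scikit-learn is not required:

   ```python
   similarities = model.similarity(embeddings, embeddings)
   print(similarities.shape)  # torch.Size([5, 5])
   ```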

## Model Architecture

Below is an outline of the model architecture:

```
SentenceTransformer(
  (0): Transformer({'max_seq_length': 128, 'do_lower_case': False}) with Transformer model: XLMRobertaModel 
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_mean_tokens': True, ...})
)
```
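
The Pooling module computes a mask-aware mean over the transformer's token embeddings. A minimal sketch of the equivalent computation (shapes assumed from the 768-dimension configuration above):

```python
import torch

def mean_pool(token_embeddings: torch.Tensor, attention_mask: torch.Tensor) -> torch.Tensor:
    """Average token embeddings, ignoring padding positions."""
    mask = attention_mask.unsqueeze(-1).float()    # (batch, seq_len, 1)
    summed = (token_embeddings * mask).sum(dim=1)  # sum over real tokens only
    counts = mask.sum(dim=1).clamp(min=1e-9)       # avoid division by zero
    return summed / counts                         # (batch, 768)
```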

## Training Environment

- **Python:** 3.11.11
- **Sentence Transformers:** 3.3.1
- **Transformers:** 4.47.1
- **PyTorch:** 2.5.1+cu124
- **Accelerate:** 1.2.1
- **Datasets:** 3.2.0
- **Tokenizers:** 0.21.0

## Citation

If you use this model in your research, please cite it appropriately.

```bibtex
@misc{jemal2025amharicretrieval,
  title = {SentenceTransformer Fine-Tuned for Amharic Retrieval},
  author = {Abdulmunim J. Jemal},
  year = {2025},
  howpublished = {Hugging Face Model Hub, \url{https://huggingface.co/abdulmunimjemal/xlm-r-retrieval-am-v5}}
}
```