Model Card for Model ID
A fine-tuned Llama-3.1-8B-Instruct model specialized for converting regional/dialectal Bangla speech (Chittagonian, Sylheti, Barishali, etc.) into clean, standard written Bangla. It takes noisy or dialect-heavy Bangla transcriptions as input and reliably outputs grammatically correct, formal Standard Bangla while preserving the original meaning โ perfect for post-processing ASR outputs from regional Bangladeshi speech.
Model Details
This model is a fine-tuned version of the LLaMA 3.1 8B Instruct architecture, optimized specifically for Bangladeshi regional dialect to Standard Bangla text normalization. The training was carried out using the Unsloth framework, which allows efficient LoRA fine-tuning on low-memory hardware such as the T4 GPU available on Kaggle. The model was trained on a dataset consisting of regional Bangla expressions paired with their correct Standard Bangla equivalents. Through careful preprocessing, filtering, and normalization of the dataset, the model was able to learn consistent linguistic patterns across dialects. The training process included a standard trainโvalidation split and continual monitoring of the validation loss to select the best-performing checkpoint. The resulting model is compact, memory-efficient, and highly effective at handling dialect variations while maintaining fluency and correctness in Standard Bangla.
Model Description
This model converts Bangladeshi regional dialect text into Standard Bangla with high accuracy, making it ideal for ASR post-processing, dialect normalization, and text standardization tasks. It is capable of transforming colloquial, informal, or region-specific Bangla phrases into clear, grammatically correct Standard Bangla, even when the input contains fragmented or ambiguous forms. The training process used an instruction-tuned approach in which each sample included a user prompt and the expected normalized response, allowing the model to learn the structure of conversational corrections and editing tasks. By employing QLoRA during training, the model maintains strong performance while requiring significantly less GPU memory, enabling smooth experimentation on the Kaggle free tier environment. The final model generalizes well to unseen dialect inputs and integrates easily into existing NLP pipelines, model hosting solutions, and competition submission workflows
- Developed by: [More Information Needed]
- Funded by [optional]: [More Information Needed]
- Shared by [optional]: [More Information Needed]
- Model type: [More Information Needed]
- Language(s) (NLP): [More Information Needed]
- License: [More Information Needed]
- Finetuned from model [optional]: [More Information Needed]
Model Sources [optional]
- Repository: [More Information Needed]
- Paper [optional]: [More Information Needed]
- Demo [optional]: [More Information Needed]
Uses
Direct Use
[More Information Needed]
Downstream Use [optional]
[More Information Needed]
Out-of-Scope Use
[More Information Needed]
Bias, Risks, and Limitations
[More Information Needed]
Recommendations
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
How to Get Started with the Model
Use the code below to get started with the model.
[More Information Needed]
Training Details
Training Data
[More Information Needed]
Training Procedure
Preprocessing [optional]
[More Information Needed]
Training Hyperparameters
- Training regime: [More Information Needed]
Speeds, Sizes, Times [optional]
[More Information Needed]
Evaluation
Testing Data, Factors & Metrics
Testing Data
[More Information Needed]
Factors
[More Information Needed]
Metrics
[More Information Needed]
Results
[More Information Needed]
Summary
Model Examination [optional]
[More Information Needed]
Environmental Impact
Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).
- Hardware Type: [More Information Needed]
- Hours used: [More Information Needed]
- Cloud Provider: [More Information Needed]
- Compute Region: [More Information Needed]
- Carbon Emitted: [More Information Needed]
Technical Specifications [optional]
Model Architecture and Objective
[More Information Needed]
Compute Infrastructure
[More Information Needed]
Hardware
[More Information Needed]
Software
[More Information Needed]
Citation [optional]
BibTeX:
[More Information Needed]
APA:
[More Information Needed]
Glossary [optional]
[More Information Needed]
More Information [optional]
[More Information Needed]
Model Card Authors [optional]
[More Information Needed]
Model Card Contact
[More Information Needed]
Framework versions
- PEFT 0.16.0
- Downloads last month
- 65