Akan POS Tagger (spaCy model)
This is a custom-trained part-of-speech (POS) tagging model for the Akan language, built with spaCy.
Installation and Usage
Prerequisites
pip install spacy huggingface_hub
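To confirm that both prerequisites are importable before loading the model, a small check like the one below can help. It is a minimal sketch using only the standard library; `missing_packages` is a hypothetical helper name, not part of spaCy or `huggingface_hub`.

```python
from importlib.util import find_spec


def missing_packages(names):
    """Return the names that cannot be found as importable top-level modules."""
    return [name for name in names if find_spec(name) is None]


# Import names for the two packages installed above.
missing = missing_packages(["spacy", "huggingface_hub"])
if missing:
    print("Missing packages: " + ", ".join(missing))
else:
    print("All prerequisites are importable.")
```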
Loading and Using the Model
from huggingface_hub import snapshot_download
import spacy
# Download the model from Hugging Face Hub
model_path = snapshot_download(repo_id="michsethowusu/akan-pos-tagger")
# Load the model
nlp = spacy.load(model_path)
# Use the model for POS tagging
doc = nlp("bosom som nyɛ")
for token in doc:
    print(f"{token.text} -> {token.tag_}")
Expected Output
bosom -> N
som -> V
nyɛ -> ADV
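For more than one sentence, spaCy's `nlp.pipe` processes texts as a stream of docs. The sketch below counts how often each tag occurs across such a stream. `tag_counts` is a hypothetical helper, not part of the model or of spaCy; it only assumes that each token exposes a `tag_` attribute. The stand-in tokens let the helper run without downloading the model.

```python
from collections import Counter
from types import SimpleNamespace


def tag_counts(docs):
    """Count fine-grained tags over an iterable of docs (iterables of tokens)."""
    counts = Counter()
    for doc in docs:
        for token in doc:
            counts[token.tag_] += 1
    return counts


# Stand-in for a tagged doc, reusing the tags from the expected output above.
# With the real model, pass `nlp.pipe(texts)` instead of this list.
sample_doc = [
    SimpleNamespace(text="bosom", tag_="N"),
    SimpleNamespace(text="som", tag_="V"),
    SimpleNamespace(text="nyɛ", tag_="ADV"),
]
print(tag_counts([sample_doc]).most_common())
```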
Complete POS Tags Reference
This model uses a custom POS tag set with over 100 tags, grouped below by category:
Adjectives
- ADJ – adjective
- ADJatt – attributive adjective
- ADJC – comparative adjective
- ADJpret – predicative adjective
- ADJS – superlative adjective
Adverbs
- ADV – adverb
- ADVdir – directional adverb
- ADVm – manner adverb
- ADVneg – negative operator
- ADVplc – place adverb
- ADVtemp – temporal adverb
Articles and Determiners
- ART – article
- DET – determiner
- DEM – demonstrative
Auxiliary and Copula
- AUX – auxiliary
- COP – copula
- COPident – identity copula
- COPloc – locative copula
- COPneg – negative copula
Numerals
- CARD – cardinal numeral (e.g. 4, sixty-five)
- NUM – numeral
- NUMpart – partitive numeral
- ORD – ordinal
Conjunctions
- CONJ – conjunction
- CONJC – coordinating conjunction (e.g. and, or)
- CONJS – subordinating conjunction (e.g. although, when)
- CONJSinf – subordinating conjunction (introducing an infinitive clause)
Nouns
- CN – common noun
- N – common noun
- Nbare – bare noun
- Ncomm – noun with common gender
- NFEM – feminine noun
- NMASC – masculine noun
- NNEUT – neuter noun
- NNO – noun neutral for number (e.g. data, aircraft)
- Np – proper noun
- Npinst – name of an institution
- Nploc – name of a location
- Npname – personal name
- Nrel – relational noun
- Nspat – spatial noun
Pronouns
- PN – personal pronoun
- PNabs – absolute pronoun (Bantu)
- PNana – pronominal anaphor
- PNdem – demonstrative pronoun
- PNposs – possessive pronoun
- PNrefl – reflexive pronoun
- PNrel – relative pronoun
- PROint – interrogative pronoun
- PROPana – propositional anaphor
Prepositions and Postpositions
- PREP – preposition
- PREPdir – directional preposition
- PREPplc – locative preposition
- PREPsel – selected preposition
- PREPtemp – temporal preposition
- PPOST – postposition
Particles
- PRT – particle
- PRTexist – existential marker
- PRTinf – infinitive marker
- PRTint – interrogative particle
- PRTn – nominal particle
- PRTneg – negative particle
- PRTposs – possessive particle
- PRTpred – predicative particle
- PRTprst – presentational particle
- PRTresp – response words such as "thanks, please, no, yes..."
- PRTv – verbal particle
Verbs
- V – verb
- V1 – first verb in a verbal chain
- V2 – second verb in a verbal chain
- V3 – third verb in a verbal chain
- V4 – fourth verb in a verbal chain
- Vbid – verb bid (Kwa)
- Vcon – converb
- Vdtr – ditransitive verb
- Vimprs – impersonal verb
- Vitr – intransitive verb
- VitrOBL – intransitive verb with a prepositional object
- Vlght – light verb
- Vmod – modal verb
- Vneg – negative verb
- Vpre – preverb
- Vrefl – reflexive verb
- Vtr – transitive verb
- VtrOBL – transitive verb with a prepositional object
- Vvector – vector verb
Other Categories
- CIRCP – circumposition
- CL – clitic
- CLF – classifier
- CLFnom – nominal classifier
- CLFnum – numeral classifier
- COMP – complementiser
- EXPL – expletive pronoun
- INTRJCT – interjection
- IPHON – ideophone, onomatopoeia
- MOD – modifier
- PTCP – participle
- QUANT – quantifier
- REL – relative clause marker
- TRUNC – truncation
- Wh – wh-word
Punctuation
- PUL – punctuation: left bracket (e.g. ( or [)
- PUN – punctuation: general separating mark (e.g. , ; . ! : ?)
- PUQ – punctuation: quotation marks (' or ")
- PUR – punctuation: right bracket (e.g. ) or ])
Special Categories
- NE – named entity
- XY – non-words such as "JU52"
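With this many fine-grained tags, downstream code often wants a coarser label. The sketch below maps each tag in the reference to the category it is listed under. The group names are assumptions derived from the headings in this reference, not labels the model emits, and `tag_group` is a hypothetical helper.

```python
# Tags copied from the reference; group names are illustrative only.
TAG_GROUPS = {
    "adjective": "ADJ ADJatt ADJC ADJpret ADJS",
    "adverb": "ADV ADVdir ADVm ADVneg ADVplc ADVtemp",
    "determiner": "ART DET DEM",
    "auxiliary_copula": "AUX COP COPident COPloc COPneg",
    "numeral": "CARD NUM NUMpart ORD",
    "conjunction": "CONJ CONJC CONJS CONJSinf",
    "noun": (
        "CN N Nbare Ncomm NFEM NMASC NNEUT NNO "
        "Np Npinst Nploc Npname Nrel Nspat"
    ),
    "pronoun": "PN PNabs PNana PNdem PNposs PNrefl PNrel PROint PROPana",
    "adposition": "PREP PREPdir PREPplc PREPsel PREPtemp PPOST",
    "particle": (
        "PRT PRTexist PRTinf PRTint PRTn PRTneg "
        "PRTposs PRTpred PRTprst PRTresp PRTv"
    ),
    "verb": (
        "V V1 V2 V3 V4 Vbid Vcon Vdtr Vimprs Vitr VitrOBL "
        "Vlght Vmod Vneg Vpre Vrefl Vtr VtrOBL Vvector"
    ),
    "other": (
        "CIRCP CL CLF CLFnom CLFnum COMP EXPL INTRJCT "
        "IPHON MOD PTCP QUANT REL TRUNC Wh"
    ),
    "punctuation": "PUL PUN PUQ PUR",
    "special": "NE XY",
}

# Invert the grouping into a flat tag -> group lookup table.
TAG_TO_GROUP = {
    tag: group for group, tags in TAG_GROUPS.items() for tag in tags.split()
}


def tag_group(tag):
    """Return the coarse group for a tag, or "unknown" if it is not listed."""
    return TAG_TO_GROUP.get(tag, "unknown")


print(tag_group("Vtr"), tag_group("PUN"), tag_group("not-a-tag"))
```

An explicit lookup table is used instead of prefix matching because several tags share prefixes across categories (for example `N`, `NUM` and `NE`).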
Technical Details
- Framework: spaCy
- Language: Akan (ak)
- Training Epoch: 1 (best-performing model)
- Tag Set: Custom 100+ tag dictionary based on Akan linguistic structures
- Model Type: Token classification for POS tagging
Performance
The model reaches a self-reported POS tagging accuracy of 0.850 on the Akan POS Corpus. Its custom tag set is intended to capture the morphological and syntactic features of the Akan language.
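This card does not say how accuracy was measured. A common definition for POS tagging is token-level accuracy: the share of tokens whose predicted tag equals the gold tag. The sketch below assumes that definition; `tagging_accuracy` is a hypothetical helper and the tag sequences are made-up illustrations, not evaluation data.

```python
def tagging_accuracy(gold_tags, predicted_tags):
    """Fraction of positions where the predicted tag equals the gold tag."""
    if len(gold_tags) != len(predicted_tags):
        raise ValueError("gold and predicted sequences must have the same length")
    if not gold_tags:
        raise ValueError("cannot compute accuracy on an empty sequence")
    correct = sum(g == p for g, p in zip(gold_tags, predicted_tags))
    return correct / len(gold_tags)


# Illustrative sequences only: 3 of the 4 tags match.
gold = ["N", "V", "ADV", "N"]
predicted = ["N", "V", "ADV", "V"]
print(tagging_accuracy(gold, predicted))  # 0.75
```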
Citation
If you use this model in your research, please cite:
@misc{akan-pos-tagger,
  title={Akan POS Tagger},
  author={michsethowusu},
  year={2025},
  url={https://huggingface.co/michsethowusu/akan-pos-tagger}
}
Evaluation results
- POS Tagging Accuracy on Akan POS Corpus (self-reported): 0.850