| Class | Description |
|---|---|
| ArabicDocumentReaderAndWriter |
Reads newline delimited UTF-8 Arabic sentences with or without
gold segmentation markers.
|
| ArabicDocumentReaderAndWriter.RewrittenArabicAnnotation | |
| ArabicSegmenter |
Arabic word segmentation model based on conditional random fields (CRF).
|
| ArabicSegmenterFeatureFactory<IN extends CoreLabel> |
Feature factory for the IOB clitic segmentation model described by
Green and DeNero (2012).
|
| ArabicTokenizer<T extends HasWord> |
Tokenizer for UTF-8 Arabic.
|
| ArabicTokenizer.ArabicTokenizerFactory<T extends HasWord> | |
| IOBUtils |
A class for converting strings to input suitable for processing by
an IOB sequence model.
|
| StartAndEndArabicSegmenterFeatureFactory<IN extends CoreLabel> |
Feature factory for the IOB clitic segmentation model described by
Green and DeNero (2012).
|