Update README.md
For detailed documentation, look here: https://github.com/AI4Bharat/indic-bart/
# Pre-training corpus
We used the <a href="https://indicnlp.ai4bharat.org/corpora/">IndicCorp</a> data spanning 12 languages with 452 million sentences (9 billion tokens). The model was trained using the text-infilling objective used in mBART.
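To make the objective concrete: text infilling corrupts the input by replacing contiguous spans of tokens with a single mask token, and the model learns to reconstruct the original sequence. Below is a minimal pure-Python sketch of the corruption step only; the mask probability, the Poisson span-length parameter, and the `<mask>` token string are illustrative choices, not the exact training configuration.

```python
import math
import random


def poisson_sample(rng, lam):
    """Sample from a Poisson distribution (Knuth's algorithm)."""
    threshold, k, p = math.exp(-lam), 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def text_infill(tokens, mask_token="<mask>", mask_prob=0.35, lam=3.5, seed=0):
    """Corrupt a token sequence in the style of mBART text infilling:
    each selected span (length drawn from Poisson(lam)) is replaced by a
    SINGLE mask token, so the model must also predict span lengths."""
    rng = random.Random(seed)
    out, i = [], 0
    while i < len(tokens):
        if rng.random() < mask_prob:
            span = poisson_sample(rng, lam)
            out.append(mask_token)
            # A span of length 0 inserts a bare mask without consuming input.
            i += span
        else:
            out.append(tokens[i])
            i += 1
    return out


tokens = "I am feeling very happy today because the model works well".split()
print(text_infill(tokens))
```

The decoder is then trained to emit the original, uncorrupted sentence from this masked input.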
# Usage:
# Fine-tuning on a downstream task
1. If you wish to fine-tune this model, you can do so using the toolkit <a href="https://github.com/prajdabre/yanmtt">YANMTT</a> following the instructions <a href="https://github.com/AI4Bharat/indic-bart">here</a>.
2. (Untested) Alternatively, you may use the official huggingface scripts for <a href="https://github.com/huggingface/transformers/tree/master/examples/pytorch/translation">translation</a> and <a href="https://github.com/huggingface/transformers/tree/master/examples/pytorch/summarization">summarization</a>.
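The official scripts in option 2 are driven entirely by command-line flags. A hypothetical invocation might look like the following; the data file, language codes, output path, and hyperparameters are placeholders, and (as the "Untested" caveat above suggests) IndicBART's custom language tokens may need additional handling before this works out of the box.

```shell
# Placeholder invocation of the Hugging Face translation example script;
# adjust paths, language codes, and hyperparameters for your setup.
python examples/pytorch/translation/run_translation.py \
  --model_name_or_path ai4bharat/IndicBART \
  --do_train \
  --train_file train.json \
  --source_lang hi \
  --target_lang en \
  --output_dir ./indicbart-finetuned \
  --per_device_train_batch_size 8 \
  --num_train_epochs 3 \
  --overwrite_output_dir
```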
# Contributors