Update README.md
README.md CHANGED
@@ -7,4 +7,5 @@ This model is a TensorFlow port of ViT B-16 [1] trained with recipes from [2]. I
 ## References
 
 [1] An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale: https://arxiv.org/abs/2010.11929
+
 [2] How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers: https://arxiv.org/abs/2106.10270