ananthu-aniraj
/

pdiscoformer_pimagenet_seg_k_50

Image Classification

model_hub_mixin

pytorch_model_hub_mixin

Model card Files Files and versions

ananthu-aniraj commited on Sep 25, 2024

Commit

39b1a14

·

verified ·

1 Parent(s): 832f2c7

Update README.md

Files changed (1) hide show

README.md +25 -3

README.md CHANGED Viewed

@@ -3,8 +3,30 @@ pipeline_tag: image-classification
 tags:
 - model_hub_mixin
 - pytorch_model_hub_mixin
 ---
-This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
-- Library: https://github.com/ananthu-aniraj/pdiscoformer
-- Docs: [More Information Needed]

 tags:
 - model_hub_mixin
 - pytorch_model_hub_mixin
+- image-classification
+license: mit
+language:
+- en
+base_model:
+- timm/vit_base_patch14_reg4_dinov2.lvd142m
 ---
+# PdiscoFormer PartImageNet Seg Model (K=50)
+PdiscoFormer (Vit-base-dinov2-reg4) trained on PartImageNet Seg with K (number of unsupervised parts to discover) set to a value of 50.
+PdiscoFormer is a novel method for unsupervised part discovery using self-supervised Vision Transformers which achieves state-of-the-art results for this task, both qualitatively and quantitatively. The code can be found in the following repository: https://github.com/ananthu-aniraj/pdiscoformer
+# BibTex entry and citation info
+```
+@misc{aniraj2024pdiscoformerrelaxingdiscoveryconstraints,
+      title={PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers},
+      author={Ananthu Aniraj and Cassio F. Dantas and Dino Ienco and Diego Marcos},
+      year={2024},
+      eprint={2407.04538},
+      archivePrefix={arXiv},
+      primaryClass={cs.CV},
+      url={https://arxiv.org/abs/2407.04538},
+}