Image Classification
timm
PyTorch
Safetensors
pit
vision-transformer
transformer
gravitational-lensing
strong-lensing
astronomy
astrophysics
Eval Results (legacy)
Instructions to use parlange/pit-gravit-c2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- timm
How to use parlange/pit-gravit-c2 with timm:
import timm model = timm.create_model("hf_hub:parlange/pit-gravit-c2", pretrained=True) - Notebooks
- Google Colab
- Kaggle
| license: apache-2.0 | |
| tags: | |
| - image-classification | |
| - pytorch | |
| - timm | |
| - pit | |
| - vision-transformer | |
| - transformer | |
| - gravitational-lensing | |
| - strong-lensing | |
| - astronomy | |
| - astrophysics | |
| datasets: | |
| - parlange/gravit-c21-j24 | |
| metrics: | |
| - accuracy | |
| - auc | |
| - f1 | |
| paper: | |
| - title: "GraViT: A Gravitational Lens Discovery Toolkit with Vision Transformers" | |
| url: "https://arxiv.org/abs/2509.00226" | |
| authors: "Parlange et al." | |
| model-index: | |
| - name: PiT-c2 | |
| results: | |
| - task: | |
| type: image-classification | |
| name: Strong Gravitational Lens Discovery | |
| dataset: | |
| type: common-test-sample | |
| name: Common Test Sample (More et al. 2024) | |
| metrics: | |
| - type: accuracy | |
| value: 0.8074 | |
| name: Average Accuracy | |
| - type: auc | |
| value: 0.8443 | |
| name: Average AUC-ROC | |
| - type: f1 | |
| value: 0.5437 | |
| name: Average F1-Score | |
| # 🌌 pit-gravit-c2 | |
| 🔭 This model is part of **GraViT**: Transfer Learning with Vision Transformers and MLP-Mixer for Strong Gravitational Lens Discovery | |
| 🔗 **GitHub Repository**: [https://github.com/parlange/gravit](https://github.com/parlange/gravit) | |
| ## 🛰️ Model Details | |
| - **🤖 Model Type**: PiT | |
| - **🧪 Experiment**: C2 - C21+J24-half | |
| - **🌌 Dataset**: C21+J24 | |
| - **🪐 Fine-tuning Strategy**: half | |
| ## 💻 Quick Start | |
| ```python | |
| import torch | |
| import timm | |
| # Load the model directly from the Hub | |
| model = timm.create_model( | |
| 'hf-hub:parlange/pit-gravit-c2', | |
| pretrained=True | |
| ) | |
| model.eval() | |
| # Example inference | |
| dummy_input = torch.randn(1, 3, 224, 224) | |
| with torch.no_grad(): | |
| output = model(dummy_input) | |
| predictions = torch.softmax(output, dim=1) | |
| print(f"Lens probability: {predictions[0][1]:.4f}") | |
| ``` | |
| ## ⚡️ Training Configuration | |
| **Training Dataset:** C21+J24 (Cañameras et al. 2021 + Jaelani et al. 2024) | |
| **Fine-tuning Strategy:** half | |
| | 🔧 Parameter | 📝 Value | | |
| |--------------|----------| | |
| | Batch Size | 192 | | |
| | Learning Rate | AdamW with ReduceLROnPlateau | | |
| | Epochs | 100 | | |
| | Patience | 10 | | |
| | Optimizer | AdamW | | |
| | Scheduler | ReduceLROnPlateau | | |
| | Image Size | 224x224 | | |
| | Fine Tune Mode | half | | |
| | Stochastic Depth Probability | 0.1 | | |
| ## 📈 Training Curves | |
|  | |
| ## 🏁 Final Epoch Training Metrics | |
| | Metric | Training | Validation | | |
| |:---------:|:-----------:|:-------------:| | |
| | 📉 Loss | 0.1768 | 0.1241 | | |
| | 🎯 Accuracy | 0.9266 | 0.9534 | | |
| | 📊 AUC-ROC | 0.9810 | 0.9902 | | |
| | ⚖️ F1 Score | 0.9263 | 0.9533 | | |
| ## ☑️ Evaluation Results | |
| ### ROC Curves and Confusion Matrices | |
| Performance across all test datasets (a through l) in the Common Test Sample (More et al. 2024): | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
| ### 📋 Performance Summary | |
| Average performance across 12 test datasets from the Common Test Sample (More et al. 2024): | |
| | Metric | Value | | |
| |-----------|----------| | |
| | 🎯 Average Accuracy | 0.8074 | | |
| | 📈 Average AUC-ROC | 0.8443 | | |
| | ⚖️ Average F1-Score | 0.5437 | | |
| ## 📘 Citation | |
| If you use this model in your research, please cite: | |
| ```bibtex | |
| @misc{parlange2025gravit, | |
| title={GraViT: Transfer Learning with Vision Transformers and MLP-Mixer for Strong Gravitational Lens Discovery}, | |
| author={René Parlange and Juan C. Cuevas-Tello and Octavio Valenzuela and Omar de J. Cabrera-Rosas and Tomás Verdugo and Anupreeta More and Anton T. Jaelani}, | |
| year={2025}, | |
| eprint={2509.00226}, | |
| archivePrefix={arXiv}, | |
| primaryClass={cs.CV}, | |
| url={https://arxiv.org/abs/2509.00226}, | |
| } | |
| ``` | |
| --- | |
| ## Model Card Contact | |
| For questions about this model, please contact the author through: https://github.com/parlange/ | |