Peiqi Chen1* · Lei Yu2* · Yi Wan1† Yingying Pei1 · Xinyi Liu1 · Yongxiang Yao1
Yingying Zhang2 · Lixiang Ru2 · Liheng Zhong2 · Jingdong Chen2 · Ming Yang2 · Yongjun Zhang1†
1Wuhan University 2Ant Group
*Equal contribution †Corresponding author
CasP decomposes the matching stage into two progressive layers, with the former layer providing the one-to-many priors that constrain the search range of the latter.
Here are the results as Area Under the Curve (AUC) of the relative pose error at 10 degree:
| Method | Fine-tuned | RGB-Infrared | RGB-Depth | RGB-Normal | RGB-Event | RGB-Sketch | RGB-Paint | |----------------|------------|--------------|-----------|------------|-----------|------------|-----------| | LoFTR | ❌ | 12.58 | 0.44 | 12.07 | 12.43 | 54.82 | 12.22 | | ELoFTR | ❌ | 14.59 | 0.79 | 21.67 | 20.39 | 61.09 | 25.11 | | CasP | ❌ | **22.53** | **1.20** | **30.25** | **35.51** | **62.92** | **39.70** | | MINIMA_LoFTR | ✅ | 32.36 | 28.81 | 44.26 | 32.74 | 53.54 | 15.45 | | MINIMA_ELoFTR | ✅ | 26.36 | 32.26 | 47.47 | 30.72 | 59.63 | 27.02 | | MINIMA_CasP | ✅ | **43.87** | **40.55** | **53.64** | **40.06** | **60.30** | **40.76** |Here are visualizations of image registration across unseen modalities, produced by our model fine-tuned on MINIMA:
![]() Optical-Point Cloud |
![]() Optical-SAR |
![]() Optical-Vector Map |