net/publication/342515660
All content following this page was uploaded by Honey Gupta on 28 June 2020.
Visual-range RGB camera
Thermal camera
● Existing guided super-resolution (GSR) methods assume pixel-to-pixel alignment between the two images
● Novel CNN model with dense-blocks and self-attention modules for GSR
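To see why the pixel-to-pixel alignment assumption matters, here is a minimal toy sketch, not taken from the slides. It models disparity between the two cameras as a horizontal shift of one image row and measures how far the guidance row drifts from the thermal row. The names `shift_right` and `mean_abs_diff`, and the 8-pixel rows, are illustrative assumptions.

```python
def shift_right(row, d):
    """Shift a row right by d >= 0 pixels, padding with the left edge value."""
    d = min(d, len(row))
    return [row[0]] * d + list(row[: len(row) - d])


def mean_abs_diff(a, b):
    """Mean absolute per-pixel difference between two equal-length rows."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


# Toy scene (assumption): a single edge at index 4, seen by both cameras.
thermal_row = [0, 0, 0, 0, 1, 1, 1, 1]

# Aligned guide: the edge sits at the same pixel as in the thermal row.
aligned_guide = list(thermal_row)

# Unaligned guide: a disparity of 2 pixels moves the edge to index 6.
shifted_guide = shift_right(thermal_row, 2)

print(mean_abs_diff(thermal_row, aligned_guide))  # 0.0
print(mean_abs_diff(thermal_row, shifted_guide))  # 0.25
```

With the shifted guide, the edge the network is supposed to borrow from the RGB image no longer coincides with the thermal edge, which is the situation unaligned GSR has to handle.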
Proposed network architecture (without alignment correction)
UGSR-Base: implicit alignment in feature-space; misalignment in feature-space due to disparity
UGSR-FA: high-resolution guidance image; alignment in feature-space
UGSR-ME: high-resolution guidance image; alignment in input-space
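As a rough illustration of alignment in input-space, the guidance image can be resampled with a per-pixel translation before it enters the network. The sketch below is an assumption about the general idea, not the authors' implementation. `warp_row` is a hypothetical helper that uses integer shifts and clamps at the borders.

```python
def warp_row(row, shifts):
    """Resample a row: output pixel i reads input pixel i + shifts[i], clamped."""
    n = len(row)
    return [row[min(max(i + s, 0), n - 1)] for i, s in enumerate(shifts)]


# Guidance row whose edge is displaced 2 pixels to the right (toy data).
guide_row = [0, 0, 0, 0, 0, 0, 1, 1]

# Hypothetical translation map: every pixel reads from 2 pixels to its right.
shifts = [2] * len(guide_row)

aligned = warp_row(guide_row, shifts)
print(aligned)  # [0, 0, 0, 0, 1, 1, 1, 1]
```

In principle the same resampling could be applied per channel to an intermediate feature map instead of the input image, which is one way to picture alignment in feature-space.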
● We propose a misalignment estimation block
○ Classification approach
○ Aim: optimal translation-map estimation
● End-to-end optimisation
Misalignment-estimation block
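The classification view of misalignment estimation can be sketched as follows: each pixel gets one score per candidate shift, and the translation map is read off from those scores. This is a minimal sketch under stated assumptions. The candidate set `CANDIDATE_SHIFTS`, the toy scores, and the soft (probability-weighted) variant are all illustrative and not taken from the slides.

```python
import math

CANDIDATE_SHIFTS = [-2, -1, 0, 1, 2]  # hypothetical shift classes, in pixels


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def hard_shift(scores):
    """Pick the shift of the highest-scoring class (argmax)."""
    best = max(range(len(scores)), key=lambda i: scores[i])
    return CANDIDATE_SHIFTS[best]


def soft_shift(scores):
    """Probability-weighted shift; a smooth function of the scores."""
    probs = softmax(scores)
    return sum(p * s for p, s in zip(probs, CANDIDATE_SHIFTS))


def translation_map(score_map):
    """Turn per-pixel class scores into a per-pixel translation map."""
    return [[hard_shift(px) for px in row] for row in score_map]


# Toy 1x3 "image": each pixel carries one score per candidate shift.
score_map = [[
    [0.1, 0.2, 3.0, 0.2, 0.1],  # favours shift 0
    [0.0, 0.0, 0.0, 4.0, 0.0],  # favours shift +1
    [5.0, 0.0, 0.0, 0.0, 0.0],  # favours shift -2
]]

tmap = translation_map(score_map)
print(tmap)  # [[0, 1, -2]]
```

The soft variant is included because a smooth output is one common way to keep such a block trainable end to end. Whether the slides' model uses it is not stated.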
Results: comparison on the CATS dataset
● To the best of our knowledge, ours is the first attempt towards unaligned GSR
● Of our two strategies, aligning in feature-space works better than aligning in input-space.
Please post your questions on the Q&A feed at the CCD workshop page of the
CVPR website.
Thank you!