Learning Mutual Modulation for Self-Supervised Cross-Modal Super-Resolution

التفاصيل البيبلوغرافية
العنوان: Learning Mutual Modulation for Self-Supervised Cross-Modal Super-Resolution
المؤلفون: Dong, Xiaoyu, Yokoya, Naoto, Wang, Longguang, Uezato, Tatsumi
بيانات النشر: arXiv, 2022.
سنة النشر: 2022
مصطلحات موضوعية: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
الوصف: Self-supervised cross-modal super-resolution (SR) can overcome the difficulty of acquiring paired training data, but is challenging because only low-resolution (LR) source and high-resolution (HR) guide images from different modalities are available. Existing methods utilize pseudo or weak supervision in LR space and thus deliver results that are blurry or not faithful to the source modality. To address this issue, we present a mutual modulation SR (MMSR) model, which tackles the task by a mutual modulation strategy, including a source-to-guide modulation and a guide-to-source modulation. In these modulations, we develop cross-domain adaptive filters to fully exploit cross-modal spatial dependency and help induce the source to emulate the resolution of the guide and induce the guide to mimic the modality characteristics of the source. Moreover, we adopt a cycle consistency constraint to train MMSR in a fully self-supervised manner. Experiments on various tasks demonstrate the state-of-the-art performance of our MMSR.
Comment: ECCV 2022
DOI: 10.48550/arxiv.2207.09156
الوصول الحر: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ec54a57ee745ce7cb86c1a238b17aaa8Test
حقوق: OPEN
رقم الانضمام: edsair.doi.dedup.....ec54a57ee745ce7cb86c1a238b17aaa8
قاعدة البيانات: OpenAIRE