Effective Video Mirror Detection with Inconsistent Motion Cues

Warren, Alex; Xu, Ke; Lin, Jiaying; Tam, Gary; Lau, Rynson

doi:10.1109/cvpr52733.2024.01632

Conference Paper/Proceeding/Abstract 1037 views 275 downloads

Effective Video Mirror Detection with Inconsistent Motion Cues

Alex Warren, Ke Xu, Jiaying Lin, Gary Tam

, Rynson Lau

2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Volume: 2024, Pages: 17244 - 17252

Swansea University Authors: Alex Warren, Gary Tam , Rynson Lau

PDF | Accepted Manuscript

Author accepted manuscript document released under the terms of a Creative Commons CC-BY licence using the Swansea University Research Publications Policy (rights retention).
Download (5.79MB)

Check full text

DOI (Published version): 10.1109/cvpr52733.2024.01632

Abstract

Image-based mirror detection has recently undergone rapid research due to its significance in applications such as robotic navigation, semantic segmentation and scene re-construction. Recently, VMD-Net was proposed as the first video mirror detection technique, by modeling dual correspondences betwe...

Full description

Published in:	2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
ISBN:	979-8-3503-5301-3 979-8-3503-5300-6
ISSN:	1063-6919 2575-7075
Published:	IEEE 2024
Online Access:	Check full text
URI:	https://cronfa.swan.ac.uk/Record/cronfa65886

first_indexed	2024-04-15T13:43:24Z
last_indexed	2025-02-21T09:19:44Z
id	cronfa65886
recordtype	SURis
fullrecord	<?xml version="1.0"?><rfc1807><datestamp>2025-02-20T15:25:26.8930249</datestamp><bib-version>v2</bib-version><id>65886</id><entry>2024-03-23</entry><title>Effective Video Mirror Detection with Inconsistent Motion Cues</title><swanseaauthors><author><sid>38cd1eebf16295dbe5e1ff6769d6af69</sid><firstname>Alex</firstname><surname>Warren</surname><name>Alex Warren</name><active>true</active><ethesisStudent>false</ethesisStudent></author><author><sid>e75a68e11a20e5f1da94ee6e28ff5e76</sid><ORCID>0000-0001-7387-5180</ORCID><firstname>Gary</firstname><surname>Tam</surname><name>Gary Tam</name><active>true</active><ethesisStudent>false</ethesisStudent></author><author><sid>8d230434b6eadb1be5928241b0beecd0</sid><firstname>Rynson</firstname><surname>Lau</surname><name>Rynson Lau</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2024-03-23</date><deptcode>MACS</deptcode><abstract>Image-based mirror detection has recently undergone rapid research due to its significance in applications such as robotic navigation, semantic segmentation and scene re-construction. Recently, VMD-Net was proposed as the first video mirror detection technique, by modeling dual correspondences between the inside and outside of the mirror both spatially and temporally. However, this approach is not reliable, as correspondences can occur completely inside or outside of the mirrors. In addition, the proposed dataset VMD-D contains many small mirrors, limiting its applicability to real-world scenarios. To address these problems, we developed a more challenging dataset that includes mirrors of various shapes and sizes at different locations of the frames, providing a better reflection of real-world scenarios. Next, we observed that the motions between the inside and outside of the mirror are often in-consistent. For instance, when moving in front of a mirror, the motion inside the mirror is often much smaller than the motion outside due to increased depth perception. With these observations, we propose modeling inconsistent motion cues to detect mirrors, and a new network with two novel modules. The Motion Attention Module (MAM) ex-plicitly models inconsistent motions around mirrors via optical flow, and the Motion-Guided Edge Detection Module (MEDM) uses motions to guide mirror edge feature learning. Experimental results on our proposed dataset show that our method outperforms state-of-the-arts. The code and dataset are available at ht tps: // gi th ub. com/ AlexAnthonyWarren/MG-VMD.</abstract><type>Conference Paper/Proceeding/Abstract</type><journal>2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)</journal><volume>2024</volume><journalNumber/><paginationStart>17244</paginationStart><paginationEnd>17252</paginationEnd><publisher>IEEE</publisher><placeOfPublication/><isbnPrint>979-8-3503-5301-3</isbnPrint><isbnElectronic>979-8-3503-5300-6</isbnElectronic><issnPrint>1063-6919</issnPrint><issnElectronic>2575-7075</issnElectronic><keywords>Representation learning, Shape, Image edge detection, Semantic segmentation, Object detection, Reflection, Pattern recognition</keywords><publishedDay>16</publishedDay><publishedMonth>9</publishedMonth><publishedYear>2024</publishedYear><publishedDate>2024-09-16</publishedDate><doi>10.1109/cvpr52733.2024.01632</doi><url/><notes/><college>COLLEGE NANME</college><department>Mathematics and Computer Science School</department><CollegeCode>COLLEGE CODE</CollegeCode><DepartmentCode>MACS</DepartmentCode><institution>Swansea University</institution><apcterm>Not Required</apcterm><funders>Alex is supported by a Swansea GTA Research Scholarship. This project is in part supported by a GRF grant from the Research Grants Council of Hong Kong (Ref.: 11211223). We gratefully acknowledge the support of the HEFCW HERC fund (W21/21HE) for the provision of GPU equipment used in this research.</funders><projectreference>Alex is supported by a Swansea GTA Research Scholarship. This project is in part supported by a GRF grant from the Research Grants Council of Hong Kong (Ref.: 11211223). We gratefully acknowledge the support of the HEFCW HERC fund (W21/21HE) for the provision of GPU equipment used in this research.</projectreference><lastEdited>2025-02-20T15:25:26.8930249</lastEdited><Created>2024-03-23T18:38:09.9376188</Created><path><level id="1">Faculty of Science and Engineering</level><level id="2">School of Mathematics and Computer Science - Computer Science</level></path><authors><author><firstname>Alex</firstname><surname>Warren</surname><order>1</order></author><author><firstname>Ke</firstname><surname>Xu</surname><order>2</order></author><author><firstname>Jiaying</firstname><surname>Lin</surname><order>3</order></author><author><firstname>Gary</firstname><surname>Tam</surname><orcid>0000-0001-7387-5180</orcid><order>4</order></author><author><firstname>Rynson</firstname><surname>Lau</surname><order>5</order></author></authors><documents><document><filename>65886__29817__f739ba90ec0a4f189b62a73a302d042e.pdf</filename><originalFilename>cvpr2024_supp.pdf</originalFilename><uploaded>2024-03-25T10:03:41.6058758</uploaded><type>Output</type><contentLength>6074495</contentLength><contentType>application/pdf</contentType><version>Accepted Manuscript</version><cronfaStatus>true</cronfaStatus><documentNotes>Author accepted manuscript document released under the terms of a Creative Commons CC-BY licence using the Swansea University Research Publications Policy (rights retention).</documentNotes><copyrightCorrect>true</copyrightCorrect><language>eng</language><licence>https://creativecommons.org/licenses/by/4.0/deed.en</licence></document></documents><OutputDurs/></rfc1807>
spelling	2025-02-20T15:25:26.8930249 v2 65886 2024-03-23 Effective Video Mirror Detection with Inconsistent Motion Cues 38cd1eebf16295dbe5e1ff6769d6af69 Alex Warren Alex Warren true false e75a68e11a20e5f1da94ee6e28ff5e76 0000-0001-7387-5180 Gary Tam Gary Tam true false 8d230434b6eadb1be5928241b0beecd0 Rynson Lau Rynson Lau true false 2024-03-23 MACS Image-based mirror detection has recently undergone rapid research due to its significance in applications such as robotic navigation, semantic segmentation and scene re-construction. Recently, VMD-Net was proposed as the first video mirror detection technique, by modeling dual correspondences between the inside and outside of the mirror both spatially and temporally. However, this approach is not reliable, as correspondences can occur completely inside or outside of the mirrors. In addition, the proposed dataset VMD-D contains many small mirrors, limiting its applicability to real-world scenarios. To address these problems, we developed a more challenging dataset that includes mirrors of various shapes and sizes at different locations of the frames, providing a better reflection of real-world scenarios. Next, we observed that the motions between the inside and outside of the mirror are often in-consistent. For instance, when moving in front of a mirror, the motion inside the mirror is often much smaller than the motion outside due to increased depth perception. With these observations, we propose modeling inconsistent motion cues to detect mirrors, and a new network with two novel modules. The Motion Attention Module (MAM) ex-plicitly models inconsistent motions around mirrors via optical flow, and the Motion-Guided Edge Detection Module (MEDM) uses motions to guide mirror edge feature learning. Experimental results on our proposed dataset show that our method outperforms state-of-the-arts. The code and dataset are available at ht tps: // gi th ub. com/ AlexAnthonyWarren/MG-VMD. Conference Paper/Proceeding/Abstract 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024 17244 17252 IEEE 979-8-3503-5301-3 979-8-3503-5300-6 1063-6919 2575-7075 Representation learning, Shape, Image edge detection, Semantic segmentation, Object detection, Reflection, Pattern recognition 16 9 2024 2024-09-16 10.1109/cvpr52733.2024.01632 COLLEGE NANME Mathematics and Computer Science School COLLEGE CODE MACS Swansea University Not Required Alex is supported by a Swansea GTA Research Scholarship. This project is in part supported by a GRF grant from the Research Grants Council of Hong Kong (Ref.: 11211223). We gratefully acknowledge the support of the HEFCW HERC fund (W21/21HE) for the provision of GPU equipment used in this research. Alex is supported by a Swansea GTA Research Scholarship. This project is in part supported by a GRF grant from the Research Grants Council of Hong Kong (Ref.: 11211223). We gratefully acknowledge the support of the HEFCW HERC fund (W21/21HE) for the provision of GPU equipment used in this research. 2025-02-20T15:25:26.8930249 2024-03-23T18:38:09.9376188 Faculty of Science and Engineering School of Mathematics and Computer Science - Computer Science Alex Warren 1 Ke Xu 2 Jiaying Lin 3 Gary Tam 0000-0001-7387-5180 4 Rynson Lau 5 65886__29817__f739ba90ec0a4f189b62a73a302d042e.pdf cvpr2024_supp.pdf 2024-03-25T10:03:41.6058758 Output 6074495 application/pdf Accepted Manuscript true Author accepted manuscript document released under the terms of a Creative Commons CC-BY licence using the Swansea University Research Publications Policy (rights retention). true eng https://creativecommons.org/licenses/by/4.0/deed.en
title	Effective Video Mirror Detection with Inconsistent Motion Cues
spellingShingle	Effective Video Mirror Detection with Inconsistent Motion Cues Alex Warren Gary Tam Rynson Lau
title_short	Effective Video Mirror Detection with Inconsistent Motion Cues
title_full	Effective Video Mirror Detection with Inconsistent Motion Cues
title_fullStr	Effective Video Mirror Detection with Inconsistent Motion Cues
title_full_unstemmed	Effective Video Mirror Detection with Inconsistent Motion Cues
title_sort	Effective Video Mirror Detection with Inconsistent Motion Cues
author_id_str_mv	38cd1eebf16295dbe5e1ff6769d6af69 e75a68e11a20e5f1da94ee6e28ff5e76 8d230434b6eadb1be5928241b0beecd0
author_id_fullname_str_mv	38cd1eebf16295dbe5e1ff6769d6af69_*_Alex Warren e75a68e11a20e5f1da94ee6e28ff5e76__Gary Tam 8d230434b6eadb1be5928241b0beecd0_**_Rynson Lau
author	Alex Warren Gary Tam Rynson Lau
author2	Alex Warren Ke Xu Jiaying Lin Gary Tam Rynson Lau
format	Conference Paper/Proceeding/Abstract
container_title	2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
container_volume	2024
container_start_page	17244
publishDate	2024
institution	Swansea University
isbn	979-8-3503-5301-3 979-8-3503-5300-6
issn	1063-6919 2575-7075
doi_str_mv	10.1109/cvpr52733.2024.01632
publisher	IEEE
college_str	Faculty of Science and Engineering
hierarchytype
hierarchy_top_id	facultyofscienceandengineering
hierarchy_top_title	Faculty of Science and Engineering
hierarchy_parent_id	facultyofscienceandengineering
hierarchy_parent_title	Faculty of Science and Engineering
department_str	School of Mathematics and Computer Science - Computer Science{{{_:::_}}}Faculty of Science and Engineering{{{_:::_}}}School of Mathematics and Computer Science - Computer Science
document_store_str	1
active_str	0
description	Image-based mirror detection has recently undergone rapid research due to its significance in applications such as robotic navigation, semantic segmentation and scene re-construction. Recently, VMD-Net was proposed as the first video mirror detection technique, by modeling dual correspondences between the inside and outside of the mirror both spatially and temporally. However, this approach is not reliable, as correspondences can occur completely inside or outside of the mirrors. In addition, the proposed dataset VMD-D contains many small mirrors, limiting its applicability to real-world scenarios. To address these problems, we developed a more challenging dataset that includes mirrors of various shapes and sizes at different locations of the frames, providing a better reflection of real-world scenarios. Next, we observed that the motions between the inside and outside of the mirror are often in-consistent. For instance, when moving in front of a mirror, the motion inside the mirror is often much smaller than the motion outside due to increased depth perception. With these observations, we propose modeling inconsistent motion cues to detect mirrors, and a new network with two novel modules. The Motion Attention Module (MAM) ex-plicitly models inconsistent motions around mirrors via optical flow, and the Motion-Guided Edge Detection Module (MEDM) uses motions to guide mirror edge feature learning. Experimental results on our proposed dataset show that our method outperforms state-of-the-arts. The code and dataset are available at ht tps: // gi th ub. com/ AlexAnthonyWarren/MG-VMD.
published_date	2024-09-16T05:11:11Z
_version_	1859522313516482560
score	11.099629

Effective Video Mirror Detection with Inconsistent Motion Cues

Similar Items