首页 | 官方网站   微博 | 高级检索  
     


Collaborative Distribution Alignment for 2D image-based 3D shape retrieval
Affiliation:1. NYU Multimedia and Visual Computing Lab, USA;2. Department of Computer Science and Engineering, NYU Tandon School of Engineering, New York, USA;3. Department of Electrical and Computer Engineering, NYU Tandon School of Engineering, New York, USA;4. Department of Electrical and Computer Engineering, New York University Abu Dhabi, Abu Dhabi, UAE;5. Depts. of Rehabilitation Medicine and Neurology, NYU Langone Medical Center, USA;1. State Key Laboratory of Communication Content Cognition, People''s Daily Online, Beijing 100733, China;2. Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei 230088, China;3. School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China;4. Baidu Inc., Beijing 100105, China;1. the school of microelectronics, Tianjin University, Tianjin 300072, China;2. the school of electrical and information engineering, Tianjin University, Tianjin 300072, China;3. Tianjin Navigation Instrument Research Institute, Tianjin 300131, China;4. Baidu Inc., Beijing 100105, China
Abstract:Retrieving 3D shapes with 2D images has become a popular research area nowadays, and a great deal of work has been devoted to reducing the discrepancy between 3D shapes and 2D images to improve retrieval performance. However, most approaches ignore the semantic information and decision boundaries of the two domains, and cannot achieve both domain alignment and category alignment in one module. In this paper, a novel Collaborative Distribution Alignment (CDA) model is developed to address the above existing challenges. Specifically, we first adopt a dual-stream CNN, following a similarity guided constraint module, to generate discriminative embeddings for input 2D images and 3D shapes (described as multiple views). Subsequently, we explicitly introduce a joint domain-class alignment module to dynamically learn a class-discriminative and domain-agnostic feature space, which can narrow the distance between 2D image and 3D shape instances of the same underlying category, while pushing apart the instances from different categories. Furthermore, we apply a decision boundary refinement module to avoid generating class-ambiguity embeddings by dynamically adjusting inconsistencies between two discriminators. Extensive experiments and evaluations on two challenging benchmarks, MI3DOR and MI3DOR-2, demonstrate the superiority of the proposed CDA method for 2D image-based 3D shape retrieval task.
Keywords:3D shape retrieval  Cross-domain retrieval  Multi-view learning
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号