首页 | 官方网站   微博 | 高级检索  
     

基于循环一致性对抗网络的室内火焰图像场景迁移
引用本文:杨植凯,卜乐平,王腾,欧阳继能.基于循环一致性对抗网络的室内火焰图像场景迁移[J].光学精密工程,2020(3):745-758.
作者姓名:杨植凯  卜乐平  王腾  欧阳继能
作者单位:海军工程大学电气工程学院
基金项目:国家自然科学基金重点项目资助(No.41631072);国家自然科学基金面上项目资助(No.41671459)。
摘    要:基于深度学习的视频火灾探测模型的训练依赖于大量的正负样本数据,即火灾视频和带有干扰的场景视频。由于很多室内场合禁止点火,导致该场景下的火灾视频样本不足。本文基于生成对抗网络,将其他相似场景下录制的火焰迁移到指定场景,以此增广限制性场合下的火灾视频数据。文中提出将火焰内核预先植入场景使之具备完整的内容信息,再通过添加烟雾和地面反射等风格信息,完成场景与火焰的融合。该方法克服了现有多模态图像转换方法在图像转换过程中因丢失信息而造成的背景失真问题。同时为减少数据采集工作量,采用循环一致性生成对抗网络以解除训练图像必须严格匹配的限制。实验表明,与现有多模态图像转换相比,本文方法可以保证场景中火焰形态的多样性,迁移后的场景具有较高的视觉真实性,所得结果的FID与LPIPS值最小,分别为119.6和0.134 2。

关 键 词:图像转换  生成对抗网络  火焰图像合成  循环一致性生成对抗网络

Scenemigration of indoor flame image based on Cycle-Consistent adversarial networks
Affiliation:(School of Electrical Engineering,Naval University of Engineering,Wuhan 430033,China)
Abstract:The training of video fire detection models based on deep learning relies on a large number of positive and negative samples, namely, fire videos and scenario videos involving other disturbances similar to fires. In some instances, the fire video sampled from a scene is insufficient owing to the prohibition of ignition. In this thesis, it was proposed that the flames recorded in other similar scenarios be migrated into the specified scene to increase the data of the fire video in such restricted situations. To complete the content information, a flame kernel was previously implanted into the specified scene, and then style information, such as smoke and ground reflection, were added to fuse the scene and the flame. Our method eliminated the background distortion caused by the loss of information during image translation via the existing multimodal image translation. In addition, Cycle-Consistent adversarial networks were adopted to decrease the dataset quantity and remove the restriction of strict matching of the training images. Compared with other multimodal image-to-image translations, our method ensured that the fire in a scene was diverse, and the migration scene was more visually realistic. The minimum values of FID and LPIPS are achieved, which are 119.6 and 0.134 2, respectively.
Keywords:image translation  generative adversarial network  fire image synthesis  Cycle Generative Adversarial Network(CycleGAN)
本文献已被 CNKI 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号