U-Net支气管超声弹性图像纵膈淋巴结分割 U-Net-based mediastinal lymph node segmentation method in bronchial ultrasound elastic images期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

U-Net支气管超声弹性图像纵膈淋巴结分割

引用本文：	刘羽,吴蓉蓉,唐璐,宋宁宁.U-Net支气管超声弹性图像纵膈淋巴结分割[J].中国图象图形学报,2022,27(10):3082-3091.

作者姓名：	刘羽吴蓉蓉唐璐宋宁宁

作者单位：	合肥工业大学生物医学工程系, 合肥 230009;徐州医科大学医学影像学院, 徐州 221004;南京市第一医院, 南京 210006

基金项目：	国家自然科学基金项目（82001912，62176081，61701160）

摘要：	目的超声弹性成像技术已逐步应用于支气管淋巴结良恶性的诊断，帮助确定肺癌分期。在支气管超声弹性图像中，淋巴结区域的精确定位对诊断准确度具有重要影响，但通常依赖专业医师的手动分割，费时费力。为此，本文设计了一种注意力上下文编码器网络（attention context encoder network，ACE-Net）。方法本文网络模型包括编码器、上下文提取器和解码器3部分。使用在ImageNet数据集上预训练且去掉平均池化层和全连接层的34层残差网络ResNet-34作为编码器提取特征，上下文提取器从编码器的输出中进一步提取高级语义信息，同时保留尽可能多的空间信息，基于AG （attention gate）的解码器可以抑制输入图像中的不相关区域，同时突出对当前任务更关键的特征。结果实验在本文收集的包含支气管超声弹性图像及对应分割标签的数据集上进行，与6种典型的U-Net结构深度网络模型的分割性能进行对比，数据集中的每幅图像中的淋巴结都由专业医师手动分割标注。基础U-Net网络得到淋巴结分割结果的Dice系数、敏感度和特异度分别为0.820 7、85.08%和96.82%，其他改进网络的分割性能在此基础上均有一定提高，本文方法的Dice系数、敏感度和特异度分别为0.845 1、87.92%和97.04%，Dice系数和敏感度在所有方法中取得了最优值，特异度取得了次优值。结论以U-Net为代表的深度学习模型在支气管超声弹性图像淋巴结分割问题中具有很大潜力，将上下文提取器和注意力机制融入U-Net网络可以一定程度提升分割精度。本文收集的数据集将有助于推动支气管超声弹性图像淋巴结分割问题的研究。
关键词：	超声弹性成像淋巴结分割深度学习 U-Net 上下文提取器注意力机制
收稿时间：	2021/3/24 0:00:00
修稿时间：	2021/4/19 0:00:00
U-Net-based mediastinal lymph node segmentation method in bronchial ultrasound elastic images

Liu Yu,Wu Rongrong,Tang Lu,Song Ningning.U-Net-based mediastinal lymph node segmentation method in bronchial ultrasound elastic images[J].Journal of Image and Graphics,2022,27(10):3082-3091.

Authors:	Liu Yu Wu Rongrong Tang Lu Song Ningning

Affiliation:	Department of Biomedical Engineering, Hefei University of Technology, Hefei 230009, China;School of Medical Imaging, Xuzhou Medical University, Xuzhou 221004, China; Nanjing First Hospital, Nanjing 210006, China

Abstract:	Objective Lung cancer threatens human health severely. It is one of the malignant tumors with the fastest increase in morbidity and mortality and the greatest threat to the life of the population. In the past 50 years, many countries have addressed that the incidence and mortality of lung cancer have increased significantly. The incidence and mortality of lung cancer rank the first among all the malignant tumors. Recent ultrasound elastography technology has been gradually applied to diagnose the benign and malignant bronchial lymph nodes to aid the degree analysis of lung cancer. Ultrasonic elastography provides more information than conventional two-dimensional ultrasound via the evaluation of lesion toughness. Color Doppler energy imaging superimposes the color coding system on the conventional ultrasound image. In general, the hardness of the diseased lymph node is relatively large, and the degree of deformation is small after being squeezed, which is represented as blue color in the elastic image. The normal lymph node is relatively soft, which is represented as red or green colors in the elastic image. Bronchial ultrasound elastography is generated through the squeezing deformation issues of the lymph nodes in related to the record of heartbeat, breathing movement and the pulsation of blood vessels around the lungs. In bronchial ultrasound elastic images, the precise positioning of the lymph node area is of great significance to the diagnosis accuracy of the disease. However, this kind of task is time-consuming and laborious due to its manual segmentation in clinical. We carried out the deep learning based automatic segmentation method of mediastinal lymph nodes in bronchial ultrasound elastic images via U-Net-type architectures. Method A dataset consisting of 205 bronchial ultrasound elastic images and corresponding segmentation labels is collected. The lymph nodes of each image are manually segmented and labeled. Based on this dataset, six classic deep network models based on U-Net are tested. The U-Net has an encoder-decoder structure. The encoder aims to capture more advanced semantic features and reduce the spatial dimension of the feature map gradually, while the decoder is used to restore spatial details and dimensions. We design a new U-Net-based bronchial ultrasound elastic image segmentation method based on the integration of context extractor and attention mechanism. To avoid gradient explosion and disappearance, the encoder is the ResNet-34 pretrained on ImageNet with no average pooling layer and the fully connected layer. The context extractor is used to extract high-level semantic information further from the output of the encoder while preserving as much spatial information as possible. The attention mechanism aims to select features that are more important to the current task. The prediction result of the segmentation network is the probability value of the pixel classification, so a binarization operation is performed by setting the threshold to 0.5. That is, the pixels are assigned to 0 if the probability value is less than 0.5 and otherwise is 1. In this way, the segmented binary image is obtained. Result To verify the performance of different networks, a five-fold cross validation evaluation is conducted on the dataset. That is, we divide the dataset into five equal parts in random, and four of them is as the training set each time and the remaining one is as the testing set. The preprocessing operations are related to data cropping, data augmentation and normalized operation on the training set. The input images and the ground-truth segmentation maps are resized to 320×320 pixels. Data augmentation approaches include random vertical flip and random angle rotation. The Adam optimizer is selected and the learning rate is set to 0.000 1. The batch size is set to 8. The number of epoch is 150. The GPU used in the experiment is GeForce RTX 2080Ti. The segmentation task is implemented using python3.7 under the Ubuntu16.04.1 operating system and the core framework is pytorch1.7.1. The results of Dice coefficient, sensitivity and specificity of U-Net network lymph node segmentation are 0.820 7, 85.08% and 96.82%, respectively. On this basis, the segmentation performances of other modified versions of U-Net are all improved to a certain extent. Among them, our Dice coefficient, sensitivity and specificity are 0.845 1, 87.92% and 97.04% of each, which are 0.024 4, 2.84% and 0.22% higher than the baseline U-Net, respectively. Compared to the other methods, the Dice coefficient and the sensitivity achieve the first place, while the specificity ranks the second. Conclusion Our analyses demonstrate that deep learning models represented by U-Net have great potential in the segmentation of mediastinal lymph nodes in bronchial ultrasound elastic images. Fused by the context extractor and attention mechanism, the integrated U-Net network can improve the segmentation accuracy to a certain extent. In addition, the illustrated dataset can promote the research of lymph node segmentation in bronchial ultrasound elastic images. Our method can be used for the segmentation of lymph nodes in bronchial ultrasound elastography images. It has potentials for the segmentation of more medical imaging organs and tumors as well. However, due to the relatively small scale of the dataset, there is still large room for further improvement on the segmentation performance, although the data augmentation approaches have been performed. To improve the segmentation accuracy further, it is required to increase the scale of the dataset in consistency.

Keywords:	ultrasound elastography lymph node segmentation deep learning U-Net context extractor attention mechanism

	点击此处可从《中国图象图形学报》浏览原始摘要信息
	点击此处可从《中国图象图形学报》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏