首页 | 官方网站   微博 | 高级检索  
     

基于多特征属性相似的糖尿病早期预测方法
引用本文:乔瀚,容芷君,许莹,但斌斌,赵慧.基于多特征属性相似的糖尿病早期预测方法[J].科学技术与工程,2021,21(36):15497-15502.
作者姓名:乔瀚  容芷君  许莹  但斌斌  赵慧
作者单位:武汉科技大学 工业工程系 湖北S武汉,;武汉科技大学 工业工程系 湖北S武汉;武汉市第五医院 湖北 武汉
基金项目:武汉市科技局,企业技术创新项目(201901070211288)
摘    要:考虑样本数据集的差异性和相关性对疾病预测结果有着直接影响,提出一种基于多特征属性患者相似的糖尿病早期预测方法,根据患者之间特征具有相似性这一特点,对患者特征进行混合属性相似预分组,再把分组结果导入随机森林分类器进行疾病预测。首先以临床概念作为患者的特征项,通过聚类定量化分析不同特征属性类型间的距离来度量患者之间的混合相似度,根据患者混合相似度将患者集预分组为多个患者相似组。最后以随机森林分类器对相似组进行细分类,得到最终的疾病预测结果,该结果与基于全样本数据的随机森林分类结果相比,分类准确率提高了8.3%;与基于单一属性相似组的随机森林分类结果相比,分类准确率提高了5.1%。结果表明该方法具有较高的预测准确率,可为糖尿病诊断预测提供支持。

关 键 词:患者相似性  特征属性  聚类  分类  糖尿病预测
收稿时间:2021/4/27 0:00:00
修稿时间:2021/10/5 0:00:00

DIABETES EARLY PREDICTION METHOD BASED ON SIMILAR GROUPS OF PATIENTS WITH MULTIPLE CHARACTERISTIC ATTRIBUTES
Qiao Han,Rong Zhijun,Xu Ying,Dan Binbin,Zhao Hui.DIABETES EARLY PREDICTION METHOD BASED ON SIMILAR GROUPS OF PATIENTS WITH MULTIPLE CHARACTERISTIC ATTRIBUTES[J].Science Technology and Engineering,2021,21(36):15497-15502.
Authors:Qiao Han  Rong Zhijun  Xu Ying  Dan Binbin  Zhao Hui
Affiliation:Wuhan University of Science and Technology;Wuhan Fifth Hospital
Abstract:Considering that the difference and correlation of sample data sets have a direct impact on disease prediction results, a method for early prediction of diabetes based on the similarity of patients with multi-feature attributes is proposed. According to the characteristics of the similarity between patients, the characteristics of patients are mixed. The attributes are similar to pre-grouping, and then the grouping results are imported into the random forest classifier for disease prediction.First, the clinical concept is used as the patient"s feature item, and the distance between different feature attribute types is measured by clustering and quantitative analysis to measure the mixed similarity between patients, and the patient set is pre-grouped into multiple patient similar groups according to the mixed similarity of patients.Finally, a random forest classifier is used to subdivide the similar groups to obtain the final disease prediction result. Compared with the random forest classification result based on the full sample data, the classification accuracy is increased by 8.3%; compared with the similar group based on a single attribute Compared with the random forest classification results, the classification accuracy rate has increased by 5.1%.The results show that this method has a high prediction accuracy rate and can provide support for the diagnosis and prediction of diabetes.
Keywords:Patient  similarity  Characteristic  attribute  Clustering  Classification  Diabetes  prediction
点击此处可从《科学技术与工程》浏览原始摘要信息
点击此处可从《科学技术与工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号