首页 | 官方网站   微博 | 高级检索  
     

基于密度-距离的t混合模型流式数据聚类
引用本文:赵其杰,柯震南,陶靖,卢建霞.基于密度-距离的t混合模型流式数据聚类[J].仪器仪表学报,2017,38(9):2130-2137.
作者姓名:赵其杰  柯震南  陶靖  卢建霞
作者单位:1.上海大学机电工程与自动化学院上海200072;2.上海市智能制造及机器人重点实验室上海200072,上海大学机电工程与自动化学院上海200072,上海纳衍生物科技有限公司上海201108,上海大学机电工程与自动化学院上海200072
基金项目:上海市浦江人才计划(17PJ1432300)项目资助
摘    要:传统流式数据采用人工设门法分析,效率低下且依赖于专家。近几年,很多自动流式数据聚类算法纷纷被提出,然而针对数据量不多且分布稀疏的小样本类群始终没有很好的解决办法。提出了一种基于密度-距离的t-混合模型流式数据聚类优化方法,能够较好地解决小样本类群区分困难的问题。该方法通过密度-距离中心算法定位各类群的初始中心,作为t-混合算法的初值对样本数据进行处理,通过最大似然估计求出各类群对应的样本数目,从而实现样本聚类。实验表明,与经典模型算法相比,基于密度-距离的t-混合模型优化算法具有更好的稳定性和可靠性,对小样本类群以及混叠的类群具有较强的适应能力。

关 键 词:t  混合模型  密度  距离中心算法  流式细胞分析术  聚类算法

Clustering based on density distance and t mixture model in flow cytometry data
Zhao Qijie,Ke Zhennan,Tao Jing and Lu Jianxia.Clustering based on density distance and t mixture model in flow cytometry data[J].Chinese Journal of Scientific Instrument,2017,38(9):2130-2137.
Authors:Zhao Qijie  Ke Zhennan  Tao Jing and Lu Jianxia
Affiliation:1. School of Mechatronics Engineering and Automation, Shanghai University,Shanghai 200072, China; 2. Shanghai Key Laboratory of Intelligent Manufacturing and Robotics,Shanghai 200072, China,School of Mechatronics Engineering and Automation, Shanghai University,Shanghai 200072, China,Shanghai Nayan Biotechnology Co., Ltd,Shanghai 201108,China and School of Mechatronics Engineering and Automation, Shanghai University,Shanghai 200072, China
Abstract:Traditionally, the flow cytometry data is analyzed manually, which is inefficient and depends on expert experiences. In recent years, a lot of automatic cluster algorithms have been proposed. However, the clustering performance is not satisfied for sparse data with a random distribution. Therefore, this paper presents an automatic clustering method based on density distance center for t mixture model algorithm in flow cytometry data, which is suitable for rare samples. The proposed method finds the center of each group by density distance center algorithm and uses it as the initial value of t mixture model to estimate the sample data by maximum likelihood estimation. Compared with the classical algorithm, the result shows that the t mixture model based on density distance center has better stability and reliability, and can better fit small or mixed samples.
Keywords:t mixture model  density distance center algorithm  flow cytometry  clustering algorithm
本文献已被 CNKI 等数据库收录!
点击此处可从《仪器仪表学报》浏览原始摘要信息
点击此处可从《仪器仪表学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号