首页 | 官方网站   微博 | 高级检索  
相似文献
 共查询到19条相似文献,搜索用时 140 毫秒
1.
一个新的模糊聚类有效性指标   总被引:3,自引:1,他引:2  
孔攀  邓辉文  黄艳艳  江欢 《计算机工程》2009,35(12):143-144
提出一个新的模糊聚类有效性指标。该指标能确定由模糊C-均值算法(FCM)所得模糊划分的最优划分和最优聚类数,结合了模糊聚类的紧致性和分离性信息,用类内加权平方误差和计算紧致性,用类间相似度计算分离性。在3个人造数据集和3个真实数据集上进行对比实验,结果证明该指标的性能优于其他有效性指标。  相似文献   

2.
针对直觉模糊集合数据的聚类有效性问题,提出了一种基于直觉模糊包含度的聚类有效性分析方法。该方法采用直觉模糊包含度和直觉模糊划分熵来评价直觉模糊聚类的有效性。其中,直觉模糊包含度通过增加非隶属度参数对模糊包含度进行直觉化扩展,用于评价类与类间包含的程度;而直觉模糊划分熵用于检验分类结果的可靠性。最后通过典型实例验证了该方法的有效性。  相似文献   

3.
一个改进的模糊聚类有效性指标   总被引:1,自引:0,他引:1       下载免费PDF全文
聚类有效性指标既可用来评价聚类结果的有效性,也可以用来确定最佳聚类数。根据模糊聚类的基本特性,提出了一种新的模糊聚类有效性指标。该指标结合了数据集的分布特征和数据隶属度两个重要因素来评价聚类结果,提高了判别的准确性。实验证明,该指标能对模糊聚类结果进行正确的评价,并自动获得最佳聚类数,特别是对类间有交叠的情况能够做出准确判定。  相似文献   

4.
针对直觉模糊集合数据的聚类有效性问题,给出了一种用于发现最优模糊划分的聚类有效性方法.该方法采用直觉模糊相关度和直觉模糊熵两个重要因子来评价直觉模糊聚类的有效性.其中,直觉模糊相关度通过增加非隶属度参数对模糊相关度进行直觉化扩展,用于评价类与类间相关度的大小,同时加入权重参数解决了样本数据各维特征分配不均匀的问题,而直觉模糊熵用于检验分类结果的可靠性.最后通过实例验证了该方法对于紧致的、良好分离的教据集分类效果理想,其在目标编群、目标识别等信息融合领域有良好的应用前景.  相似文献   

5.
结合模糊聚类的类内紧致性和类间分离性信息,提出一种新的模糊聚类有效性指标。该指标能够确定由模糊C-均值算法(FCM)所得模糊划分的最优划分和最佳聚类数。在1个人造数据集和4个真实数据集上进行对比实验,结果表明该指标性能的优越性。  相似文献   

6.
模糊聚类方法可以更有效地对复杂数据集进行分析,由于模糊聚类算法的种类繁多且聚类结果会随着输入的聚类个数的不同而改变,使得模糊聚类算法产生的结果不准确,因此,要获得准确的聚类结果必须确定模糊聚类个数k.目前已有的研究主要是利用多种模糊聚类有效性指标来确定最优聚类个数k,但是诸如SSD,PBM等模糊聚类指标会随着划分的聚类个数k的增加而单调递减,导致聚类个数k不准确.为此,文中提出了一种结合多目标优化算法的模糊聚类有效性指标(A Validity Index of Fuzzy Clustering Combined with Multi-obj ective Optimization Algorithm,OSACF),将模糊聚类度量指标与多目标优化算法(Multi-Obj ective Optimization Algorithm,MOEA)相结合来解决聚类最优个数k的问题.与使用聚类有效性指标不同,OSACF通过建立聚类个数k与聚类度量指标之间的双目标模型并使用MOEA优化该双目标模型来确定最优聚类个数k,避免了聚类有效性指标趋于单调递减的影响.另一方面,OSACF使用形态形似距离替代传统的欧氏距离度量,避免了聚类形状对计算聚类k值的影响.实验结果表明,OSACF结合MOEA得到的最优模糊聚类个数k比已有的聚类有效性指标获得的结果更准确.  相似文献   

7.
模糊聚类是模式识别、机器学习和图像处理等领域的重要研究内容。模糊C-均值聚类算法是最常用的模糊聚类实现算法,该算法需要预先给定聚类数才能对数据集进行聚类。提出了一种新的聚类有效性指标,对聚类结果进行有效性验证。该指标从划分熵、隶属度、几何结构角度,定义了紧凑度、分离度、重叠度三个重要特征测量。在此基础上,提出了一种最佳聚类数确定方法。将新聚类有效性指标和传统有效性指标在6个人工数据集和3个真实数据集进行实验验证。实验结果表明,所提出的指标和方法能够有效地对聚类结果进行评估,适合确定样本的最佳聚类数。  相似文献   

8.
为解决在模糊C-均值聚类应用过程中,模糊加权指数的确定缺乏理论依据和有效评价方法的问题,提出了一种基于子集测度的模糊加权指数进化计算方法。根据子集测度理论定义聚类有效性函数,在聚类过程中通过循环进化迭代计算聚类的有效性,并将其值反馈到模糊加权指数的变化中,使收敛到一个稳定解。通过实验得到的加权指数符合预期的结果,理论分析和实验结果表明了该方法的有效性。  相似文献   

9.
模糊聚类分析结果是否合理的问题属于模糊聚类有效性判定课题,其核心是模糊聚类有效性函数的构造。文中基于序关系定义了模糊划分模糊熵来描述模糊划分的模糊程度。考虑到现有的一类有效的模糊聚类有效性函数就是基于数据集的模糊划分的,因此文中也用模糊划分的模糊熵作为聚类有效性函数。实验表明,模糊划分的模糊熵作为模糊聚类的有效性函数是合理的、可行的。  相似文献   

10.
针对模糊C-均值聚类算法对初始化分类参数(包括起始聚类中心位置和初始化分类隶属度矩阵)的选择比较敏感而导致分类结果差异性较大,以及错误分类会给解决实际问题带来难以预料后果的不足,本文从反映数据聚类后类间分离性测度的划分系数入手,提出了可变加权划分系数的新概念,并用于数据分类效果的评价。实验结果表明,本文提出的评价方法不仅是可行的,而且比模糊C-均值聚类算法的目标函数作为数据分类效果的评价准则更好。  相似文献   

11.
一种新的聚类有效性函数   总被引:3,自引:1,他引:2       下载免费PDF全文
聚类有效性函数是用于评价聚类结果优劣的指标,准确地给出初始聚类类别数将使得聚类结果趋于合理化。根据模糊不确定性理论及聚类问题的基本特性,引入了新的紧密度度量指标DiU;c),在此基础上提出了一个旨在寻求最优聚类类别数的有效性函数。该函数基于数据集的紧密度与分离度特征,综合考虑了数据成员的隶属度及数据集的几何结构。实验结果表明该有效性函数能够发现最优的聚类类别数,对于分类结构较为明确的数据集表现出良好的性能,并且对于权重系数具有良好的鲁棒性。  相似文献   

12.
聚类的错误主要表现为两种形式:将原属不同类的数据分到同一个聚类和将原属同一类的数据分到不同聚类。文中提出类内不一致性和类间重叠度两个指标分别度量聚类中出现这两类错误的程度。一个好的模糊分割中包含的聚类错误应尽可能少。同时,聚类紧致度应尽可能大。基于这两个错误度量指标和紧致性度量,提出一种有效性函数来判断模糊聚类的有效性。实验结果表明,提出的有效性函数能有效判断最佳聚类数并且有较好的鲁棒性。  相似文献   

13.
神经模糊系统中模糊规则的优选   总被引:5,自引:0,他引:5  
贾立  俞金寿 《控制与决策》2002,17(3):306-309
提出一种基于两级聚类算法的自组织神经模糊系统,该系统采用两级聚类算法(改进的最近邻域聚类算法和Gustafson-Kessel模糊聚类算法)对输入/输出数据进行模糊聚类,并由模糊聚类的划分熵确定最优划分,建立模糊模型,模型精度可由梯度下降法进一步提高。仿真结果表明,这种神经模糊系统具有结构简单、规则数少、学习速度快以及建模精度高等特点。  相似文献   

14.
A new cluster validity index is proposed that determines the optimal partition and optimal number of clusters for fuzzy partitions obtained from the fuzzy c-means algorithm. The proposed validity index exploits an overlap measure and a separation measure between clusters. The overlap measure, which indicates the degree of overlap between fuzzy clusters, is obtained by computing an inter-cluster overlap. The separation measure, which indicates the isolation distance between fuzzy clusters, is obtained by computing a distance between fuzzy clusters. A good fuzzy partition is expected to have a low degree of overlap and a larger separation distance. Testing of the proposed index and nine previously formulated indexes on well-known data sets showed the superior effectiveness and reliability of the proposed index in comparison to other indexes.  相似文献   

15.
Clustering analysis is the process of separating data according to some similarity measure. A cluster consists of data which are more similar to each other than to other clusters. The similarity of a datum to a certain cluster can be defined as the distance of that datum to the prototype of that cluster. Typically, the prototype of a cluster is a real vector that is called the center of that cluster. In this paper, the prototype of a cluster is generalized to be a complex vector (complex center). A new distance measure is introduced. New formulas for the fuzzy membership and the fuzzy covariance matrix are introduced. Cluster validity measures are used to assess the goodness of the partitions obtained by the complex centers compared those obtained by the real centers. The validity measures used in this paper are the partition coefficient, classification entropy, partition index, separation index, Xie and Beni’s index, and Dunn’s index. It is shown in this paper that clustering with complex prototypes will give better partitions of the data than using real prototypes.  相似文献   

16.
Cluster validity indices are used for estimating the quality of partitions produced by clustering algorithms and for determining the number of clusters in data. Cluster validation is difficult task, because for the same data set more partitions exists regarding the level of details that fit natural groupings of a given data set. Even though several cluster validity indices exist, they are inefficient when clusters widely differ in density or size. We propose a clustering validity index that addresses these issues. It is based on compactness and overlap measures. The overlap measure, which indicates the degree of overlap between fuzzy clusters, is obtained by calculating the overlap rate of all data objects that belong strongly enough to two or more clusters. The compactness measure, which indicates the degree of similarity of data objects in a cluster, is calculated from membership values of data objects that are strongly enough associated to one cluster. We propose ratio and summation type of index using the same compactness and overlap measures. The maximal value of index denotes the optimal fuzzy partition that is expected to have a high compactness and a low degree of overlap among clusters. Testing many well-known previously formulated and proposed indices on well-known data sets showed the superior reliability and effectiveness of the proposed index in comparison to other indices especially when evaluating partitions with clusters that widely differ in size or density.  相似文献   

17.
Novel Cluster Validity Index for FCM Algorithm   总被引:5,自引:0,他引:5       下载免费PDF全文
How to determine an appropriate number of clusters is very important when implementing a specific clustering algorithm, like c-means, fuzzy c-means (FCM). In the literature, most cluster validity indices are originated from partition or geometrical property of the data set. In this paper, the authors developed a novel cluster validity index for FCM, based on the optimality test of FCM. Unlike the previous cluster validity indices, this novel cluster validity index is inherent in FCM itself. Comparison experiments show that the stability index can be used as cluster validity index for the fuzzy c-means.  相似文献   

18.
A validity measure for fuzzy clustering   总被引:42,自引:0,他引:42  
The authors present a fuzzy validity criterion based on a validity function which identifies compact and separate fuzzy c-partitions without assumptions as to the number of substructures inherent in the data. This function depends on the data set, geometric distance measure, distance between cluster centroids and more importantly on the fuzzy partition generated by any fuzzy algorithm used. The function is mathematically justified via its relationship to a well-defined hard clustering validity function, the separation index for which the condition of uniqueness has already been established. The performance of this validity function compares favorably to that of several others. The application of this validity function to color image segmentation in a computer color vision system for recognition of IC wafer defects which are otherwise impossible to detect using gray-scale image processing is discussed  相似文献   

19.
Fuzzy $c$-means (FCM) and its variants suffer from two problems---local minima and cluster validity---which have a direct impact on the formation of final clustering. There are two strategies---optimization and center initialization strategies---that address the problem of local minima. This paper proposes a center initialization approach based on a minimum spanning tree to keep FCM from local minima. With regard to cluster validity, various strategies have been proposed. On the basis of the fuzzy cluster validity index, this paper proposes a selection model that combines multiple pairs of a fuzzy clustering algorithm and cluster validity index to identify the number of clusters and simultaneously selects the optimal fuzzy clustering for a dataset. The promising performance of the proposed center-initialization method and selection model is demonstrated by experiments on real datasets.   相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号