首页 | 官方网站   微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 140 毫秒
1.
基于梯形云的数量型关联规则挖掘方法   总被引:1,自引:0,他引:1  
本文所给出的数量型关联规则挖掘方法是将数据库中之数量型数据先基于梯形云进行概念划分.然后利用得到的概念对数量型数据进行布尔型处理.处理后的数据就可以使用成熟的布尔型关联规则挖掘算法进行挖掘。  相似文献   

2.
3.
Data warehousing and Data mining find enormous applications; RFID technology is one among them. A RFID data warehousing system with novel data cleaning, transformation and loading technique has been proposed in the previous work. The system has been dedicatedly implemented in one of the significant RFID applications tracking of goods in warehouses. The warehoused RFID data is in specific format and so an effective mining system is required to mine the needed information from the database. The existing mining algorithms are inefficient in extracting the information from the warehoused RFID data. In this paper, a novel data mining system is proposed, which effectively extracts the information regarding the nature of movement of the RFID tags. The proposed mining system generates an intermediate dataset (I-dataset) from the warehoused dataset. From the I-dataset, sequential patterns are mined with different pattern length combinations. From the mined sequential patterns, fuzzy rules are generated, which depicts the nature of movement of the RFID tags. The implementation results show that the proposed mining system performs well by extracting the significant RFID tags and its combinations and the nature of movement of the tags.  相似文献   

4.
在对目标活动数据进行处理的过程中,由于表面上只能看到目标活动的航迹情况,而不能更有效地发现不同活动之间的关联关系。对数据挖掘技术的概念进行了介绍,说明了数据挖掘的一般过程及在情报处理中的应用意义,提出了一种通过关联规则算法对已有目标活动数据进行数据挖掘的方法,并详细描述了数据挖掘技术在情报处理中应用的体系结构和实现方式。  相似文献   

5.
Most methods for structure-function analysis of the brain in medical images are usually based on voxel-wise statistical tests performed on registered magnetic resonance (MR) images across subjects. A major drawback of such methods is the inability to accurately locate regions that manifest nonlinear associations with clinical variables. In this paper, we propose Bayesian morphological analysis methods, based on a Bayesian-network representation, for the analysis of MR brain images. First, we describe how Bayesian networks (BNs) can represent probabilistic associations among voxels and clinical (function) variables. Second, we present a model-selection framework, which generates a BN that captures structure-function relationships from MR brain images and function variables. We demonstrate our methods in the context of determining associations between regional brain atrophy (as demonstrated on MR images of the brain), and functional deficits. We employ two data sets for this evaluation: the first contains MR images of 11 subjects, where associations between regional atrophy and a functional deficit are almost linear; the second data set contains MR images of the ventricles of 84 subjects, where the structure-function association is nonlinear. Our methods successfully identify voxel-wise morphological changes that are associated with functional deficits in both data sets, whereas standard statistical analysis (i.e., t-test and paired t-test) fails in the nonlinear-association case.  相似文献   

6.
关联规则挖掘是数据挖掘和知识发现中一门重要技术,但基于支持度-置信度框架的关联规则挖掘存在一些问题。文章引入了双向关联规则的概念,实现了基于遗传算法的双向关联规则挖掘算法。实验证明,它能弥补传统关联规则挖掘算法的不足。  相似文献   

7.
浅谈关联规则挖掘   总被引:3,自引:0,他引:3  
姚俊 《信息技术》2005,29(6):85-87
首先扼要地介绍了当前关联规则挖掘的研究情况和一些典型关联规则挖掘算法,并分析了传统关联规则挖掘算法的不足。与此同时,介绍了几种优化算法,进而指出关联规则的价值衡量方法。最后,展望了关联规则挖掘技术的未来研究方向。  相似文献   

8.
Understanding network traffic behaviour is crucial for managing and securing computer networks. One important technique is to mine frequent patterns or association rules from analysed traffic data. On the one hand, association rule mining usually generates a huge number of patterns and rules, many of them meaningless or user‐unwanted; on the other hand, association rule mining can miss some necessary knowledge if it does not consider the hierarchy relationships in the network traffic data. Aiming to address such issues, this paper proposes a hybrid association rule mining method for characterizing network traffic behaviour. Rather than frequent patterns, the proposed method generates non‐similar closed frequent patterns from network traffic data, which can significantly reduce the number of patterns. This method also proposes to derive new attributes from the original data to discover novel knowledge according to hierarchy relationships in network traffic data and user interests. Experiments performed on real network traffic data show that the proposed method is promising and can be used in real applications. Copyright © 2013 John Wiley & Sons, Ltd.  相似文献   

9.
关联规则是捕述Web用户行为特征的重要依据。传统的基于统计学的Web关联规则挖掘算法注重对现有数据的分析,不能针对Web关联挖掘的特点提供有效的预测手段和优化反馈措施。本文提出一种基于遗传算法的Web关联挖掘算法体系,实验证明它能。弥补传统Web关联挖掘算法的不足,为Web关联挖掘提供了一种新的思路。  相似文献   

10.
Association rules represent a promising technique to improve heart disease prediction. Unfortunately, when association rules are applied on a medical data set, they produce an extremely large number of rules. Most of such rules are medically irrelevant and the time required to find them can be impractical. A more important issue is that, in general, association rules are mined on the entire data set without validation on an independent sample. To solve these limitations, we introduce an algorithm that uses search constraints to reduce the number of rules, searches for association rules on a training set, and finally validates them on an independent test set. The medical significance of discovered rules is evaluated with support, confidence, and lift. Association rules are applied on a real data set containing medical records of patients with heart disease. In medical terms, association rules relate heart perfusion measurements and risk factors to the degree of disease in four specific arteries. Search constraints and test set validation significantly reduce the number of association rules and produce a set of rules with high predictive accuracy. We exhibit important rules with high confidence, high lift, or both, that remain valid on the test set on several runs. These rules represent valuable medical knowledge.  相似文献   

11.
Liao  Jiyong  Wu  Sheng  Liu  Ailian 《Wireless Personal Communications》2021,116(3):1639-1657

High utility itemsets mining has become a hot research topic in association rules mining. But many algorithms directly mine datasets, and there is a problem on dense datasets, that is, too many itemsets stored in each transaction. In the process of mining association rules, it takes a lot of storage space and affects the running efficiency of the algorithm. In the existing algorithms, there is a lack of efficient itemset mining algorithms for dense datasets. Aiming at this problem, a high utility itemsets mining algorithm based on divide-and-conquer strategy is proposed. Using the improved silhouette coefficient to select the best K-means cluster number, the datasets are divided into many smaller subclasses. Then, the association rules mining is performed by Boolean matrix compression operation on each subclass, and iteratively merge them to get the final mining results. We also analyze the time complexity of our method and Apriori algorithm. Finally, experimental results on several well-known real world datasets are conducted to show that the improved algorithm performs faster and consumes less memory on dense datasets, which can effectively improve the computational efficiency of the algorithm.

  相似文献   

12.
遗传算法在Web关联挖掘中的应用研究   总被引:3,自引:1,他引:2  
关联规则是描述Web用户行为特征的重要依据。传统的基于统计学的Web关联规则挖掘算法注重对现有数据的分析,不能针对Web关联挖掘的特点提供有效的预测手段和优化反馈措施。文章提出一种基于遗传算法的Web关联挖掘算法体系,实验证明它能弥补传统Web关联挖掘算法的不足,为Web关联挖掘提供了一种新的思路。  相似文献   

13.
基于关联规则的Web挖掘技术研究   总被引:1,自引:1,他引:0  
关联规则是Web挖掘中一个重要的研究领域。为了挖掘出隐藏在数据间的相互关系,将关联规则的概念引入到Web挖掘系统中,把用户的访问路径以关联规则的形式表现出来。基于Apriori算法的思想,给出了适合Web挖掘用户访问的新Apriori算法规则及其模式,最后将结果在一些较简单的网页上进行了验证,取得了较好的应用效果。  相似文献   

14.
利用加权关联规则方法提取护理人员职场偏差行为问卷研究中的规则,样本包括539名护理人员.鉴于偏差行为影响因素重要性的不一致,基于传统的关联规则Apriori算法,设计并实现了加权关联规则的挖掘算法,应用于护理人员偏差行为的研究.实验表明,与Apriori算法相比,加权关联规则方法提取出更多的规则,并且可以增强某些规则的支持度和置信度,使这些规则更容易被发现,说明该方法是可行的.  相似文献   

15.
具有语义最小支持度的关联规则挖掘方法   总被引:1,自引:0,他引:1  
现有的关联规则挖掘方法中,大多采用单一的最小支持度.实际上,应该根据数据的特点设置不同的最小支持度.文中针对这一问题,将语义信息引入关联规则挖掘之中,提出了具有语义最小支持度的关联规则挖掘方法.该方法首先计算项目之间的语义相关度,然后根据候选集的语义相关度对候选集合进行过滤,最后根据候选集的语义相关度,确定其语义最小支持度.实验表明:具有语义最小支持度的关联规则挖掘方法比传统的关联规则挖掘方法能够更好地实现关联规则的挖掘.  相似文献   

16.
多维关联规则是数据挖掘中的一个重要研究方向,由此提出了一种高效的多维关联规则挖掘算法,该方法通过引入MDPI-tree(多维谓词索引树)结构,有效地将数据立方体技术和频繁项集挖掘算法FP-Growth结合起来,能用于挖掘维间和混合维关联规则.最后将此算法应用于移动通信交叉销售模型,通过实验验证算法的有效性和实用性.  相似文献   

17.
数据挖掘就是从大量的数据中挖掘出有用的信息。数据挖掘与传统的数据分析的本质区别是数据挖掘是在没有明确假设的前提下去挖掘信息、发现知识。文章分析了数据挖掘算法的关联规则特性,对其在股票市场中的应用进行了重点,以便更好的应用在更多的领域。  相似文献   

18.
关联规则挖掘的软集包含度方法   总被引:2,自引:0,他引:2  
耿生玲  李永明  刘震 《电子学报》2013,41(4):804-809
本文在深入研究软集数据分析的基础上,将包含度引入软集数据关联规则挖掘中,利用包含度理论描述属性集之间的量化关系,给出软集上属性集间的包含度、关联规则和最大关联规则的概念,讨论包含度和可信度之间的联系.在此基础上给出利用包含度在事务数据软集中挖掘满足给定的支持度和可信度阈值的软关联规则方法,以及最大软关联规则的提取算法.理论证明和实例分析表明该关联规则挖掘方法是有效的,并通过实验对算法的性能进行了比较.  相似文献   

19.
数据挖掘是关联规则中一个重要的研究方向.对关联规则的数据挖掘和遗传算法进行概述,阐述关联规则数据挖掘的意义,提出一种采用改进型遗传算法的关联规则的提取算法,并从编码方法、适应度函数的构造和变异、选择、交叉算子设计方面进行讨论和分析,最后结合一个具体实例进行应用.实验证明这种算法是有效的.  相似文献   

20.
针对传统关联规则可视化挖掘方法不利于处理多值属性数据、缺乏展现数据间的频繁模式和关联模式以及效率低下等问题,提出了基于KAF因子和CHF因子的Apriori改进算法进行多值属性关联规则挖掘,实现了一种新的基于概念格的多值属性关联规则可视化方法.运用概念格理论对多值属性数据进行了重新定义和分类,建立了较为完整的挖掘过程参数调整策略,方便用户选择关键属性值进行规则挖掘分析,提高了算法运行速度和挖掘效率.以概念格结构将多值数据组织起来,实现了对频繁项集的可视化展示,以及关联规则的多模式可视化展示.实验结果表明,改进后的挖掘算法具有更好的性能,所提出的可视化形式和已有成果相比具有良好的展现效果.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号