首页 | 官方网站   微博 | 高级检索  
     


Efficient mining of sequential patterns with time constraints: Reducing the combinations
Authors:F Masseglia  P Poncelet  M Teisseire
Affiliation:1. INRIA, 2004 Route des Lucioles – BP 93, 06902 Sophia Antipolis, France;2. EMA-LGI2P/Site EERIE, Parc Scientifique Georges, Besse, 30035 Nîmes Cedex 1, France;3. LIRMM UMR CNRS 5506, 161 Rue Ada, 34392 Montpellier Cedex 5, France;1. Unidad de Arritmias, Hospital Universitario Virgen del Rocío, Sevilla, España;2. Unidad de Arritmias, Hospital Universitario 12 de Octubre, Madrid, España;3. Unidad de Arritmias, Hospital Universitario de Burgos, Burgos, España;1. Unidad de Arritmias, Hospital Universitario 12 de Octubre, Madrid, España;2. Unidad de Arritmias, Hospital Universitario de Burgos, Burgos, España;3. Unidad de Arritmias, Hospital Universitario de Alicante, Alicante, España;1. CNRS, IRIT, 2 rue C. Camichel, 31071 Toulouse, France;2. CNRS, LAAS, 7 avenue du colonel Roche, 31400 Toulouse, France;3. IKERBASQUE, Basque Foundation for Science, 48011 Bilbao, Spain;4. UPV/EHU, University of the Basque Country, 20018 Donostia, Spain;5. Univ. de Toulouse, INP, INSA, LAAS, 31400 Toulouse, France;1. Service de Pharmacologie-Toxicologie, Hôpital Dupuytren, 2 Avenue Martin Luther King, 87042 Limoges, France;2. Groupement de Recherche Eau, Sol, Environnement, Université de Limoges, Faculté des Sciences, 123 Avenue Albert Thomas, 87060 Limoges, France;3. Institut de Mathématiques de Toulouse (UMR CNRS 5219), 31062 Toulouse, France;4. Service de Gérontologie clinique, Hôpital Dupuytren, 2 Avenue Martin Luther King, 87042 Limoges, France
Abstract:In this paper we consider the problem of discovering sequential patterns by handling time constraints as defined in the Gsp algorithm. While sequential patterns could be seen as temporal relationships between facts embedded in the database where considered facts are merely characteristics of individuals or observations of individual behavior, generalized sequential patterns aim to provide the end user with a more flexible handling of the transactions embedded in the database. We thus propose a new efficient algorithm, called Gtc (Graph for Time Constraints) for mining such patterns in very large databases. It is based on the idea that handling time constraints in the earlier stage of the data mining process can be highly beneficial. One of the most significant new feature of our approach is that handling of time constraints can be easily taken into account in traditional levelwise approaches since it is carried out prior to and separately from the counting step of a data sequence. Our test shows that the proposed algorithm performs significantly faster than a state-of-the-art sequence mining algorithm.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号