首页 | 官方网站   微博 | 高级检索  
     

符号化近似SAX在时序数据挖掘中的应用研究
引用本文:刘懿,鲍德沛,杨泽红,赵雁南,贾培发,王家钦.符号化近似SAX在时序数据挖掘中的应用研究[J].计算机工程与应用,2006,42(27):191-193.
作者姓名:刘懿  鲍德沛  杨泽红  赵雁南  贾培发  王家钦
作者单位:清华大学计算机科学与技术系,北京,100084
摘    要:聚类是数据挖掘研究中最常见的一种方法,可以作为规则发现、异常发现等其它数据挖掘操作的基础,一直以来都是数据挖掘的研究热点之一。股票数据是一种典型的时间序列数据,利用股票数据进行时间序列数据挖掘的研究既有一定的实际应用价值,也是国内外的热点问题之一。文章首次将一种新型符号化方法SAX1]应用到标准普尔500指数的股票数据的聚类研究中,使用传统的欧氏距离和动态时间弯曲两种时间序列相似性度量方法进行实验。实验结果表明将SAX应用到股票数据聚类操作,可以得到更好的趋势聚类效果和更高的效率。

关 键 词:符号化近似  时间序列  聚类  数据挖掘
文章编号:1002-8331-(2006)27-0191-03
收稿时间:2006-02-01
修稿时间:2006-02-01

Application Research of a New Symbolic Approximation Method-SAX in Time Series Mining
LIU Yi,BAO De-pei,YANG Ze-hong,ZHAO Yan-nan,JIA Pei-fa,WANG Jia-qin.Application Research of a New Symbolic Approximation Method-SAX in Time Series Mining[J].Computer Engineering and Applications,2006,42(27):191-193.
Authors:LIU Yi  BAO De-pei  YANG Ze-hong  ZHAO Yan-nan  JIA Pei-fa  WANG Jia-qin
Affiliation:Computer Science and Technology Department,Tsinghua University,Beijing 100084
Abstract:Clustering is one of the most common data mining methods,being a hot topic in its own right as an exploratory tool,and also a subroutine in more complex algorithms such as rule discovery and abnormal discovery.As a typical time series data,stock data has been widely used in data mining research.In this paper,a new symbolic method-SAX1] is used in stock data clustering analysis.We apply the new method on stock data which is obtained from the Standard & Poor 500 index.Moreover,we use two similarity measure method including Euclidean and Dynamic Time Warping in our experiments.The experiments result shows that clustering can better focus on the whole trend and the efficiency can be improved with the help of SAX.
Keywords:Symbolic Approximation(SAX)  time series  clustering  data mining
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号