Normalized Lempel-Ziv complexity and its application in bio-sequence analysis
Authors: Yi Zhang, Junkang Hao, Changjie Zhou, Kai Chang
Affiliation: 1. Department of Mathematics, Hebei University of Science and Technology, Shijiazhuang, People's Republic of China; 2. Physical Education Department, Hebei University of Science and Technology, Shijiazhuang, People's Republic of China; 3. Department of Automatic Control, School of Information Science and Technology, Beijing Institute of Technology, Beijing, People's Republic of China
Abstract: In this article, we propose a new method for measuring DNA similarity based on a normalized Lempel-Ziv complexity scheme. The new method reduces the effect of sequence length on the complexity measurement and saves computation time. First, a DNA sequence is transformed into three (0,1)-sequences by a scheme that distinguishes "A" from "non-A", "G" from "non-G", and "C" from "non-C" bases, respectively. Then, the normalized Lempel-Ziv complexities of the three (0,1)-sequences constitute a 3D vector. Finally, using this 3D vector, one may characterize DNA sequences and compute a similarity matrix for them. An examination of the similarities within two sets of DNA sequences illustrates the utility of the method in local and global similarity analysis.
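The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state the exact normalization or the distance used, so the sketch assumes the commonly used normalization c(n) · log2(n) / n of the Lempel-Ziv (1976) complexity and a Euclidean distance between the 3D vectors. All function names (`indicator`, `lz76_complexity`, `normalized_lz`, `feature_vector`, `dna_distance`) are hypothetical.

```python
import math


def indicator(seq, base):
    """Map a DNA string to a (0,1)-string: '1' where seq has `base`, else '0'."""
    return "".join("1" if ch == base else "0" for ch in seq.upper())


def lz76_complexity(s):
    """Count components in the Lempel-Ziv (1976) exhaustive parsing of s.

    Each component is the shortest substring starting at position i that
    cannot be copied from the text preceding its own last symbol.
    A trailing, incomplete component is counted as one component.
    """
    n = len(s)
    i = 0
    count = 0
    while i < n:
        k = 1
        # Extend the candidate while it is reproducible from earlier text.
        while i + k <= n and s[i:i + k] in s[:i + k - 1]:
            k += 1
        count += 1
        i += k
    return count


def normalized_lz(s):
    """ASSUMPTION: normalize as c(n) * log2(n) / n; the paper may differ."""
    n = len(s)
    if n < 2:
        return 0.0
    return lz76_complexity(s) * math.log2(n) / n


def feature_vector(seq):
    """3D vector from the A/non-A, G/non-G and C/non-C indicator sequences."""
    return tuple(normalized_lz(indicator(seq, b)) for b in ("A", "G", "C"))


def dna_distance(seq_a, seq_b):
    """ASSUMPTION: Euclidean distance between the two 3D feature vectors."""
    va, vb = feature_vector(seq_a), feature_vector(seq_b)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(va, vb)))
```

Computing `dna_distance` for every pair in a set of sequences yields a symmetric matrix with a zero diagonal, which can play the role of the similarity matrix mentioned in the abstract. The substring search used here is quadratic or worse in sequence length, so it is suitable for illustration only.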
This article is indexed in SpringerLink and other databases.