首页 | 本学科首页   官方微博 | 高级检索  
     

基于最大频繁序列的蛋白质分类算法
引用本文:卫锦花,吴陈. 基于最大频繁序列的蛋白质分类算法[J]. 江苏科技大学学报(社会科学版), 2007, 21(Z1): 79-83
作者姓名:卫锦花  吴陈
作者单位:江苏科技大学电子信息学院,江苏科技大学电子信息学院 江苏 镇江 212003,江苏 镇江 212003
摘    要:针对现有基于频繁模式的分类算法未考虑完全频繁模式所产生的大量无效序列,提出了一种基于最大频繁序列的蛋白质分类算法,此算法每一类都以独有的最大频繁模式作为代表,执行模式裁减和测试数据分类。实验表明该算法在继承传统算法优点的同时提高了结果的精确度,降低了模式的冗余度,此应用增加了分类的生物信息学意义。

关 键 词:蛋白质序列  分类  最大频繁序列
文章编号:1673-4807(2007)-0079-05

Protein Sequence Classification Algorithm Based on Maximal Frequent Sequence
WEI Jinhua,WU Chen. Protein Sequence Classification Algorithm Based on Maximal Frequent Sequence[J]. Journal of Jiangsu University of Science and Technology(Natural Science Edition), 2007, 21(Z1): 79-83
Authors:WEI Jinhua  WU Chen
Abstract:Aimed at the massive invalid sequences caused by the complete frequent patterns,which is not con- sidered in the existing classification algorithm,a protein sequence classification algorithm is proposed based on the maximal frequent sequence.In this algorithm each class can be presented by the particular maximal frequent pattern,then the pattern can be reduced and the test data can be classified.Experiments show that this algorithm can improve the precision of results and reduce the redundancy of the pattern with remaining the advantages of the traditional algorithm,the bioinformatics meaning can then be increased through such an application.
Keywords:protein sequence  classification  maximal frequent sequence
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号