首页 | 本学科首页   官方微博 | 高级检索  
     检索      

A New Algorithm for Mining Frequent Pattern
作者姓名:李力  靳蕃
作者单位:SchoolofComputerandCommunicationEngineering,SouthwestjiaotongUniversity,Chengdu
摘    要:Mining frequent pattern in transaction database,time-series databases,and many other kinds of databases have been studied popularly in data mining research.Most of the previous studies adopt Apriori-like candidate set generation-and-test approach.However,candidate set generation is very costly.Han J.proposed a novel algorithm FP-growth that could generate frequent pattern without candidate set.Based on the analysis of the algorithm FP-growth,this paper proposes a concept of equivalent FP-tree and proposes an improved algorithm,denoted as FP-growth,which is much faster in speed,and easy to realize,FP-growth adopts a modifeid structure of FP-tree and header table,and only generates a header table in each recursive operation and projects the tree to the original FP-tree,The two algorithms get the same frequent pattern set in the same transaction database,but the performance study on computer shows that the speed of the improved algorithm,FP-growth,is at least two times as fast as that of FP-growth.

关 键 词:算法  重复模式案  数据库  FP树  FP路径

A New Algorithm for Mining Frequent Pattern
Abstract:Mining frequent pattern in transaction database, time-series databases, and many other kinds of databases have been studied popularly in data mining research. Most of the previous studies adopt Apriori-like candidate set generation-and-test approach. However, candidate set generation is very costly. Han J. proposed a novel algorithm FP-growth that could generate frequent pattern without candidate set. Based on the analysis of the algorithm FP-growth, this paper proposes a concept of equivalent FP-tree and proposes an improved algorithm, denoted as FP-growth*, which is much faster in speed, and easy to realize. FP-growth* adopts a modified structure of FP-tree and header table, and only generates a header table in each recursive operation and projects the tree to the original FP-tree. The two algorithms get the same frequent pattern set in the same transaction database, but the performance study on computer shows that the speed of the improved algorithm, FP-growth*, is at least two times as fast as that of FP-growth.
Keywords:data mining  algorithm  frequent pattern set  FP-growth
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号