首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于属性关联及匹配差异度的数据流异常检测
引用本文:琚春华,李耀林.基于属性关联及匹配差异度的数据流异常检测[J].西南交通大学学报,2013,26(1):107-115.
作者姓名:琚春华  李耀林
作者单位:浙江工商大学计算机与信息工程学院;浙江工商大学现代商贸研究中心
基金项目:国家自然科学基金资助项目(71071141);浙江省自然科学基金重点项目(Z1091224);教育部博士点基金资助项目(20103326110001)
摘    要:为解决类别属性数据流异常点检测问题,针对事务数据流环境,提出了基于属性关联及匹配差异度的数据流异常检测模型AAMDD(attribute associations and match difference degree).AAMDD模型离线构建一个关联规则库,并对其进行增量式更新.同时,利用时间敏感型滑动窗口(time-sensitive sliding windows,TimeSW)维护数据流数据,每经过一个时间跨度,就将当前窗口中每条数据包含的项集与关联规则库进行匹配,计算匹配差异度,根据匹配差异度的不同在线检测异常点.此外,给出了与AAMDD模型相对应的算法AAMDD-algorithm.实验结果表明,AAMDD-algorithm比FODFP-Stream算法的效率和检测精确度分别平均提高了37.43%和5.51%,并且AAMDD-algorithm的查全率保持在77%以上,可用于事务型数据流异常检测. 

关 键 词:数据流    关联规则    差异度    增量式异常检测    概念漂移
收稿时间:2011-11-20

Outlier Detection Model for Data Streams Based on Attribute Associations and Match Difference Degree
JU Chunhua,LI Yaolin.Outlier Detection Model for Data Streams Based on Attribute Associations and Match Difference Degree[J].Journal of Southwest Jiaotong University,2013,26(1):107-115.
Authors:JU Chunhua  LI Yaolin
Institution:1(1.College of Computer Science and Information Engineering,Zhejiang Gongshang University,Hangzhou 310018,China;2.Center for Studies of Modern Business,Zhejiang Gongshang University,Hangzhou 310018,China)
Abstract:In order to solve the problem of outlier detection for categorical data streams, an outlier detection model for data streams based on attribute associations and match difference degree was proposed, called as AAMDD. This model builds an association rule library off-line and updates it with the incremental method. Meanwhile, it maintains the data streams by using time-sensitive sliding windows (TimeSW). In a time step, the AAMDD matches data in current window with association rules in the association rule library and calculates the match difference degree (MDD). Then, outliers can be identified on-line through different MDDs. An algorithm for the AAMDD was given, called as AAMDD-algorithm. The experiment results show that compared with the FODFP-Stream algorithm, the AAMDD-algorithm has on average 5.51%and 37.43%improvements respectively in detection precision and efficiency, and its recall is above 77%. It can be used to detect outliers in transaction data streams. 
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《西南交通大学学报》浏览原始摘要信息
点击此处可从《西南交通大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号