
IMPROVING THE INTERESTINGNESS OF WEB USAGE MINING
Authors: 杨怡玲 (YANG Yi ling), 管旭东 (GUAN Xu dong), 尤晋元 (YOU Jin yuan)
Affiliation: Dept. of Computer Science & Eng., Shanghai Jiaotong Univ., Shanghai 200030, China
Summary: Introduction. Web usage mining is the application of data mining to the web server's log in order to discover the behavior patterns of the web site's visitors. The behavior patterns found should be highly interesting; that is, they should be valid, novel, potentially useful, and ultimately understandable [1]. Finding frequently visited page groups is an important topic in web usage mining. Intuitively, a frequently visited page group (FVPG) is a set of web pages that are often requested together by a numbe…


YANG Yi ling, GUAN Xu dong, YOU Jin yuan. IMPROVING THE INTERESTINGNESS OF WEB USAGE MINING [J]. Journal of Shanghai Jiaotong University, 2002, 7(1): 15-22.
Abstract: Improvements to the mining of frequently visited groups of web pages were studied. First, in the data preprocessing phase, we introduce an extra frame-filtering step that reduces the negative influence of frame pages on the resulting page groups. Recognizing the frame pages in the site documents and constructing the frame-subframe relation set allows the subframe pages that influence the final mining result to be filtered out efficiently. Second, we enhance the mining algorithm by considering both the site topology and the content of the web pages. By introducing the normalized content-link ratio of a web page and the group interlink degree of a page group, the enhanced algorithm concentrates more on content pages that are less interlinked. The experiments show that the new approach can effectively reveal more interesting page groups that would not be found without these enhancements.
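
The frame-filtering step described in the abstract might be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say how frame pages are recognized or exactly which requests are dropped. Here it is assumed that frame pages are found by scanning site documents for `<frame>`/`<iframe>` tags, and that a subframe request is dropped when its parent frame page has already appeared in the same session. The names `build_frame_relations`, `filter_subframes`, and `FRAME_SRC` are hypothetical.

```python
import re

# Assumed recognition rule: a frame page is any document containing
# <frame src="..."> or <iframe src="..."> tags (hypothetical, not from the paper).
FRAME_SRC = re.compile(
    r'<i?frame\b[^>]*?\bsrc\s*=\s*["\']([^"\']+)["\']', re.IGNORECASE
)


def build_frame_relations(site_docs):
    """Build the frame-subframe relation set.

    site_docs maps a page URL to its HTML source. The result maps each
    frame page URL to the set of subframe URLs it embeds.
    """
    relations = {}
    for url, html in site_docs.items():
        subframes = set(FRAME_SRC.findall(html))
        if subframes:
            relations[url] = subframes
    return relations


def filter_subframes(session, relations):
    """Drop subframe requests whose parent frame page was seen earlier
    in the same session (an assumed filtering rule)."""
    seen_frames = set()
    kept = []
    for url in session:
        # Skip the request if any previously seen frame page embeds it.
        if any(url in relations[frame] for frame in seen_frames):
            continue
        kept.append(url)
        if url in relations:
            seen_frames.add(url)
    return kept
```

With this rule, a subframe page requested on its own (without its parent frame page earlier in the session) is kept, since it then reflects a deliberate visit rather than an automatic browser request.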
Keywords: data mining; web mining; web usage mining; log analysis; interestingness enhancement