首页 | 本学科首页   官方微博 | 高级检索  
     检索      

搜索引擎中基于Bayes分类的网页更新研究
引用本文:赵新慧.搜索引擎中基于Bayes分类的网页更新研究[J].交通与计算机,2005,23(5):63-65.
作者姓名:赵新慧
作者单位:辽宁石油化工大学,抚顺,113001
摘    要:在网络无限扩张的同时,网页也在频繁地变化,搜索引擎往往要定期更新它所检索的网页,需耗费大量时间和系统资源,因此提高更新效率是搜索引擎技术的关键.文章比较了目前存在的两种更新方法:统一更新方法和个体更新方法,指出两种方法优劣所在,提出一种改进的基于Bayes分类的网页更新方法.

关 键 词:搜索引擎  更新度  更新策略
收稿时间:04 21 2005 12:00AM
修稿时间:2005年4月21日

Classified Refresh Policy of Web Pages in Search Engine Based on Bayes Theory
ZHAO Xinhui.Classified Refresh Policy of Web Pages in Search Engine Based on Bayes Theory[J].Computer and Communications,2005,23(5):63-65.
Authors:ZHAO Xinhui
Abstract:The Web is huge and the Web pages are updated frequently. The index maintained by a search engine has to refresh Web pages periodically. This is extremely time and resource consuming because the search engine needs to crawl the Web and download Web pages to refresh its index. Therefore, improving the refresh efficiency is the key technology of the search engine. This paper compares uniform refresh policy and proportional refresh policy, and points out their advantages and disadvantages. Finally, this paper presents a reformed method called classified refresh policy based on Bayes Theory.
Keywords:search engine  freshness  refresh policy
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号