基于搜索引擎的有害信息监控系统的设计与实现 Research and implementation of Bad Information Detection System based on search engine期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于搜索引擎的有害信息监控系统的设计与实现

引用本文：	张晓梅,苏斌,王竹林,杨柳.基于搜索引擎的有害信息监控系统的设计与实现[J].铁路计算机应用,2007,16(12):38-41.

作者姓名：	张晓梅苏斌王竹林杨柳

作者单位：	西南交通大学,信息网络中心,成都,610031

摘要：	在对搜索引擎核心技术进行研究的基础上,设计并实现一种采用主动扫描探测方法进行有害信息监控的系统.基于bot包设计网络蜘蛛模块,实现对html、asp、php和jsp等网页的自动抓取;采用反向最大匹配和二级哈希散列算法,实现中文分词;开发信息索引模块,实现对网页的批量和增量索引;开发有害信息检索模块,实现有害信息监控及预警功能.最后通过集成各模块,实现有害信息监控系统.
关键词：	搜索引擎有害信息监控网络蜘蛛中文分词信息索引信息检索
文章编号：	1005-8451（2007）11-0038-04
收稿时间：	2007-03-27
修稿时间：	2007年3月27日
Research and implementation of Bad Information Detection System based on search engine

ZHANG Xiao-mei,SU Bin,WANG Zhu-lin,YANG Liu.Research and implementation of Bad Information Detection System based on search engine[J].Railway Computer Application,2007,16(12):38-41.

Authors:	ZHANG Xiao-mei SU Bin WANG Zhu-lin YANG Liu

Abstract:	Based on the research of the kernel technology of search engine, Bad Information Detection System was proposed and implemented through initiative scanning .Html, asp, php, jsp and other style Web pages could be found automatically from lnternet by using the spider module based on bot package. Chinese word segmentation was implemented with reverse- going maximum matching method and two level hash arithmetic.Information index module for batch and incremental indexing and information searching module for bad information detection and alarming were implemented. At last, Bad Information System was established through module integration.

Keywords:	search engine bad information detection spider word segmentation information index information search
本文献已被维普万方数据等数据库收录！
	点击此处可从《铁路计算机应用》浏览原始摘要信息
	点击此处可从《铁路计算机应用》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏