首页 | 本学科首页   官方微博 | 高级检索  
     

基于词典的法律案例自动归类系统的开发
引用本文:官礼和,杨刚,李永礼. 基于词典的法律案例自动归类系统的开发[J]. 重庆交通大学学报(自然科学版), 2004, 23(1): 116-120
作者姓名:官礼和  杨刚  李永礼
作者单位:重庆交通学院,计算机与信息学院,重庆,400074;兰州大学,信息科学与工程学院,甘肃,兰州,730000
摘    要:笔者详细讨论并成功开发了"法律案例分析系统"的一个子系统—"法律案例自动归类系统".系统首先通过大量的法律案例训练文档得到树结构中每个类(叶子类和中间类)的类特征词权值表,然后在此基础上计算新法律案例文档相对于各个类的累加权值,最后累加权值最大并且是叶子类的类即是该法律案例应归入的类.笔者还给出并分析了用到的两个重要公式(特征词权值公式和类累加权值公式),详细介绍了系统的核心—基于词典的分词算法.实验表明本系统具有很好的通用性和扩展性,归类准确率较理想.

关 键 词:累加权值  特征词  类特征词权值表  词频  特征词典
文章编号:1001-716X(2004)01-0116-05
修稿时间:2003-02-20

Developing of the law-case automatic categorizing system based on law lexicons
Abstract:The paper discussed and developed a Law-case automatic categorizing system, which makeis a subsystem of the "Law-case Analyzing System".Firstly,the characteristic-word weight tables of each category are gained out of a mass of law-case training documents which have been categorized already.Secondly,the weight summation is conducted for each category based on the weight tables of the characteristic Words.Finally,the related law case falls under the category which is at the leafage of the Category tree and gets the biggest weiht sum.The paper also presents and andyzes two important formaula. the characteristic word weight formula and the weight-summing formula,puts forward a new word-parting algorithm based on law lexicons,which is the core module of the system. The experiment shows the excellent generality,expansibility and satisfactory categorizing nicety of the system.
Keywords:sum weight  characteristic words   characteristic-word weight tables   word frequency  characteristic lexicons
本文献已被 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号