首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种用于高速公路通行情况分析的收费数据挖掘方法
引用本文:赵怀鑫,邓然然,张英杰,丁明航,孙朝云,李伟.一种用于高速公路通行情况分析的收费数据挖掘方法[J].中国公路学报,2018,31(8):155-164.
作者姓名:赵怀鑫  邓然然  张英杰  丁明航  孙朝云  李伟
作者单位:1. 长安大学 信息工程学院, 陕西 西安 710064;2. 陕西省交通运输厅, 陕西 西安 710075
基金项目:陕西省自然科学基础研究计划重大基础研究项目(2017ZDJC-23);交通运输部科技项目(2011-364-812-050);陕西省自然科学基础研究计划青年项目(2017JQ5014);陕西省交通运输厅交通科研项目(15-45r)
摘    要:为更好地对高速公路通行情况进行分析,利用高速公路海量收费数据,提出了一种用于高速公路通行情况分析的数据挖掘方法。首先,在海量的贵州省高速公路收费数据中,筛选出指定进站名称及出站名称的数据并删除部分字段,仅保留与研究相关的内容,利用车辆进入收费站的时间和驶出收费站的时间计算出其在该路段上行驶的总时长,将行驶时长字段加入原数据。然后,采用孤立点检测算法清洗该数据,剔除其中异常值。完成上述预处理过程后,使用快速峰值聚类算法对行驶时长进行聚类分析,首先计算每条数据之间的距离,将距离矩阵作为该算法的输入并输出聚类结果;对比所采用的算法与K-Means算法对于行驶时长这一指标的聚类效果,可明显地看出该算法的聚类结果更接近于实际情况;然后将春节期间与2月第4周的收费数据进行聚类,通过对比可明显得出节假日期间各个车型通行比例的变化;将上述结果结合不同车型在不同时段的平均通行时间进行分析。研究结果表明:所提出的方法可有效地将在某段高速公路通行的车辆进行分类,并且分类结果与真实运行过程中车辆在高速公路上的通行情况一致,可为高速公路的运营管理以及维护方向提供合理的科学依据和数据支持。

关 键 词:交通工程  高速公路数据分析  数据挖掘  收费数据  快速峰值聚类  孤立点检测  
收稿时间:2017-08-11

Method of Mining Fee Data for Expressway Traffic Analysis
ZHAO Huai-xin,DENG Ran-ran,ZHANG Ying-jie,DING Ming-hang,SUN Zhao-yun,LI Wei.Method of Mining Fee Data for Expressway Traffic Analysis[J].China Journal of Highway and Transport,2018,31(8):155-164.
Authors:ZHAO Huai-xin  DENG Ran-ran  ZHANG Ying-jie  DING Ming-hang  SUN Zhao-yun  LI Wei
Institution:1. School of Information Engineering, Chang'an University, Xi'an 710064, Shaanxi, China;2. Department of Transport of Shaanxi Province, Xi'an 710075, Shaanxi, China
Abstract:To analyze expressway traffic situations more effectively, a data-mining method that can be used to analyze expressway traffic conditions by using the data of expressway mass data collection is proposed. First, the data of the specified entering station and leaving station were selected, and some fields were deleted from the vast number of Guizhou expressway fee data, so that only the data related to this study were retained. The time of driving into the entering station and driving out of the leaving station was used to calculate the vehicles' staying time between the two toll stations, and the transit time was added to the original data. Then, the outlier detection algorithm was used to clean the data and to reject outliers. Clustering by fast search and find of density peaks was used to perform cluster analysis on driving duration after completing the above pretreatment process. The distance between each piece of data was first calculated, and then the distance matrix was used as the input of the algorithm, and the clustering result was exported. The clustering effects of the algorithm used and of the K-means algorithm on the driving duration were compared. The clustering result of the proposed algorithm was closer to the actual situation. Then, the fee data for the fourth week in February and the Spring Festival were clustered separately. Comparing the results shows that the proportion of traffic in each model during the holiday season changed. The average transit time of different models at different time periods was analyzed by combining the above results. The results show that the method used can effectively classify vehicles passing through a certain expressway, and the classification result is consistent with the traffic condition of the vehicles on the expressway during real operation. This method can provide a scientific basis and data support for expressway operation management and maintenance directions.
Keywords:traffic engineering  expressway data analyzing  data mining  fee data  clustering by fast search and find of density peaks  outlier detection  
本文献已被 CNKI 等数据库收录!
点击此处可从《中国公路学报》浏览原始摘要信息
点击此处可从《中国公路学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号