首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Automating a framework to extract and analyse transport related social media content: The potential and the challenges
Institution:1. Faculty of Engineering, Information Systems Department, Mondragon Unibertsitatea, Loramendi, 4, Arrasate, 20500, Spain;2. IK4-IDEKO, Intelligent Software department, Arriaga Kalea, 2, 20870 Elgoibar, Spain;3. Transport Department, School of Civil Engineering, Universitat Politècnica de Valencia, Camino de Vera s/n, 46022, Valencia, Spain
Abstract:Harnessing the potential of new generation transport data and increasing public participation are high on the agenda for transport stakeholders and the broader community. The initial phase in the program of research reported here proposed a framework for mining transport-related information from social media, demonstrated and evaluated it using transport-related tweets associated with three football matches as case studies. The goal of this paper is to extend and complement the previous published studies. It reports an extended analysis of the research results, highlighting and elaborating the challenges that need to be addressed before a large-scale application of the framework can take place. The focus is specifically on the automatic harvesting of relevant, valuable information from Twitter. The results from automatically mining transport related messages in two scenarios are presented i.e. with a small-scale labelled dataset and with a large-scale dataset of 3.7 m tweets. Tweets authored by individuals that mention a need for transport, express an opinion about transport services or report an event, with respect to different transport modes, were mined. The challenges faced in automatically analysing Twitter messages, written in Twitter’s specific language, are illustrated. The results presented show a strong degree of success in the identification of transport related tweets, with similar success in identifying tweets that expressed an opinion about transport services. The identification of tweets that expressed a need for transport services or reported an event was more challenging, a finding mirrored during the human based message annotation process. Overall, the results demonstrate the potential of automatic extraction of valuable information from tweets while pointing to areas where challenges were encountered and additional research is needed. The impact of a successful solution to these challenges (thereby creating efficient harvesting systems) would be to enable travellers to participate more effectively in the improvement of transport services.
Keywords:Mining Twitter for transport information  Social media  Text mining  Opinion mining  Twitter
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号