首页 | 本学科首页   官方微博 | 高级检索  
     检索      

复杂网络的重叠社区发现并行算法
引用本文:滕飞,戴荣杰,任晓春.复杂网络的重叠社区发现并行算法[J].西南交通大学学报,2019,54(1):211-218.
作者姓名:滕飞  戴荣杰  任晓春
基金项目:国家自然科学基金资助项目(61573292,61572407);四川省软科学研究计划资助项目(2016ZR0034)
摘    要:随着网络规模的快速增长,传统社区发现算法难以处理大规模网络数据和满足复杂网络的可扩展分析需求. 本文提出一种适用于大规模复杂网络的重叠社区发现算法PHLink. 该算法根据复杂网络的无标度特性将节点建立连边的原因进行分析和归类,用以识别网络中具有重叠性的社区结构,并采用MapReduce计算框架对网络进行分割和冗余存储,减弱了图计算的耦合性,解决了社区发现算法的分布式计算问题. 通过真实网络测试,PHLink算法可以大幅度降低边计算的复杂度,对于无标度特性明显的复杂网络提取0.1%的枢纽节点即可节省94%以上的计算量,较传统算法具有较高的稳定性和准确性,并且在Hadoop平台有良好的加速性和伸缩性,可以处理千万级连边规模的大规模复杂网络. 

关 键 词:复杂网络    重叠社区    社区发现    无标度    并行算法    Hadoop
收稿时间:2016-04-18

Parallel Algorithm for Overlapping Community Detection in Complex Networks
TENG Fei,DAI Rongjie,REN Xiaochun.Parallel Algorithm for Overlapping Community Detection in Complex Networks[J].Journal of Southwest Jiaotong University,2019,54(1):211-218.
Authors:TENG Fei  DAI Rongjie  REN Xiaochun
Abstract:With the increasing size of complex networks, traditional detection methods are difficult to scale to larger networks. This paper proposes a parallel hierarchical link(PHLink) algorithm to discover overlapping communities. By studying the power-law degree distribution of complex networks, we distinguish the motivations of why two nodes intend to establish a connection so that the complexity for community detection can be reduced. PHLink is implemented using distributed computing for large-scale networks via graph segmentation and redundant storage. Experiments validate that PHLink can accurately discover network communities and the overlaps between them. For scale-free networks, removing 0.1% of the hub nodes reduces the computation time by 94%. Meanwhile, PHLink demonstrates good speed-up and scale-up on Hadoop, which can scale to very large complex networks with millions of edges. 
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《西南交通大学学报》浏览原始摘要信息
点击此处可从《西南交通大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号