首页 | 本学科首页   官方微博 | 高级检索  
     

A Fast Algorithm for Support Vector Clustering
引用本文:吕常魁,姜澄宇,王宁生. A Fast Algorithm for Support Vector Clustering[J]. 西南交通大学学报(英文版), 2004, 12(2): 136-140
作者姓名:吕常魁  姜澄宇  王宁生
作者单位:CIMSResearchCenter,NanjingUniversityofAeronautics&Astronautics,Nanjing210016,China
基金项目:TheNationalHighTechnologyResearchandDevelopmentProgramofChina (No .86 3 5 11 930 0 0 9)
摘    要:Support Vector Clustering (SVC) is a kernel-based unsupervised learning clustering method. The main drawback of SVC is its high computational complexity in getting the adjacency matrix describing the connectivity for each pairs of points. Based on the proximity graph model, the Euclidean distance in Hilbert space is calculated using a Gaussian kernel, which is the right criterion to generate a minimum spanning tree using Kruskal‘s algorithm. Then the connectivity estimation is lowered by only checking the linkages between the edges that construct the main stem of the MST ( Minimum Spanning Tree), in which the non-compatibility degree is originally defined to support the edge selection during linkage estimations. This new approach is experimentally analyzed.The results show that the revised algorithm has a better performance than the proximity graph model with faster speed, optimized clustering quality and strong ability to noise suppression, which makes SVC scalable to large data sets.

关 键 词:SVC 支持向量聚类 支持向量机 邻近曲线 最小生成树

A Fast Algorithm for Support Vector Clustering
Abstract:Support Vector Clustering (SVC) is a kernel-based unsupervised learning clustering method. The main drawback of SVC is its high computational complexity in getting the adjacency matrix describing the connectivity for each pairs of points. Based on the proximity graph model [3], the Euclidean distance in Hilbert space is calculated using a Gaussian kernel, which is the right criterion to generate a minimum spanning tree using Kruskal's algorithm. Then the connectivity estimation is lowered by only checking the linkages between the edges that construct the main stem of the MST (Minimum Spanning Tree), in which the non-compatibility degree is originally defined to support the edge selection during linkage estimations. This new approach is experimentally analyzed. The results show that the revised algorithm has a better performance than the proximity graph model with faster speed, optimized clustering quality and strong ability to noise suppression, which makes SVC scalable to large data sets.
Keywords:Support vector machines  Support vector clustering  Proximity graph  Minimum spanning tree
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号