详细信息
        基于Tukey规则与初始中心点优化的K?means聚类改进算法     被引量:5
Improved K?means Clustering Algorithm Based on Tukey Rule and Initial Center Point Optimization
文献类型:期刊文献
中文题名:基于Tukey规则与初始中心点优化的K?means聚类改进算法
英文题名:Improved K?means Clustering Algorithm Based on Tukey Rule and Initial Center Point Optimization
作者:柳菁[1];邱紫滢[1];郭茂祖[2];余冬华[1]
机构:[1]绍兴文理学院计算机科学与工程系,绍兴312000;[2]北京建筑大学电子信息工程学院,北京100044
年份:2023
卷号:38
期号:3
起止页码:643
中文期刊名:数据采集与处理
外文期刊名:Journal of Data Acquisition and Processing
收录:CSTPCD、、Scopus、CSCD_E2023_2024、北大核心、CSCD、北大核心2020
基金:国家自然科学基金(62002227);绍兴文理学院校级科研项目(2021LG004)。
语种:中文
中文关键词:数据挖掘;K?means聚类算法;Tukey规则;中心点优化
外文关键词:data mining;K?means clustering algorithm;Tukey rule;center point optimization
中文摘要:针对K?means聚类算法存在的初始中心点选择及异常点、离群点极易影响聚类结果等待改进问题,提出了一个基于Tukey规则与优化初始中心点选择的K?means改进算法。该算法利用Tukey规则构造核心与非核心子集,将聚类过程划分成2个阶段。同时,在核心子集上执行中心点逐个递增优化选择策略,选出初始中心点。在来自UCI的20个数据集上聚类结果表明,本文提出的算法优于K?means++聚类算法,有效地提升了聚类性能。
外文摘要:Aiming at shortcomings of the K-means algorithm to be improved,such as selection of initial center points and the problems that abnormal points and outliers can easily affect the clustering results,this paper proposes an improved K-means algorithm based on Tukey rules and optimizing initial center points selection.The proposed algorithm uses Tukey rules to construct core and non-core subsets,and divides the clustering process into two stages.At the same time,the strategy of increasing the center points one by one is implemented on the core subset to optimize the initial center points.The clustering results on 20 realworld datasets from UCI show that the proposed algorithm is better than the most popular K?means++clustering algorithm and effectively improves the clustering performance.
参考文献:
                                
                                正在载入数据...
                    
 
            