当前位置: 首页 > 文章 > 基于改进的ISODATA算法的大样本数据聚类方法研究 内蒙古农业大学学报(自然科学版) 2013,34 (1) 133-137
Position: Home > Articles > RESEARCH OF LARGE SAMPLE DATA CLUSTERING METHOD BASED ON IMPROVED ISODATA ALGORITHM Journal of Inner Mongolia Agricultural University(Natural Science Edition) 2013,34 (1) 133-137

基于改进的ISODATA算法的大样本数据聚类方法研究

作  者:
张丽娜;姜新华;那日苏
单  位:
内蒙古农业大学计算机与信息工程学院;内蒙古师范大学物理与电子信息学院
关键词:
ISODATA;大样本;黄金分割法;数据聚类
摘  要:
针对数量大、数据结构复杂、离散度大的样本数据的聚类分析,采用ISODATA算法实现。ISODATA算法是1种基于统计模式识别的非监督学习动态聚类方法,是大样本数据聚类分析常用的方法,但该算法需要预先确定初始聚类参数。本文提出了基于黄金分割法来度量聚类的有效性,该方法能动态计算聚类度量参数,以此实现大样本数据的有效聚类。实验证明:该方法能够合理、有效的进行数据聚类。
译  名:
RESEARCH OF LARGE SAMPLE DATA CLUSTERING METHOD BASED ON IMPROVED ISODATA ALGORITHM
作  者:
ZHANG Li-na1,JIANG Xin-hua2,NA Ri-su1(1.College of Physics and Electronic Information Science,Inner Mongolia Normal University,Huhhot,010022,China; 2.College of Computer and Information Engineering,Inner Mongolia Agricultural University,Huhhot,010018,China)
关键词:
ISODATA;large sample data;golden section method;data clustering
摘  要:
How to extract effective feature data form the large sample,complex structures and dispersion data is the key and difficult of the pattern recognition,the ISODATA algorithm is one of the common algorithm of large samples data clustering.While,the inadequacies of the algorithm is need to pre-determine initial cluster parameters.The paper proposed to measure the effectiveness of clustering based on the golden section method,the method can dynamically calculate the clustering metrics,and achieve effective clustering of large sample data.The results show that the method can select the most representative and best characteristic features from the original large sample data.

相似文章

计量
文章访问数: 10
HTML全文浏览量: 0
PDF下载量: 1

所属期刊

推荐期刊