k means produces empty clusters

blue-sky

I've implemented a k means clustering algorithm as described at http://faculty.uscupstate.edu/atzacheva/SHIM450/KMeansExample.doc

For some datasets I find that the number of clusters generated do not always equal initial k. Is this to be expected?

I think it is to be expected as after each iteration each data point is added to the closest cluster, but this does not guarantee that all clusters will be populated. Each cluster is reinitialised after each iteration so some clusters may not be sufficiently close to points to have any point added to them.

blue-sky

This paper describes that empty clusters for k means can occur http://www.academypublisher.com/ijrte/vol01/no01/ijrte0101220226.pdf

One of the major problems of the k-means algorithm is that it may produce empty clusters depending on initial center vectors. For static execution of the k-means, this problem is considered insignificant and can be solved by executing the algorithm for a number of times.

本文收集自互联网,转载请注明来源。

如有侵权,请联系[email protected] 删除。

编辑于
0

我来说两句

0条评论
登录后参与评论

相关文章

来自分类Dev

K-means clustering uniqueness of solution

来自分类Dev

(C++) K-Means Clustering trouble

来自分类Dev

(C ++)K-Means聚类问题

来自分类Dev

如何使用TensorFlow实施k-means?

来自分类Dev

K-Means GridSearchCV超参数调整

来自分类Dev

k Means中的收敛是什么?

来自分类Dev

openCV k-means调用断言失败

来自分类Dev

(C ++)K-Means聚类问题

来自分类Dev

python k-means聚类文本

来自分类Dev

正确实现k-means算法

来自分类Dev

是什么使距离k-medoid比“ k-means”更好?

来自分类Dev

是什么使距离k-medoid比“ k-means”更好?

来自分类Dev

K-Means VS K-模式?(文本聚类)

来自分类Dev

R k-means算法定制中心

来自分类Dev

Cost function value in k-means clustering mahout

来自分类Dev

Apache Spark K-Means集群-输入RDD

来自分类Dev

Python将k-means集群关联到实例

来自分类Dev

kdtree是否用于加快k-means聚类?

来自分类Dev

ggplot K-Means群集中心和群集

来自分类Dev

如何手动设置K-means集群的中心?

来自分类Dev

为sklearn k-means添加标签

来自分类Dev

消息不适合sklearn k-means收敛实现

来自分类Dev

解释K-Means cluster_centers_输出

来自分类Dev

执行K-Means算法时检索索引

来自分类Dev

无法使用SPSS Modeler 16运行k-means

来自分类Dev

使用python K-MEANS进行Hadoop流式传输

来自分类Dev

K-Means,如何访问群集中的点?

来自分类Dev

为sklearn k-means添加标签

来自分类Dev

在线k-means中聚类数量的良好估计