Clustering algorithms are a type of unsupervised learning that groups unlabeled data points together based on similarities. There are many different clustering methods that can group data either hierarchically or into flat clusters, and either exclusively or with overlap. K-means clustering is a simple algorithm that assigns data points to clusters based on proximity to randomly assigned cluster centroids, and is useful despite limitations around sensitivity to outliers and requiring pre-specification of the number of clusters. Clustering has applications across many domains including marketing, astronomy, seismology, and genomics.
Related topics: