AAAI Publications, The Twenty-Sixth International FLAIRS Conference

Data Clustering and Similarity
Julien Soler, Fabien Tencé, Laurent Gaubert, Cédric Buche

Last modified: 2013-05-19


In this article, we study the notion of similarity in the context of cluster analysis. We first examine several distances commonly used for this task and highlight important properties they may have, such as taking the data distribution into account or being less sensitive to the curse of dimensionality. We then turn to inter- and intra-cluster similarities, and identify how the choice of distance and of similarity measure can influence the nature of the resulting clusters.
