Finding Meaning of Clusters

Authors

Yutaka Matsuo and Yukio Ohsawa

Track:

Contents

Downloads:

Abstract:

Clustering is an important data exploration task in chance discovery as well as in data mining. The first hierarchical clustering dates back to 1951 by K. Florek; since then, there have been numerous algorithms. However, there is no consensus among researchers as to what constitutes a cluster; the choice of the cluster is application-dependent. Although clustering is sometimes evaluated by interpretability of clusters, few studies have been done to reveal the interpretation aspect of clusters. This paper explains development of a new clustering algorithm by graph-based partitioning which aims to simplify interpretation of clusters. Two typical cluster types are considered: a star and a diamond. A star is a cluster with explicit shared context, represented by a central node. A diamond is a cluster with shared context, whose main cause of the context is implicit and hidden. These two types are very easy to understand. We elicit these types of clusters from a given weighted linkage graph. Minimization of weight of the graph cut is also considered. We show some examples and explain the effectiveness of our method.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.