AAAI Publications, The Twenty-Sixth International FLAIRS Conference

Font Size: 
MDL-Based Unsupervised Attribute Ranking
Zdravko Markov

Last modified: 2013-05-19


In the present paper we propose an unsupervised attribute ranking method based on evaluating the quality of clustering that each attribute produces by partitioning the data into subsets according to its values. We use the Minimum Description Length (MDL) principle to evaluate the quality of clustering and describe an algorithm for attribute ranking and a related clustering algorithm. Both algorithms are empirically evaluated on benchmark data sets. The experiments show that the MDL-based ranking performs closely to the supervised information gain ranking and thus improves the performance of the EM and k-means clustering algorithms in purely unsupervised setting.


Attribute Selection, Clustering, MDL

Full Text: PDF