Abstract
Clustering is an exploratory technique in data mining that aligns objects which have a maximum degree of similarity in the same group. The real-world data are usually mixed in nature, i.e., it can contain both numeric and nominal data. Performance degradation is a major challenge in existing mixed data clustering due to multiple iterations and increased complexities. We propose an integrated framework using frequent pattern analysis, frequent pattern-based framework for mixed data clustering (FPMC) algorithm, to cluster mixed data in a competent way by performing a one-time clustering along with attribute reduction. This algorithm comes under divide-and-conquer paradigm, with three phases, namely crack, transformation, and merging. The results are promising when the algorithm is applied on benchmark datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Han, J., Kamber, M., & Pei, J. (2012). Data mining: Concepts and techniques. Morgan Kaufmann, USA.
Hipp, J., Myka, A., Wirth, R., & Güntzer, U. (1998). A new algorithm for faster mining of generalized association rules
Liu, B., Ma, Y., & Wong, C. K. (2002). Improving an association rule based classifier. Principles of Data Mining and Knowledge Discovery Lecture Notes in Computer Science, (1910, 2000), 504–509.
Wang, K., Tang, L., Han, J., & Liu, J. (2002). Top down FP-growth for association rule mining. Advances in Knowledge Discovery and Data Mining Lecture Notes in Computer Science, 2336, 334–340
Huan, Z. (1998). Extensions to the k-means algorithm for clustering large data sets with categorical values. Data Mining and Knowledge Discovery, 2, 283–304.
Huang, Z. (1997). Clustering large data sets with mixed numeric and categorical values. In Asia Conference of Knowledge Discovery and Data.
Ahmad, A., & Dey, L. (2007). A k-mean clustering algorithm for mixed numeric and categorical Data.
He, Z., Xu, X., & Deng, S. (2005). Clustering mixed numeric and categorical data: A cluster ensemble approach.
Acknowledgments
This work is supported by the DST Funded Project, (SR/CSI/81/2011) under Cognitive Science Research Initiative in the Department of Computer Science, Amrita School of Arts and Sciences, Amrita Vishwa Vidyapeetham University, Kochi.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media Singapore
About this paper
Cite this paper
Asok, A., Jisha, T.J., Ashok, S., Judy, M.V. (2016). Integrated Framework Using Frequent Pattern for Clustering Numeric and Nominal Data Sets. In: Satapathy, S., Joshi, A., Modi, N., Pathak, N. (eds) Proceedings of International Conference on ICT for Sustainable Development. Advances in Intelligent Systems and Computing, vol 408. Springer, Singapore. https://doi.org/10.1007/978-981-10-0129-1_55
Download citation
DOI: https://doi.org/10.1007/978-981-10-0129-1_55
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-0127-7
Online ISBN: 978-981-10-0129-1
eBook Packages: EngineeringEngineering (R0)