Clustering of the Multi-Value Documents based on Probabilistic Features Association Mechanism
P Gopala Krishna1, D Lalitha Bhaskari2
1P Gopala Krishna*, Associate Professor, Department of IT, Gokaraju Rangaraju Institute of Engineering and Technology, Hyderabad, India.
2D Lalitha Bhaskari, Professor, Department of CS & SE, Andhra University College of Engineering, Andhra University, Visakhapatnam, India
Manuscript received on October 12, 2019. | Revised Manuscript received on October 22, 2019. | Manuscript published on November 10, 2019. | PP: 1576-1581 | Volume-9 Issue-1, November 2019. | Retrieval Number: A4538119119/2019©BEIESP | DOI: 10.35940/ijitee.A4538.119119
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Clustering multi-valued data is becoming increasingly difficult in data mining because individual features can take multiple interval values. Identifying a clustering model that suits such multi-valued data in data analysis applications remains an open problem. To address it, this paper proposes a feature selection method based on a probabilistic features association mechanism (PFAM). The problem arises mainly from the difficulty of identifying class information and from the multiple values associated with each individual feature. This work explores unsupervised feature selection by computing a probabilistic association score and reforming the multi-value data for effective clustering of multivariate datasets. By minimizing a reformation clustering error, the method preserves both the degree of similarity and the categorization information of the original data contents. The proposed approach is evaluated using clustering purity and Normalized Mutual Information (NMI) on multivariate document datasets. The experimental evaluation shows the improvement achieved by the proposed approach.
Keywords: Feature Selection, Probability Association, Clustering, Multi-value Document
Scope of the Article: Clustering
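
The abstract names clustering purity and Normalized Mutual Information as its evaluation measures without defining them. The following is a minimal sketch of the commonly used definitions, not the authors' evaluation code. The function names `purity` and `nmi` are illustrative, and normalizing NMI by the geometric mean of the two entropies is an assumption, since the abstract does not state which normalization was used.

```python
from collections import Counter
from math import log, sqrt


def purity(labels_true, labels_pred):
    """Fraction of items that belong to the majority class of their cluster."""
    clusters = {}
    for t, p in zip(labels_true, labels_pred):
        clusters.setdefault(p, Counter())[t] += 1
    majority_total = sum(max(counts.values()) for counts in clusters.values())
    return majority_total / len(labels_true)


def entropy(labels):
    """Shannon entropy (natural log) of a label assignment."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def nmi(labels_true, labels_pred):
    """Mutual information normalized by sqrt(H(true) * H(pred)).

    Assumption: geometric-mean normalization; other variants divide by the
    arithmetic mean or the maximum of the two entropies.
    """
    n = len(labels_true)
    joint = Counter(zip(labels_true, labels_pred))
    class_sizes = Counter(labels_true)
    cluster_sizes = Counter(labels_pred)
    mi = sum(
        (n_ij / n) * log(n * n_ij / (class_sizes[t] * cluster_sizes[p]))
        for (t, p), n_ij in joint.items()
    )
    h_true, h_pred = entropy(labels_true), entropy(labels_pred)
    if h_true == 0 or h_pred == 0:
        # Degenerate case: at least one partition has a single group.
        return 1.0 if h_true == h_pred else 0.0
    return mi / sqrt(h_true * h_pred)
```

Both measures take the ground-truth class labels and the predicted cluster labels as equal-length sequences. Purity rewards clusters dominated by a single class but grows trivially as the number of clusters increases, which is one reason it is typically reported together with NMI.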