Full-Text Download	Subscribe Now Recommend the Paper

	An Improved feature selection based on neighborhood positive approximation rough set in document classification
¹ Mrs. Leena. H. Patil, ² Dr. Mohammed Atique, ^1,Research Scholar,* Department of Computer Science and Engineering, Sant Gadge Baba Amravati University, Amravati, India ^2, Associate Professor, Department of Computer Science and Engineering, Sant Gadge Baba Amravati University, Amravati, India Email: ¹harshleena23@rediffmail.com, ²mohd.atique@gmail.com

Abstract .Feature selection is a challenging problem in the field of machine learning, pattern recognition and data mining. Feature Subset Selection becomes an important preprocessing part in the area of data mining. In rough set theory, the problem of feature selection, called as attribute reduction, aims to retain the discriminatory power of original features. A large number of features is the problem in text categorization. Most of the features are noisy, redundant, relevant or irrelevant noise that can mislead the classifier and it may have different predictive power. Therefore, feature selection is often used in text categorization. It is most important to reduce dimensionality of the data to get smaller subset of features and relevant information within efficient computational time as time complexity is the major issue in feature selection. To deal with these problem many feature selection algorithms are available, still such algorithms are often computationally time consuming, and possess the problem of accuracy and stability. To overcome these problems we developed a framework based on neighborhood positive approximation rough set for feature subset selection in which the size of the neighborhood depends on the threshold value δ. In the proposed framework we obtain several representative and rank preservation of significance measures of attributes. In this paper firstly document preprocessing is performed. Secondly, a neighborhood positive approximation is used to accelerate the attribute reduction. Thirdly result validations based on classifiers are performed. Experimental results show that the improved feature selection based on neighborhood positive approximation rough set model becomes more efficient in terms of the stability, computational time and accuracy in dealing with large datasets.

Keywords : Introduction, document Preprocessing, Feature Sele
Citation: Leena Patil, Mohammed Atique ,"An Improved feature selection based on neighborhood positive approximation rough set in document classification", International Journal of Soft Computing and Software Engineering [JSCSE], Vol. 5, No. 1, pp. 13-30, 2015, Doi: 10.7321/jscse.v5.n1.2
URL: http://dx.doi.org/10.7321/jscse.v5.n1.2

Subscribe Now

Email : * Email
Subscribe to receive free TOC's JSCSE by email

Subscribe

Recommend To Friend

Email : * Email

People