Guru, D. S. and Suhil, Mahamad and Lavanya Narayana Raju and Vinay Kumar, N. (2018) An alternative framework for univariate filter based feature selection for text categorization. Pattern Recognition Letters, 103. 23 - 31. ISSN 1872-7344
Full text not available from this repository. (Request a copy)Abstract
In this paper, we introduce an alternative framework for selecting a most relevant subset of the original set of features for the purpose of text categorization. Given a feature set and a local feature evaluation function (such as chi-square measure, mutual information etc.,) the proposed framework ranks the features in groups instead of ranking individual features. A group of features with rth rank is more powerful than the group of features with (r+1)th rank. Each group is made up of a subset of features which are supposed to be capable of discriminating every class from every other class. The added advantage of the proposed framework is that it automatically eliminates the redundant features while selecting features without requirement of study of features in combination. Further the proposed framework also helps in handling overlapping classes effectively through selection of low ranked yet powerful features. An extensive experimentation has been conducted on three benchmarking datasets using four different local feature evaluation functions with Support Vector Machine and Naïve Bayes classifiers to bring out the effectiveness of the proposed framework over the respective conventional counterparts.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Dimensionality reduction, Filter based feature selection, Text classification, Feature redundancy |
Subjects: | D Physical Science > Computer Science |
Divisions: | Department of > Computer Science |
Depositing User: | Manjula P Library Assistant |
Date Deposited: | 12 Jul 2019 05:51 |
Last Modified: | 12 Jul 2019 05:51 |
URI: | http://eprints.uni-mysore.ac.in/id/eprint/5117 |
Actions (login required)
View Item |