On the Feature Selection Criterion Based on an Approximation of Multidimensional Mutual Information

Kiran S. Balagani and Vir V. Phoha
Abstract—We derive the feature selection criterion presented in [1] and [2] from the multidimensional mutual information between features and the class. Our derivation: 1) specifies and validates the lower-order dependency assumptions of the criterion and 2) mathematically justifies the utility of the criterion by relating it to the Bayes classification error.

Index Terms—Feature selection, entropy, mutual information, Bayes classification error, entropy estimation.
1 INTRODUCTION
Given an initial set of $n$ features, the goal of mutual information based feature selection is to select a subset of $m$ features that maximizes the multidimensional (joint) mutual information [3] between features and the class, given as

$$I(\mathbf{X};\omega) = I(X_1,\ldots,X_m;\omega) = \sum_{\omega}\sum_{X_1,\ldots,X_m} P(X_1,\ldots,X_m,\omega)\,\log\frac{P(X_1,\ldots,X_m,\omega)}{P(X_1,\ldots,X_m)\,P(\omega)}, \qquad (1)$$

where $\mathbf{X}$ is a feature vector, $X_i$ is a feature, and $\omega = \{\omega_1,\ldots,\omega_k\}$ is the class variable.

Hellman and Raviv's equivocation bound [4] shows that the Bayes classification error is upper bounded by $\frac{1}{2}H(\omega|\mathbf{X})$, where $H(\omega|\mathbf{X})$ is the class-conditional entropy. Because $H(\omega|\mathbf{X}) = H(\omega) - I(\mathbf{X};\omega)$ [5], maximizing (1) minimizes Hellman and Raviv's bound on the Bayes classification error, thus justifying its application as a feature selection criterion.

Histograms and continuous kernels are two popular nonparametric "plug-in" estimators [6] of mutual information. However, when the dimensionality is high, estimating (1) with histograms becomes impractical because of its complexity, which grows exponentially with the number of features. On the other hand, estimating (1) with a high-dimensional kernel (see [7]) often demands large training sample sizes, which may be unrealistic for the problem at hand.
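To make the plug-in approach and its exponential cost concrete, here is a minimal sketch (ours, not taken from [1], [2], or [6]) of a histogram estimator of (1); the function name, equal-width binning, and bin count are illustrative assumptions.

```python
import numpy as np
from collections import Counter

def joint_mi_histogram(X, y, bins=4):
    """Histogram plug-in estimate of I(X_1, ..., X_m; omega) in (1).

    X: (n_samples, m) feature matrix; y: (n_samples,) class labels.
    The joint histogram has bins**m cells, so the estimate quickly
    becomes impractical as m grows -- the exponential complexity
    discussed above.
    """
    # Discretize each feature into equal-width bins.
    Xb = np.empty(X.shape, dtype=int)
    for j in range(X.shape[1]):
        edges = np.linspace(X[:, j].min(), X[:, j].max(), bins + 1)
        Xb[:, j] = np.clip(np.digitize(X[:, j], edges[1:-1]), 0, bins - 1)

    n = len(y)
    p_xy = Counter(zip(map(tuple, Xb), y))  # counts for P(X_1..X_m, omega)
    p_x = Counter(map(tuple, Xb))           # counts for P(X_1..X_m)
    p_y = Counter(y)                        # counts for P(omega)

    # I(X; omega) = sum over (x, w) of p(x, w) log [ p(x, w) / (p(x) p(w)) ]
    return sum(c / n * np.log2(c * n / (p_x[x] * p_y[w]))
               for (x, w), c in p_xy.items())
```

With, say, 10 bins and 20 features the joint table already has $10^{20}$ cells, which is why the approximation in (2) below falls back on one- and two-dimensional terms.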
Considering these issues, Battiti [1] and Peng et al. [2] proposed a low-dimensional approximation to (1). The approximation selects features that maximize class-separability and simultaneously minimize dependencies between feature pairs. The approximation is given as

$$I(X_1;\omega) + \sum_{i=2}^{m}\left(I(X_i;\omega) - \sum_{j} I(X_i;X_j)\right), \qquad (2)$$

where $I(X_1;\omega)$ represents the selection of the first feature that maximizes the class-separability, and $\sum_{i=2}^{m}\left(I(X_i;\omega) - \sum_{j} I(X_i;X_j)\right)$ represents the selection of each remaining feature that maximizes class-separability while minimizing its dependency on the previously selected features (the inner sum over $j$ runs over the features selected before $X_i$).
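Read incrementally, (2) suggests a greedy procedure. The sketch below is our illustration, not code from [1] or [2]: it assumes pre-discretized integer features and gives the redundancy sum unit weight, which corresponds to Battiti's MIFS [1] with $\beta = 1$ (Battiti uses a tunable weight $\beta$ on that sum, and Peng et al.'s mRMR [2] averages it over the selected features instead).

```python
import numpy as np
from collections import Counter

def mi_pair(a, b):
    """Plug-in estimate of I(a; b) from two discrete 1-D arrays."""
    n = len(a)
    c_ab, c_a, c_b = Counter(zip(a, b)), Counter(a), Counter(b)
    return sum(c / n * np.log2(c * n / (c_a[x] * c_b[y]))
               for (x, y), c in c_ab.items())

def greedy_select(Xb, y, m):
    """Select m feature indices by maximizing the summand of (2) stepwise."""
    relevance = [mi_pair(Xb[:, i], y) for i in range(Xb.shape[1])]
    selected = [int(np.argmax(relevance))]  # first feature: max I(X_i; omega)
    while len(selected) < m:
        # Score each remaining feature by I(X_i; omega) - sum_j I(X_i; X_j),
        # with j running over the already-selected features.
        candidates = [i for i in range(Xb.shape[1]) if i not in selected]
        scores = [relevance[i] - sum(mi_pair(Xb[:, i], Xb[:, j]) for j in selected)
                  for i in candidates]
        selected.append(candidates[int(np.argmax(scores))])
    return selected
```

Each step requires only one- and two-dimensional histograms, so the cost grows quadratically in the number of features rather than exponentially in $m$.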