Feature Selection from Huge Feature Sets

José Bins
Faculdade de Informática
Pontifícia Universidade Católica (RS)
Porto Alegre, RS 90619-900, Brazil
[email protected]

Bruce A. Draper
Computer Science Department
Colorado State University
Fort Collins, CO 80523, USA
[email protected]
Abstract

The number of features that can be computed over an image is, for practical purposes, limitless. Unfortunately, the number of features that can be computed and exploited by most computer vision systems is considerably smaller. As a result, it is important to develop techniques for selecting features from very large data sets that include many irrelevant or redundant features. This work addresses the feature selection problem by proposing a three-step algorithm. The first step uses a variation of the well-known Relief algorithm [11] to remove irrelevance; the second step clusters features using K-means to remove redundancy; and the third step is a standard combinatorial feature selection algorithm. This three-step combination is shown to be more effective than standard feature selection algorithms for large data sets with many irrelevant and redundant features. It is also shown to be no worse than standard techniques for data sets that do not have these properties. Finally, we show a third experiment in which a data set with 4096 features is reduced to 5% of its original size with very little information loss.
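The three-step pipeline summarized above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the function names, the simplified binary Relief variant, the hand-rolled K-means, and the score threshold are all assumptions made for the sketch, and the third (combinatorial) step is omitted.

```python
import numpy as np

def relief_scores(X, y, n_iters=200, seed=0):
    """Step 1 (irrelevance): simplified binary Relief.

    Repeatedly pick a random instance, find its nearest same-class
    neighbor ("hit") and nearest other-class neighbor ("miss"), and
    reward features that lie close to the hit but far from the miss.
    """
    rng = np.random.default_rng(seed)
    n, m = X.shape
    w = np.zeros(m)
    for _ in range(n_iters):
        i = rng.integers(n)
        same = np.flatnonzero(y == y[i])
        same = same[same != i]                 # exclude the instance itself
        diff = np.flatnonzero(y != y[i])
        hit = X[same[np.abs(X[same] - X[i]).sum(axis=1).argmin()]]
        miss = X[diff[np.abs(X[diff] - X[i]).sum(axis=1).argmin()]]
        w += np.abs(X[i] - miss) - np.abs(X[i] - hit)
    return w / n_iters

def kmeans_representatives(X, keep, scores, k, n_iters=25, seed=0):
    """Step 2 (redundancy): K-means over the kept feature columns
    (each feature is one point in R^n); keep only the highest-scoring
    feature from each cluster."""
    rng = np.random.default_rng(seed)
    F = X[:, keep].T                           # one row per feature
    centers = F[rng.choice(len(F), size=k, replace=False)]
    for _ in range(n_iters):
        d = ((F[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        assign = d.argmin(axis=1)
        for j in range(k):
            if (assign == j).any():
                centers[j] = F[assign == j].mean(axis=0)
    reps = []
    for j in range(k):
        members = np.flatnonzero(assign == j)
        if members.size:
            reps.append(int(keep[members[scores[keep][members].argmax()]]))
    return sorted(set(reps))

# Toy data: feature 0 predicts the class, feature 1 duplicates it
# (redundant), features 2-4 are pure noise (irrelevant).
rng = np.random.default_rng(1)
y = np.repeat([0, 1], 50)
X = rng.uniform(size=(100, 5))
X[:, 0] = y + 0.05 * rng.standard_normal(100)
X[:, 1] = X[:, 0] + 0.05 * rng.standard_normal(100)

w = relief_scores(X, y)
keep = np.flatnonzero(w > 0.1 * w.max())       # crude irrelevance cut
reps = kmeans_representatives(X, keep, w, k=min(2, len(keep)))
```

On this toy data the Relief weights of the two class-correlated features dominate those of the noise features, and the clustering step collapses the redundant pair to a single representative, leaving a small subset that a combinatorial search (step 3) could then refine.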

1. Introduction

The number of features that can be computed over an image is, for all practical purposes, limitless. Examples of common image features include Fourier coefficients, PCA eigenvectors, Zernike moments, Gabor energy functions, wavelet responses, probe sets, and histogram/correlogram features, not to mention simple features like the mean and standard deviation of an image. Moreover, this list of popular features barely scratches the surface; in [23], Tieu and Viola compute 45,000 image features without using any of those mentioned above. The feature set available for object classification is therefore very large. Unfortunately, large feature sets are problematic. Real-time systems cannot afford the time to compute or apply them. More relevant to this paper, statistical model fitting and/or supervised learning systems generally do not have enough labeled training instances to fit accurate models over very large feature spaces, due to finite sample effects [9]. At the same time, in many cases it is difficult or impossible to know without training which features are relevant to a given task, and which are effectively noise. As a result, the ability to select features from a huge feature set is critical for computer vision. Given a feature set of size M, the feature selection problem is to find a feature subset of size n (n