Discriminative Sparse Neighbor Approximation for Imbalanced Learning
Chen Huang, Chen Change Loy, Member, IEEE, and Xiaoou Tang, Fellow, IEEE

All the authors are with the Department of Information Engineering, The Chinese University of Hong Kong, Hong Kong. E-mail: {chuang,ccloy,xtang}@ie.cuhk.edu.hk.
Abstract—Data imbalance is common in many vision tasks where one or more classes are rare. Without addressing this issue, conventional methods tend to be biased toward the majority class and show poor predictive accuracy for the minority class. These methods deteriorate further on small, imbalanced datasets with a large degree of class overlap. In this study, we propose a novel discriminative sparse neighbor approximation (DSNA) method to ameliorate the effect of class imbalance during prediction. Specifically, given a test sample, we first traverse it through a cost-sensitive decision forest to collect a good subset of training examples in its local neighborhood. We then generate from this subset several class-discriminating but overlapping clusters and model each as an affine subspace. From these subspaces, the proposed DSNA iteratively seeks an optimal approximation of the test sample and outputs an unbiased prediction. We show that our method not only effectively mitigates the imbalance issue, but also allows the prediction to extrapolate to unseen data. The latter capability is crucial for achieving accurate prediction on small datasets with limited samples. The proposed imbalanced learning method can be applied to both classification and regression tasks at a wide range of imbalance levels. It significantly outperforms state-of-the-art methods that have no imbalance-handling mechanism, and performs comparably to, or even better than, recent deep learning methods while using handcrafted features only.
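As a rough, non-authoritative illustration of the prediction pipeline summarized above, the sketch below strings together off-the-shelf stand-ins: scikit-learn's class-weighted random forest in place of the cost-sensitive decision forest, plain k-means in place of the class-discriminating cluster generation, and a one-shot least-squares affine projection in place of the iterative DSNA approximation. It is not the authors' implementation; all data, function names, and parameters here are illustrative.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Toy imbalanced data (roughly 95% majority / 5% minority); illustrative only.
X, y = make_classification(n_samples=1000, n_features=20,
                           weights=[0.95, 0.05], random_state=0)

# Stand-in for the cost-sensitive forest: class_weight="balanced" re-weights
# the split criterion in favour of the minority class.
forest = RandomForestClassifier(n_estimators=50, class_weight="balanced",
                                random_state=0).fit(X, y)
train_leaves = forest.apply(X)                   # (n_train, n_trees) leaf ids

def leaf_neighborhood(x):
    """Training samples that share a leaf with x in at least one tree."""
    test_leaves = forest.apply(x.reshape(1, -1))  # (1, n_trees)
    mask = (train_leaves == test_leaves).any(axis=1)
    return X[mask], y[mask]

def affine_residual(x, cluster):
    """Distance from x to the affine hull (mean + span) of a cluster."""
    mu = cluster.mean(axis=0)
    basis = (cluster - mu).T                      # columns span the subspace
    coef, *_ = np.linalg.lstsq(basis, x - mu, rcond=None)
    return np.linalg.norm(basis @ coef - (x - mu))

def predict(x, n_clusters=3):
    Xn, yn = leaf_neighborhood(x)
    k = min(n_clusters, len(Xn))
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(Xn)
    # Choose the cluster whose affine subspace best approximates x and
    # return that cluster's majority label.
    best = min(range(k), key=lambda c: affine_residual(x, Xn[labels == c]))
    return np.bincount(yn[labels == best]).argmax()

print(predict(X[0]))   # smoke test on one (already seen) sample
```

In the actual method, the cluster generation is class-discriminating rather than unsupervised, and the subspace approximation is solved iteratively under a sparsity constraint; this toy projection does not attempt to reproduce either.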
Fig. 1. Data imbalance in (a) age estimation (class density (%) over age, FG-NET and MORPH) and (b) edge detection (number of 35×35 patches per sorted edge cluster). The given datasets characterize the underlying imbalanced distributions that can be seen as intrinsic in these problems.
Index Terms—Imbalanced learning, decision forest, discriminative sparse neighbor approximation, data extrapolation.
I. INTRODUCTION
Data imbalance exists in many vision tasks, ranging from low-level edge detection [1] to high-level facial age estimation [2] and head pose estimation [3]. For instance, in age estimation there are often many more images of the young than of the old in the widely used FG-NET [2] and MORPH [4] datasets. In edge detection, the various image edge structures [5] obey a power-law distribution, as shown in Figure 1. Without handling this imbalance, conventional vision algorithms acquire a strong learning bias towards the majority class and achieve poor predictive accuracy for the minority class, which is usually of equal or greater interest (e.g., rare edges may convey the most important semantic information about natural images). The insufficient learning for the minority class stems from its poor representation by a limited number of examples, or even none at all, especially on small datasets. For instance, the FG-NET age dataset has only 1,002 images in total, of which just 8 show subjects over 60 years old, and certain age classes beyond 60 have no images at all. This reveals a bigger challenge of extrapolating to unseen data from the few available samples.
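To make the tail scarcity discussed above concrete, the following snippet computes per-age-group class densities in the spirit of Fig. 1(a). The age labels are synthetic and hypothetical (the real FG-NET label files are not reproduced here); only the shape of the skew matters for the illustration.

```python
import numpy as np

# Hypothetical, heavy-tailed age labels standing in for an FG-NET-style
# dataset of 1,002 images, skewed toward young subjects.
rng = np.random.default_rng(0)
ages = rng.gamma(shape=2.0, scale=10.0, size=1002).astype(int).clip(0, 69)

bins = np.arange(0, 75, 5)                   # 5-year age groups
counts, _ = np.histogram(ages, bins=bins)
density = 100.0 * counts / counts.sum()      # class density in percent

for lo, hi, d in zip(bins[:-1], bins[1:], density):
    print(f"ages {lo:2d}-{hi - 1:2d}: {d:5.1f}%")
nonzero = density[density > 0]
print("imbalance ratio (max/min nonzero density):", nonzero.max() / nonzero.min())
```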