Two models for Bayesian supervised dimension reduction

BY KAI MAO
Department of Statistical Science, Duke University, Durham NC 27708-0251, U.S.A.
[email protected]

QIANG WU
Department of Mathematics, Michigan State University, East Lansing MI 48824, U.S.A.
[email protected]

FENG LIANG
Department of Statistics, University of Illinois at Urbana-Champaign, IL 61820, U.S.A.
[email protected]

and SAYAN MUKHERJEE
Department of Statistical Science, Institute for Genome Sciences & Policy, Department of Computer Science, Duke University, Durham NC 27708-0251, U.S.A.
[email protected]
Abstract

We study and develop two Bayesian frameworks for supervised dimension reduction that apply to nonlinear manifolds: Bayesian mixtures of inverse regressions and gradient-based methods. Formal probabilistic models with likelihoods and priors are given for both methods, and efficient posterior estimates of the effective dimension reduction space and predictive factors can be obtained by a Gibbs sampling procedure. In the case of the gradient-based method, conditional dependencies between covariates that are predictive of the response can also be inferred. Relations to manifold learning and Bayesian factor models are made explicit. The utility of the approach is illustrated on simulated and real examples.

Some Key Words: Supervised dimension reduction, graphical models, manifold learning, inverse regression, factor models
1 Introduction

Simultaneous dimension reduction and regression, or supervised dimension reduction (SDR), formulates the problem of high-dimensional regression as finding a low-dimensional subspace or manifold that contains the predictive information of the response variable (Li, 1991; Vlassis et al., 2001; Xia et al., 2002; Fukumizu et al., 2003; Li et al., 2004; Goldberger et al., 2005; Fukumizu et al., 2005; Globerson and Roweis, 2006; Martin-Mérino and Róman, 2006; Nilsson et al., 2007; Cook, 2007; Wu et al., 2007; Tokdar et al., 2008; Mukherjee et al., 2009). SDR methods seek to replace the original predictors with projections onto this low-dimensional subspace, which is often called the effective dimension reduction (e.d.r.) space.

A variety of methods for SDR have been proposed. They can be subdivided into three categories: methods based on gradients of the regression function (Xia et al., 2002; Mukherjee and Zhou, 2006; Mukherjee and Wu, 2006; Mukherjee et al., 2009), methods based on inverse regression (Li, 1991; Cook and Weisberg, 1991; Hastie and Tibshirani, 1996b; Sugiyama, 2007; Cook, 2007), and methods based on forward regression (Friedman and Stuetzle, 1981; Tokdar et al., 2008).

In this paper we develop Bayesian methodology for two of these approaches to SDR: the first, called Bayesian mixtures of inverse regression (BMI), extends the model-based approach of Cook (2007); the second, Bayesian gradient learning (BAGL), adapts the gradient estimation methods of Xia et al. (2002), Mukherjee and Zhou (2006), Mukherjee and Wu (2006), and Mukherjee et al. (2009) to a Bayesian model-based setting. In both settings parametric and non-parametric models will be stated or implied. The salient point of both approaches is that they apply to data generated from distributions where the support of the predictive subspace is not a linear subspace of the covariates but is instead a non-linear manifold. The projection is still linear, but it will contain the non-linear manifold that is relevant to prediction.

The underlying model in supervised dimension reduction is as follows: given $p$-dimensional covariates $X$ and a response $Y$,
$$Y = g(b_1' X, \ldots, b_d' X, \varepsilon), \qquad (1)$$
where the span of the vectors $B = (b_1, \ldots, b_d)$ is referred to as the e.d.r. space and $\varepsilon$ is noise. One expects $d \ll p$.
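To make model (1) concrete, the following minimal sketch, which is illustrative only and not the BMI or BAGL procedures developed in this paper, simulates data with a one-dimensional e.d.r. space and recovers the direction $b_1$ using classical sliced inverse regression (Li, 1991) as a baseline; the link function, noise level, and number of slices are arbitrary choices made for the example.

```python
# Illustrative sketch of model (1): Y = g(b'X, eps) with d = 1,
# recovered here by classical sliced inverse regression (Li, 1991).
# This is NOT the paper's BMI/BAGL methodology; settings are arbitrary.
import numpy as np

rng = np.random.default_rng(0)
n, p, d = 500, 10, 1

# True e.d.r. direction b (unit norm) and data from a monotone link g.
b = np.zeros(p); b[0] = b[1] = 1.0
b /= np.linalg.norm(b)
X = rng.standard_normal((n, p))
Y = np.tanh(X @ b) + 0.1 * rng.standard_normal(n)

# Whiten X so that cov(Z) = I: with Sigma^{-1} = W W', setting Z = Xc W
# gives W' Sigma W = I.
Xc = X - X.mean(axis=0)
Sigma = Xc.T @ Xc / n
W = np.linalg.cholesky(np.linalg.inv(Sigma))
Z = Xc @ W

# Slice on Y, average Z within slices, and take the top eigenvectors of
# the covariance of the slice means.
H = 10
order = np.argsort(Y)
slice_means = np.array([Z[idx].mean(axis=0)
                        for idx in np.array_split(order, H)])
M = slice_means.T @ slice_means / H
vals, vecs = np.linalg.eigh(M)          # eigenvalues ascending

# Map the top direction back to the original coordinates: Xc b = Z (W^{-1} b),
# so a direction v in Z-space corresponds to W v in X-space.
b_hat = (W @ vecs[:, -d:])[:, 0]
b_hat /= np.linalg.norm(b_hat)

print("cosine similarity with true b:", abs(b_hat @ b))  # close to 1
```

The cosine similarity is taken in absolute value because the e.d.r. space is identified only as a span, so $b_1$ is recoverable at best up to sign and scale.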