Gaussian Mixtures with Missing Data: an Efficient EM Training Algorithm

Olivier Delalleau, Aaron Courville and Yoshua Bengio
Dept. IRO, Université de Montréal
C.P. 6128, Montreal, Qc, H3C 3J7, Canada
{delallea,courvila,bengioy}@iro.umontreal.ca
http://www.iro.umontreal.ca/~lisa

In data-mining applications, we are frequently faced with a large fraction of missing entries in the data matrix, which is problematic for most discriminant machine learning algorithms. A solution that we explore here is the use of a generative model (a mixture of Gaussians with full covariances) to learn the underlying data distribution and replace missing values by their conditional expectation given the observed variables. Since training a Gaussian mixture with many different patterns of missing values can be computationally very expensive, we introduce a spanning-tree based algorithm that significantly speeds up training in these conditions. Such mixtures of Gaussians can be applied directly to supervised problems (Ghahramani and Jordan, 1994), but we observe that using them for missing value imputation before applying a separate discriminant learning algorithm yields better results.

Our contributions are two-fold:

1. We explain why the basic EM training algorithm is not practical in high-dimensional applications in the presence of missing values, and we propose a novel algorithm that significantly speeds up training by EM. The algorithm we propose relies on re-using the computations performed on one training sample as a basis for the next sample, in order to obtain the quantities required by the EM update equations. We show how these computations can be minimized by ordering samples so that two consecutive samples have similar “missing patterns”, i.e. share missing values for similar variables (see the ordering sketch below). On 28x28 images with random 5x5 squares of pixels set as missing, we obtain a speed-up on the order of 8 compared to standard EM training.

2. We show, both by visual inspection on image data (figure 1) and by feeding the imputed values to another discriminant algorithm (figure 2), how a mixture of Gaussians can model the data distribution so as to provide a valuable tool for missing value imputation (see the imputation sketch below).
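The paper's actual speed-up comes from incrementally updating per-pattern quantities between consecutive samples; the snippet below is only a minimal sketch of the ordering step it relies on, assuming a minimum spanning tree over the distinct missing patterns with Hamming distance as the edge cost. The function name `order_by_missing_pattern` is ours, not from the paper.

```python
import numpy as np
from scipy.sparse.csgraph import minimum_spanning_tree, depth_first_order
from scipy.spatial.distance import pdist, squareform

def order_by_missing_pattern(mask):
    """Return an ordering of the rows of `mask` (boolean missing-value
    indicator matrix, one row per sample) such that consecutive samples
    tend to share similar missing patterns.
    """
    patterns, inverse = np.unique(mask, axis=0, return_inverse=True)
    inverse = inverse.ravel()  # guard against shape differences across numpy versions
    # Pairwise Hamming distance between distinct patterns (fraction of
    # coordinates on which two patterns disagree).
    dist = squareform(pdist(patterns.astype(float), metric="hamming"))
    # Minimum spanning tree over the complete pattern graph, then a
    # depth-first traversal so each visited pattern is close to a recent one.
    mst = minimum_spanning_tree(dist)
    visit, _ = depth_first_order(mst, i_start=0, directed=False)
    # Emit all samples sharing a pattern together, following the traversal.
    return np.concatenate([np.flatnonzero(inverse == p) for p in visit])
```

With such an ordering, consecutive samples mostly differ in a few coordinates, so quantities that depend on the observed-variable block (e.g. factorizations of its covariance) can be updated rather than recomputed from scratch; note that a depth-first traversal occasionally backtracks, so some larger jumps between patterns remain.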
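For the imputation itself, the conditional expectation of the missing block of a sample under each Gaussian component has the usual closed form for a partitioned Gaussian, weighted by component responsibilities computed from the observed entries alone. The following is a minimal numpy/scipy sketch of that standard formulation, not the paper's optimized implementation; `impute_missing` is a name we introduce here.

```python
import numpy as np
from scipy.stats import multivariate_normal

def impute_missing(x, weights, means, covs):
    """Replace NaN entries of x by their conditional expectation under a
    Gaussian mixture with weights (K,), means (K, d), covariances (K, d, d).
    """
    m = np.isnan(x)  # missing mask
    o = ~m           # observed mask
    if not m.any():
        return x.copy()

    K = len(weights)
    log_r = np.empty(K)
    cond_means = np.empty((K, m.sum()))
    for k in range(K):
        mu_o, mu_m = means[k][o], means[k][m]
        S_oo = covs[k][np.ix_(o, o)]
        S_mo = covs[k][np.ix_(m, o)]
        # Responsibility of component k given the observed entries only.
        log_r[k] = np.log(weights[k]) + multivariate_normal.logpdf(
            x[o], mean=mu_o, cov=S_oo)
        # E[x_m | x_o, component k] for a jointly Gaussian (x_m, x_o).
        cond_means[k] = mu_m + S_mo @ np.linalg.solve(S_oo, x[o] - mu_o)

    # Normalize responsibilities in a numerically stable way.
    r = np.exp(log_r - log_r.max())
    r /= r.sum()

    x_imputed = x.copy()
    x_imputed[m] = r @ cond_means  # responsibility-weighted conditional means
    return x_imputed
```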

References

Ghahramani, Z. and Jordan, M. I. (1994). Supervised learning from incomplete data via an EM approach. In Cowan, J. D., Tesauro, G., and Alspector, J., editors, Advances in Neural Information Processing Systems 6, San Mateo, CA. Morgan Kaufmann.

Topic: learning algorithms
Preference: poster


Figure 1: Imputation of missing values in images (left: two digits with missing values shown in grey; right: the corresponding imputations predicted by the mixture of Gaussians).

[Figure 2 plot: test mean-squared error (y-axis, roughly 0.3 to 0.65) versus proportion of missing values (x-axis, 0 to 0.7), with one curve each for the Gaussian mixture, the neural network, and kernel ridge regression.]

Figure 2: Test mean-squared error (Abalone) as the proportion of missing values increases. “Gaussian mixture” is the mixture alone, used directly as a predictor. Both “Neural network” and “Kernel ridge regression” were trained on the conditional mean imputation of missing values provided by the same Gaussian mixture. Combining a discriminant algorithm with the generative Gaussian mixture model works better than using the generative model alone.
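As an illustration of the pipeline behind figure 2 (impute with the mixture, then train a discriminant learner), here is a hypothetical scikit-learn sketch. `X_train`, `y_train`, `X_test`, `y_test`, and `gmm_params` are placeholders, `impute_missing` is the sketch given earlier, and none of this is the authors' actual experimental code.

```python
import numpy as np
from sklearn.kernel_ridge import KernelRidge
from sklearn.metrics import mean_squared_error

# Assumed: X_train / X_test hold NaNs for missing entries, and gmm_params
# is a (weights, means, covs) tuple from a mixture trained by EM.
X_train_imp = np.array([impute_missing(x, *gmm_params) for x in X_train])
X_test_imp = np.array([impute_missing(x, *gmm_params) for x in X_test])

# Discriminant learner on top of the generative imputation.
model = KernelRidge(alpha=1.0, kernel="rbf")
model.fit(X_train_imp, y_train)
print("test MSE:", mean_squared_error(y_test, model.predict(X_test_imp)))
```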
