Predicting Turbulence Using Partial Least Squares Regression and an Artificial Neural Network

Valliappa Lakshmanan (1,2)

1 Cooperative Institute of Mesoscale Meteorological Studies, University of Oklahoma
2 Radar Research and Development Division, National Severe Storms Laboratory

Jan. 2010

Lakshmanan (OU/NSSL) · PLS and NN · 8th Conf. on AI, Atlanta, GA

Introduction

Basic Idea

1. Use partial least squares regression to reduce the number of variables
2. Remove all-zero observations
3. Use an ANN to estimate the probability of turbulence


Dataset

Training dataset: 136 columns, 103,990 rows
Testing dataset: 50,127 rows
Aim: predict turbulence
Challenge: too many variables, and turbulence is rare


Attribute Transformation

Partial Least Squares Regression

PLS regression is like PCA except that, instead of finding the eigenvectors of the attribute matrix, we find the eigenvectors of the covariance of the attribute matrix with the predictand. This makes it a better fit for the classification problem. See [Hastie et al., 2001, Mevik and Wehrens, 2007].


RMS error by number of components

Six components are enough: each component is a linear combination of the 134 attributes (many of whose weights are zero).


Significant Attributes of Significant Components

The attributes with non-zero weights in the six most significant components:

1. STONE Linear
2. RPVORT Linear and RSTAB Linear
3. RICH Linear, ROL Linear, RPVORT Linear, RSTAB Linear, and SATRI Linear
4. ROL Linear, RSTAB Linear, and SATRI Linear
5. PRES SFC Linear, ROL Linear, RSTAB Linear, and STONE Linear
6. dbz DNGood40, dbz DNGood80, NSSL DBZ40 Wedge0 5 mindistance, nssl 18dbz top DNGood0 80, PRES SFC Linear, ROL Linear, RSTAB Linear, SMHGT Linear, STONE Linear, and MSLMA MSL Linear


Significant Attributes

dbz DNGood40
dbz DNGood80
NSSL DBZ40 Wedge0 5 mindistance
nssl 18dbz top DNGood0 80
PRES SFC Linear
RICH Linear
ROL Linear
RPVORT Linear
RSTAB Linear
SATRI Linear
SMHGT Linear
STONE Linear
MSLMA MSL Linear

Note: not in order of significance.


Preprocessing

Remove zeroes

The 103,990 input rows were randomly assigned 2:1 into a training set and a validation set, i.e., 69,327 rows were used for training and 34,663 rows for validation. The transformed data had many rows where all the inputs were zero. About 6% of these all-zero observations were turbulent. All-zero observations were removed from the training and validation sets. At run time, any row whose transformed values are all zero is assigned an output of 0.06.
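The preprocessing above can be sketched as follows; the random stand-in data, variable names, and `predict` helper are illustrative, with `T` holding the transformed components and `y` the turbulence labels:

```python
import numpy as np

rng = np.random.default_rng(2)
# stand-ins for the 103,990 transformed rows and turbulence labels
T = rng.normal(size=(103990, 6))
T[rng.random(103990) < 0.98] = 0.0          # most rows are all-zero, as in the contest data
y = (rng.random(103990) < 0.06).astype(float)

# 2:1 random split into training and validation
idx = rng.permutation(len(T))
n_train = round(2 * len(T) / 3)             # 69,327 rows for training
train, valid = idx[:n_train], idx[n_train:]

# remove all-zero observations from both sets
nonzero = np.any(T != 0.0, axis=1)
train = train[nonzero[train]]
valid = valid[nonzero[valid]]

# at run time, all-zero rows get the climatological probability 0.06
def predict(row, model):
    return 0.06 if not np.any(row) else model(row)
```

The 0.06 fallback is simply the observed turbulence rate among all-zero observations.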


Neural Network

Neural Network Architecture

6 inputs (the 6 transformed components)
1 output (0/1 for turbulence)
3 hidden nodes in one layer
Input and output nodes had a sigmoid transfer function
Hidden nodes had a tanh transfer function
Cross-entropy was minimized so that the predicted value is a probability
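A network of this shape can be sketched with scikit-learn's MLPClassifier, used here as a stand-in for whatever implementation the author actually trained: it has one hidden layer, fixes the output at a logistic (sigmoid) unit, and minimizes log-loss, i.e. cross-entropy, matching the slide (the input-node sigmoid is not reproduced, and the data below is synthetic):

```python
import numpy as np
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(3)
T = rng.normal(size=(2000, 6))                        # the 6 transformed components
y = (T[:, 0] + 0.3 * rng.normal(size=2000) > 1.0).astype(int)

# one hidden layer of 3 tanh nodes; logistic output trained on
# cross-entropy, so predict_proba yields a probability of turbulence
nn = MLPClassifier(hidden_layer_sizes=(3,), activation="tanh",
                   max_iter=2000, random_state=0)
nn.fit(T, y)
p = nn.predict_proba(T)[:, 1]
```

Minimizing cross-entropy rather than squared error is what lets the output be read as a calibrated probability.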


Validation error

No overfitting was observed.


Results and Conclusions

NN Skill on validation set

The neural network outputs are clustered between 0.4 and 0.6, indicating that the network was not able to separate the two classes very well.


Analysis

The neural network was not able to separate the no-turbulence observations from the turbulence observations very well. Only 1-2% of the provided observations were useful data: all-zero observations made up 98-99% of the data. A better prediction algorithm may be possible, but only if there are more usable observations of no-turbulence in the presence of weather.


Sharpness on test dataset

In the absence of ground truth, the only measure we can compute is sharpness.
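Sharpness is a property of the forecasts alone, so it needs no verifying observations. A minimal sketch of one common summary, the spread of the forecast distribution; the forecasts below are made up (clustered in 0.4-0.6 as the validation slide noted), since the actual test-set outputs are not reproduced here:

```python
import numpy as np

rng = np.random.default_rng(4)
# made-up forecast probabilities for the 50,127 test rows
p = np.clip(rng.normal(0.5, 0.05, size=50127), 0.0, 1.0)

# sharpness summary: how much the forecasts spread away from their mean,
# plus a histogram of forecasts over probability bins
hist, _ = np.histogram(p, bins=10, range=(0.0, 1.0))
print("mean:", round(float(p.mean()), 3), "std:", round(float(p.std()), 3))
print(hist)
```

A sharp forecast system would pile probability into the extreme bins; forecasts concentrated near the base rate are unsharp.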


Acknowledgements

Funding for this research was provided under NOAA-OU Cooperative Agreement NA17RJ1227. The author also thanks Gillian Peguero and John Williams for organizing the contest this year. This dataset provided me with the motivation to learn to apply PLS regression.


References

Hastie, T., Tibshirani, R., and Friedman, J. (2001). The Elements of Statistical Learning. Springer.

Mevik, B.-H. and Wehrens, R. (2007). The pls package: Principal component and partial least squares regression in R. Journal of Statistical Software, 18(2):1–24.

