Resampling techniques for statistical modeling - ULB

Recommend Documents

Statistical Signal Decomposition Techniques for Analyzing ... - ULB Bonn

Sep 19, 2014 - Analyzing Time-Variable Satellite Gravimetry Data .... In this thesis, statistical decomposition methods are investigated to perform signal-noise ...

2001: resampling methods for input modeling - CiteSeerX

sampling, bootstrap resampling of empirical distribution functions, and uniformly randomizing empirical distribution functions. Direct resampling uses new ...

Resampling approach to statistical inference ... - Semantic Scholar

Of course, since across-subjects statistical data analy- ses are usually ... In this study, we applied bootstrap analysis to data from an experiment aimed at ...

exponential smoothing and resampling techniques ... - Semantic Scholar

EXPONENTIAL SMOOTHING AND RESAMPLING. TECHNIQUES IN TIME SERIES PREDICTION. Maria Manuela Neves. CEAUL and Mathematics Department, ...

understanding probability and statistical inference through resampling

See http:lljsi.cbs.n1isale-iase.htm for details. UNDERSTANDING PROBABILITY AND STATISTICAL. INFERENCE THROUGH RESAMPLING. Clifford Koriold.

Resampling techniques and neural networks: some recent

such as the bootstrap or the subsampling, which can be used as a simulation tool to ... (4) provided that the integral exists and the optimization problem has a ...

Novel Resampling Improves Statistical Power for ... - Semantic Scholar

Mar 7, 2017 - As a result, statistical power for QTL detection may not be optimal if the QTL ... contribute little to the test statistic can improve statistical power.

Resampling Techniques for Real Estate Appraisals: Testing ... - MDPI

Aug 30, 2018 - With regards to the estimation function, it is assumed as the best ..... For this purpose, the statistical survey concerned six different samples representative of the .... cellar area (SUC), measured in square meters, sqm;. â¡.

Handbook Statistical foundations of machine learning - ULB

Jun 2, 2017 - timedia, web and data mining, computational finance, business intelligence. Outline ...... E[ax + by] = aE[x] + bE[y], a â R,b â R ...... Ac- cording to the hypothesis testing terminology, randomization tests make the null ...... wh

Survey of resampling techniques for improving classification ... - arXiv

Aug 22, 2016 - For instance, there are only a few iPhone models while the number of ... 7. n redundant = 1 (number of redundant features). 8. n clusters per ...

the potential of spectral resampling techniques for the simulation of ...

tral resampling between similarly performing hyperspectral sensors. Some experiments regarding the techniques of spectral simulation for an upcoming ...

RESAMPLING PROBABILITY VALUES FOR

A permutation method is presented to calculate resampling probability values for ... between cognitive and social psychologists on the issue of the continuity hypothesis, i.e. ... ordinal variation (consensus), yielding one-sided approximate prob

PharmacokineticâPharmacodynamic Modeling of Opioids - ULB

sufentanil, each with equilibration half-lives of about 6 min. Methadone ..... 2.1 min. 29.1. 24.7. 4. 3. EEG power spectrum analysis (spectral edge). 18. Alfentanil.

THORAX 3D MODELING FROM COSTOVERTEBRAL JOINT ... - ULB

virtual palpation [6] in a custom-made software called. lhpFusionBox [7] . Kinematics was processed using orientation vector position (OVP) method and helical ...

Pharmacokinetic-Pharmacodynamic Modeling of Alpha ... - ULB

Ramsgate Road, Sandwich, United Kingdom1; LAP&P Consultants, Leiden, The Netherlands2; and. School of Pharmacy and Pharmaceutical Sciences, The ...

Modern Multivariate Statistical Techniques - Journal of Statistical ...

Feb 9, 2009 - are also available in one or other R package, or from elsewhere on the net. At the end of ... practical data analysis, and coding exercises.

Investigating Statistical Techniques for Sentence-Level Event ...

identification and classification of various event types in .... gan in 1998 investigated the development of tech- ..... Educational and Psychological Measurement,.

Application of Multivariate Statistical Techniques for ... - Ceacsu.edu.pk

multivariate methods such as the factor analysis, cluster analysis and discriminant analysis were used. The application multivariable statistical methods offer a ...

Regression-based techniques for statistical ... - Semantic Scholar

or excessively high for Allison and Gorman's (1993) and White,. Rusch, Kazdin ... thus, it is necessary to test whether the p values associated with these coefficients ...... James and improved general approximation tests in the split-plot design.

Statistical Downscaling Techniques for Global ... - Semantic Scholar

2 Dept of Civil and Environmental Engineering, University of Washington ..... (The Yakima River at Parker USGS 12505000), and simulated flood risk for the ...

Using Statistical Techniques for Data Mining

Stock Market provides an ample opportunity to experiment and use various ... our academic course on Data Mining, a review of the various techniques was.

Adaptive Statistical Optimization Techniques for ... - Semantic Scholar

Page 1. 1. Adaptive Statistical Optimization Techniques for. Firewall Packet ... Second, we present a new packet filtering optimization technique that uses ...

Application of Multivariate Statistical Techniques for the ...

Multivariate statistical techniques such as factor analysis (FA), cluster ... interpretation of a large complex water quality data set of three cities (Lahore, ... polluted (MP) and highly polluted (HP) sites, based on the similarity of water quality

development of multivariate statistical techniques for ...

Dec 13, 2014 - Arvisenet, G., L. Billy, P. Poinot, E. Vigneau, D. Bertrand, and C. Prost (2008). Effect of apple particle state on the release of volatile compounds ...

Resampling techniques for statistical modeling - ULB

Download PDF

28 downloads 278 Views 237KB Size Report

Comment

(e.g. the average) for which the analysis is simple. ... conclusions drawn from a parametric analysis. .... Boostrap is a data-based simulation method for statistical.

Resampling techniques for statistical modeling Gianluca Bontempi Département d’Informatique Boulevard de Triomphe - CP 212 http://www.ulb.ac.be/di

Resampling techniques – p.1/41

Rationale All the methods presented so far are strongly based on two assumptions: 1. the form of distribution underlying the data is parametric and known. The only uncertainty concerns the values of some parameters.

2. we focus on parameters (e.g. the mean) and corresponding estimators (e.g. the average) for which the analysis is simple.

Resampling techniques – p.2/41

Rationale(II)

For a generic parameter in a parametric setting, the distribution of the statistic cannot be easily defined. For a generic problem, when only a set of observations are available, the parametric assumption cannot be made without strongly biasing the procedure. Two alternatives: 1. make simpler the complex problem, with the risk of oversimplifying. 2. use computer intensive techniques. Note that, also when the configuration is parametric, computer intensive methods could still be useful to assess the robustness of conclusions drawn from a parametric analysis.

Resampling techniques – p.3/41

Estimation of arbitrary statistics

Consider a set of data points sampled from a one-dimensional distribution of a r.v. .

Var

Bias

Suppose that our quantity of interest is the mean of . It is straightforward to compute the estimate of the mean, the bias and the variance of the estimator:

Consider now another quantity of interest, for example the median or a mode of the distribution. While it is easy to have an estimate of these quantities, their accuracy is difficult to be computed.

variance Var

In other terms,given an arbitrary estimator , the analytical form of the and the bias Bias

is typically not available.

Resampling techniques – p.4/41

Example: the patch data Eight subjects wore medical patches designed to infuse a certain hormone in the blood. Each subject had his hormone levels measured after wearing three different patches: a placebo, an “old” patch and a “new” patch. The goal is to show bioequivalence. In other terms the Food and Drug Administration (FDA) will approve the new patch for sale only if the new patch is bioequivalent to the old one. The FDA criterion is

#

! "

old placebo

new old

Resampling techniques – p.5/41

Example: the patch data (II) subj

plac

old

new

z=old-plac

y=new-old

1

9243

17649

16449

8406

-1200

2

9671

12013

14614

2342

2601

3

11792

19979

17274

8187

-2705

...

...

...

...

...

...

8

18806

29044

26325

10238

-2719

6342

-452.3

.

! "

#

. *+'

/

,

$

0

!

#

-,

% &

*)('

mean:

Estimate

Does it satisfy the FDA criterion? What about its accuracy, bias, variance? Resampling techniques – p.6/41

Jacknife 1

The jacknife (or leave-one-out) resampling technique aims at providing a computational procedure to estimate the variance and the bias of a generic estimator . The technique was first proposed by Quenouille in 1949. The technique is based on removing samples from the available dataset and recalculating the estimator. It is a general-purpose tool which is easy to implement and solves a number of problems.

Resampling techniques – p.7/41

Jacknife for the mean

5

6

5

4

3

2

In order to show the theoretical foundation of the jacknife, we apply first this technique to the estimator of the mean . Suppose we have available a dataset . Let us remove the th sample from and let us calculate the leave-one-out (l-o-o) mean estimate from the remaining samples

and

3

4

3

if we know both

.

4

that is, we can calculate

Observe that the following relation holds

Resampling techniques – p.8/41

as a very

7

!

!

!

8

7

Suppose that we wish to estimate some parameter complex statistic of the data points

Jacknife for an arbitrary statistic

!

!

!

;

9 :

!

!

!

8

4

7

3

We first compute

These psudovalues assume the same role as the sampled mean.

4

3

4

3

4

3

3