Resampling techniques for statistical modeling - ULB
Recommend Documents
Sep 19, 2014 - Analyzing Time-Variable Satellite Gravimetry Data .... In this thesis, statistical decomposition methods are investigated to perform signal-noise ...
sampling, bootstrap resampling of empirical distribution functions, and uniformly randomizing empirical distribu- tion functions. Direct resampling uses new ...
Of course, since across-subjects statistical data analy- ses are usually ... In this study, we applied bootstrap analysis to data from an experiment aimed at ...
EXPONENTIAL SMOOTHING AND RESAMPLING. TECHNIQUES IN TIME SERIES PREDICTION. Maria Manuela Neves. CEAUL and Mathematics Department, ...
See http:lljsi.cbs.n1isale-iase.htm for details. UNDERSTANDING PROBABILITY
AND STATISTICAL. INFERENCE THROUGH RESAMPLING. Clifford Koriold.
such as the bootstrap or the subsampling, which can be used as a simulation tool to ... (4) provided that the integral exists and the optimization problem has a ...
Mar 7, 2017 - As a result, statistical power for QTL detection may not be optimal if the QTL ... contribute little to the test statistic can improve statistical power.
Aug 30, 2018 - With regards to the estimation function, it is assumed as the best ..... For this purpose, the statistical survey concerned six different samples representative of the .... cellar area (SUC), measured in square meters, sqm;. â¡.
Jun 2, 2017 - timedia, web and data mining, computational finance, business intelligence. Outline ...... E[ax + by] = aE[x] + bE[y], a â R,b â R ...... Ac- cording to the hypothesis testing terminology, randomization tests make the null ...... wh
Aug 22, 2016 - For instance, there are only a few iPhone models while the number of ... 7. n redundant = 1 (number of redundant features). 8. n clusters per ...
tral resampling between similarly performing hyperspectral sensors. Some experiments regarding the techniques of spectral simulation for an upcoming ...
A permutation method is presented to calculate resampling probabil- ity values for ... between cognitive and social psychologists on the issue of the continuity hy- pothesis, i.e. ... ordinal variation (consensus), yielding one-sided approximate prob
sufentanil, each with equilibration half-lives of about 6 min. Methadone ..... 2.1 min. 29.1. 24.7. 4. 3. EEG power spectrum analysis (spectral edge). 18. Alfentanil.
virtual palpation [6] in a custom-made software called. lhpFusionBox [7] . Kinematics was processed using orientation vector position (OVP) method and helical ...
Ramsgate Road, Sandwich, United Kingdom1; LAP&P Consultants, Leiden, The Netherlands2; and. School of Pharmacy and Pharmaceutical Sciences, The ...
Feb 9, 2009 - are also available in one or other R package, or from elsewhere on the net. At the end of ... practical data analysis, and coding exercises.
identification and classification of various event types in .... gan in 1998 investigated the development of tech- ..... Educational and Psychological Measurement,.
multivariate methods such as the factor analysis, cluster analysis and
discriminant analysis were used. The application multivariable statistical methods
offer a ...
or excessively high for Allison and Gorman's (1993) and White,. Rusch, Kazdin ... thus, it is necessary to test whether the p values associated with these coefficients ...... James and improved general approximation tests in the split-plot design.
2 Dept of Civil and Environmental Engineering, University of Washington ..... (The Yakima River at Parker USGS 12505000), and simulated flood risk for the ...
Stock Market provides an ample opportunity to experiment and use various ... our academic course on Data Mining, a review of the various techniques was.
Page 1. 1. Adaptive Statistical Optimization Techniques for. Firewall Packet ... Second, we present a new packet filtering optimization technique that uses ...
Multivariate statistical techniques such as factor analysis (FA), cluster ... interpretation of a large complex water quality data set of three cities (Lahore, ... polluted (MP) and highly polluted (HP) sites, based on the similarity of water quality
Dec 13, 2014 - Arvisenet, G., L. Billy, P. Poinot, E. Vigneau, D. Bertrand, and C. Prost (2008). Effect of apple particle state on the release of volatile compounds ...
Resampling techniques for statistical modeling - ULB
(e.g. the average) for which the analysis is simple. ... conclusions drawn from a parametric analysis. .... Boostrap is a data-based simulation method for statistical.
Resampling techniques for statistical modeling Gianluca Bontempi Département d’Informatique Boulevard de Triomphe - CP 212 http://www.ulb.ac.be/di
Resampling techniques – p.1/41
Rationale All the methods presented so far are strongly based on two assumptions: 1. the form of distribution underlying the data is parametric and known. The only uncertainty concerns the values of some parameters.
2. we focus on parameters (e.g. the mean) and corresponding estimators (e.g. the average) for which the analysis is simple.
Resampling techniques – p.2/41
Rationale(II)
For a generic parameter in a parametric setting, the distribution of the statistic cannot be easily defined. For a generic problem, when only a set of observations are available, the parametric assumption cannot be made without strongly biasing the procedure. Two alternatives: 1. make simpler the complex problem, with the risk of oversimplifying. 2. use computer intensive techniques. Note that, also when the configuration is parametric, computer intensive methods could still be useful to assess the robustness of conclusions drawn from a parametric analysis.
Resampling techniques – p.3/41
Estimation of arbitrary statistics
Consider a set of data points sampled from a one-dimensional distribution of a r.v. .
Var
Bias
Suppose that our quantity of interest is the mean of . It is straightforward to compute the estimate of the mean, the bias and the variance of the estimator:
Consider now another quantity of interest, for example the median or a mode of the distribution. While it is easy to have an estimate of these quantities, their accuracy is difficult to be computed.
variance Var
In other terms,given an arbitrary estimator , the analytical form of the and the bias Bias
is typically not available.
Resampling techniques – p.4/41
Example: the patch data Eight subjects wore medical patches designed to infuse a certain hormone in the blood. Each subject had his hormone levels measured after wearing three different patches: a placebo, an “old” patch and a “new” patch. The goal is to show bioequivalence. In other terms the Food and Drug Administration (FDA) will approve the new patch for sale only if the new patch is bioequivalent to the old one. The FDA criterion is
#
! "
old placebo
new old
Resampling techniques – p.5/41
Example: the patch data (II) subj
plac
old
new
z=old-plac
y=new-old
1
9243
17649
16449
8406
-1200
2
9671
12013
14614
2342
2601
3
11792
19979
17274
8187
-2705
...
...
...
...
...
...
8
18806
29044
26325
10238
-2719
6342
-452.3
.
! "
#
. *+'
/
,
$
0
!
#
-,
% &
*)('
mean:
Estimate
Does it satisfy the FDA criterion? What about its accuracy, bias, variance? Resampling techniques – p.6/41
Jacknife 1
The jacknife (or leave-one-out) resampling technique aims at providing a computational procedure to estimate the variance and the bias of a generic estimator . The technique was first proposed by Quenouille in 1949. The technique is based on removing samples from the available dataset and recalculating the estimator. It is a general-purpose tool which is easy to implement and solves a number of problems.
Resampling techniques – p.7/41
Jacknife for the mean
5
6
5
4
3
2
In order to show the theoretical foundation of the jacknife, we apply first this technique to the estimator of the mean . Suppose we have available a dataset . Let us remove the th sample from and let us calculate the leave-one-out (l-o-o) mean estimate from the remaining samples
and
3
4
3
if we know both
.
4
that is, we can calculate
Observe that the following relation holds
Resampling techniques – p.8/41
as a very
7
!
!
!
8
7
Suppose that we wish to estimate some parameter complex statistic of the data points
Jacknife for an arbitrary statistic
!
!
!
;
9 :
!
!
!
8
4
7
3
We first compute
These psudovalues assume the same role as the sampled mean.