Some simpler statistical tests for rejecting outliers in quantitative data*

Large data sets (N > 100): $\sigma = \sqrt{\dfrac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}}$

For small data sets: $s = \sqrt{\dfrac{\sum_{i=1}^{N} (x_i - \bar{x})^2}{N - 1}}$

Rule of Huge Error: If you have a single outlier $x_i$, compute $M = \dfrac{|x_i - \bar{x}|}{s}$. You can discard the outlier with 98% confidence if any of the following conditions are met:

N between 5 and 8: M > 6
N between 8 and 14: M > 5
N of 15 or more: M > 4
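The Rule of Huge Error can be sketched in Python as follows. This is a minimal illustration, not a definitive implementation, and it rests on several assumptions that the handout does not state: the mean and s are computed from the remaining points with the suspect left out (if the suspect were included, M could never exceed (N − 1)/√N, which is below every threshold listed), N counts all points including the suspect, and the range boundaries are read as 5 ≤ N ≤ 8, 9 ≤ N ≤ 14 and N ≥ 15. The function names and the sample data are hypothetical.

```python
import statistics


def huge_error_threshold(n):
    """Threshold for M at about 98% confidence.

    The exact range boundaries are an assumption (5-8, 9-14, 15 and up).
    """
    if n < 5:
        raise ValueError("the rule is only given for N of 5 or more")
    if n <= 8:
        return 6.0
    if n <= 14:
        return 5.0
    return 4.0


def huge_error_test(values, suspect_index):
    """Return (M, threshold, discard) for the suspect at suspect_index.

    Assumption: the mean and sample standard deviation are taken over
    the other points, with the suspect excluded.
    """
    suspect = values[suspect_index]
    rest = values[:suspect_index] + values[suspect_index + 1:]
    mean = statistics.mean(rest)
    s = statistics.stdev(rest)  # sample standard deviation (N - 1)
    m = abs(suspect - mean) / s
    threshold = huge_error_threshold(len(values))
    return m, threshold, m > threshold


# Illustrative data only: the last reading is the suspect.
data = [10.1, 10.2, 9.9, 10.0, 10.1, 9.8, 12.5]
m, threshold, discard = huge_error_test(data, 6)
print(f"M = {m:.2f}, threshold = {threshold}, discard = {discard}")
```

With these seven made-up readings the suspect lies far more than six standard deviations from the mean of the others, so the rule would discard it.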

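Dixon’s Q-test: If you have a single outlier, and your data has a normal distribution, then you can discard the outlier if Q > Qcrit. Order the data values in increasing or decreasing order, such that the outlier is the final data point ($x_N$). The formula for Q depends on the number of data points:

3 ≤ N ≤ 7: $Q = \dfrac{x_N - x_{N-1}}{x_N - x_1}$
8 ≤ N ≤ 10: $Q = \dfrac{x_N - x_{N-1}}{x_N - x_2}$
11 ≤ N ≤ 13: $Q = \dfrac{x_N - x_{N-2}}{x_N - x_2}$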

*Compiled from:
- Personal webpage of Prof. James K. Hardy, Dept. of Chemistry, University of Akron, “Statistical Treatment of Data” at http://ull.chemistry.uakron.edu/analytical/Statistics/. This has good notes on basic statistics, refers to specific tests for the rejection of data, and discusses large and small sample sets.
- “Dixon's Q-test: Detection of a single outlier”, which includes an applet for doing Q-test calculations and a brief discussion on rejecting data from small data sets, on the University of Athens Department of Chemistry website at http://www.chem.uoa.gr/Applets/AppletQtest/Appl_Qtest2.html. Note: although much of the department’s website is in Greek, this page is in English.
- “Statistical Treatment of Analytical Data: Outliers (Chapter 6)” by Z.B. Alfassi, Z. Boger and Y. Ronen. CRC Press: 2005. This chapter is available for reading through Google Books if your library doesn’t have a copy.

Qcrit Values for Dixon's Q-test for Outliers

Data points    Risk of false rejection (%)
N              0.5      1        5        10
3              0.994    0.988    0.941    0.886
4              0.926    0.889    0.765    0.679
5              0.821    0.780    0.642    0.557
6              0.740    0.698    0.560    0.482
7              0.680    0.637    0.507    0.434
8              0.725    0.683    0.554    0.479
9              0.677    0.635    0.512    0.441
10             0.639    0.597    0.477    0.409
11             0.713    0.679    0.576    0.517
12             0.675    0.642    0.546    0.490
13             0.649    0.615    0.521    0.467
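Dixon's Q-test with the Qcrit table above might be coded as in the sketch below. It assumes the standard Dixon ratios for each range of N (gap to the nearest neighbour over the range for N of 3 to 7, with the range shortened by one point for N of 8 to 10, and the gap taken to the second-nearest neighbour for N of 11 to 13). The names `dixon_q`, `q_crit` and `suspect_high`, and the sample data, are hypothetical and chosen only for illustration.

```python
# Qcrit values from the table above, keyed by N.
# Tuple positions correspond to the risk levels in RISKS (percent).
RISKS = (0.5, 1, 5, 10)
Q_CRIT = {
    3: (0.994, 0.988, 0.941, 0.886),
    4: (0.926, 0.889, 0.765, 0.679),
    5: (0.821, 0.780, 0.642, 0.557),
    6: (0.740, 0.698, 0.560, 0.482),
    7: (0.680, 0.637, 0.507, 0.434),
    8: (0.725, 0.683, 0.554, 0.479),
    9: (0.677, 0.635, 0.512, 0.441),
    10: (0.639, 0.597, 0.477, 0.409),
    11: (0.713, 0.679, 0.576, 0.517),
    12: (0.675, 0.642, 0.546, 0.490),
    13: (0.649, 0.615, 0.521, 0.467),
}


def q_crit(n, risk=5):
    """Look up Qcrit for n data points at the given risk of false rejection (%)."""
    return Q_CRIT[n][RISKS.index(risk)]


def dixon_q(values, suspect_high=True):
    """Compute Q for the largest value (or the smallest if suspect_high is False)."""
    # Sort so that the suspect is the final data point, x_N.
    x = sorted(values, reverse=not suspect_high)
    n = len(x)
    if 3 <= n <= 7:
        return (x[-1] - x[-2]) / (x[-1] - x[0])
    if 8 <= n <= 10:
        return (x[-1] - x[-2]) / (x[-1] - x[1])
    if 11 <= n <= 13:
        return (x[-1] - x[-3]) / (x[-1] - x[1])
    raise ValueError("Qcrit is tabulated only for N from 3 to 13")


# Illustrative data only: 11.5 is the suspect.
data = [10.2, 10.3, 10.1, 10.2, 11.5]
q = dixon_q(data)
print(f"Q = {q:.3f}, Qcrit(5%) = {q_crit(len(data))}, discard = {q > q_crit(len(data))}")
```

For these five made-up values Q is 1.2/1.4, roughly 0.857, which exceeds the tabulated Qcrit of 0.642 at a 5% risk of false rejection, so the suspect would be discarded.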

Grubbs’ T-test: This test can be used to evaluate multiple possible outliers. Start with the furthest outlier, $x_i$, and discard it if T > Tcrit, where $T = \dfrac{|x_i - \bar{x}|}{s}$.

If you discard the outlier, and suspect others, then recalculate $\bar{x}$ and s in order to evaluate the next furthest point.

Tcrit Values for Grubbs' T-test for Outliers

Data points    Risk of false rejection (%)
N              0.1      0.5      1        5        10
3              1.155    1.155    1.155    1.153    1.148
4              1.496    1.496    1.492    1.463    1.425
5              1.780    1.764    1.749    1.672    1.602
6              2.011    1.973    1.944    1.822    1.729
7              2.201    2.139    2.097    1.938    1.828
8              2.358    2.274    2.221    2.032    1.909
9              2.492    2.387    2.323    2.110    1.977
10             2.606    2.482    2.410    2.176    2.036
15             2.997    2.806    2.705    2.409    2.247
20             3.230    3.001    2.884    2.557    2.385
25             3.389    3.135    3.009    2.663    2.486
50             3.789    3.483    3.336    2.956    2.768
100            4.084    3.754    3.600    3.207    3.017
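The repeated Grubbs procedure (test the furthest point, discard it if T > Tcrit, then recalculate the mean and s) could look like the sketch below. It uses only the 5% column of the Tcrit table and computes the mean and s over all points currently kept, including the suspect. For values of N that the table does not list, it falls back on the next larger tabulated N, which gives a slightly stricter threshold; that fallback is an assumption of this sketch, not part of the handout. The names `t_crit` and `grubbs_reject` and the sample data are hypothetical.

```python
import statistics

# Tcrit at a 5% risk of false rejection, from the table above, keyed by N.
T_CRIT_5 = {
    3: 1.153, 4: 1.463, 5: 1.672, 6: 1.822, 7: 1.938, 8: 2.032,
    9: 2.110, 10: 2.176, 15: 2.409, 20: 2.557, 25: 2.663,
    50: 2.956, 100: 3.207,
}


def t_crit(n):
    """Tcrit for n points; untabulated n uses the next larger tabulated N (assumption)."""
    if n < 3 or n > 100:
        raise ValueError("Tcrit is tabulated only for N from 3 to 100")
    return T_CRIT_5[min(k for k in T_CRIT_5 if k >= n)]


def grubbs_reject(values):
    """Repeatedly discard the furthest point while T > Tcrit.

    Returns (kept, rejected). The mean and s are recalculated after
    every rejection.
    """
    kept = list(values)
    rejected = []
    while len(kept) >= 3:
        mean = statistics.mean(kept)
        s = statistics.stdev(kept)  # sample standard deviation (N - 1)
        if s == 0:
            break  # all remaining values are identical
        suspect = max(kept, key=lambda x: abs(x - mean))
        t = abs(suspect - mean) / s
        if t <= t_crit(len(kept)):
            break  # the furthest point is not an outlier at this risk level
        kept.remove(suspect)
        rejected.append(suspect)
    return kept, rejected


# Illustrative data only.
data = [10.2, 10.3, 10.1, 10.2, 10.0, 10.3, 10.2, 12.9]
kept, rejected = grubbs_reject(data)
print("kept:", kept)
print("rejected:", rejected)
```

On this made-up data the first pass rejects 12.9; on the second pass the furthest remaining point falls below Tcrit for N = 7, so the loop stops.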