Variance, Standard deviation Exercises: 1. What does variance ...

33 downloads 47 Views 315KB Size Report
scale, if it would be the case it would be the square of the unit of measure. ... 10 variance. 0.00. 1.67. 0.67. 1.33. 333.33 used std deviation. 0.00. 1.29 .... The variance will not change, the disparities between persons are not affected if every  ...
Variance, Standard deviation

Exercises: 1. What does variance measure?

2. How do we compute a variance?

3. What is the difference between variance and standard deviation?

4. What is the meaning of the variance when it is negative?

5. If I add 2 to all my observations, how variance and mean will vary?

6. If I multiply the result of my observations by 3, how variance and mean will vary?

7. What is the impact of one or few values different from the others on the variance?

___________________________________________________________ 5th ISTA seminar on statistics S Grégoire August 1999 2WEBVARI

1

We are not able to compute variance mentally. The variance indicates the variability of a list of values. It is an average distance from the mean on the observations we have. The more different from each other our data are, the greater is the variance. Variability is a fundamental basis for statistics, if there is no variability in the phenomenon, we usually do not need statistics, if we want help from statistics we will have to measure more than one to assess variability of the phenomenon. The variance use the distance of our values from their mean. If the values are grouped near to the mean the variance will be little. Usually the variance is not accompanied with the measure scale, if it would be the case it would be the square of the unit of measure. The standard deviation when we see its formula seems more complicated than the variance (there is a square root); but it is practically easier to understand. It shows how far are the values from the mean on average in the same scale as the measure (meters, number of seeds, weight…) How do we compute a variance? Variance value 1 value 2 value 3 value 4 mean variance std deviation varP std dev P measure measure measure measure mean

from from from from

difference difference difference difference mean

mean mean mean mean

from from from from

value value value value

mean mean mean mean

1 2 3 4

value value value value

1 2 3 4

Case 1 10 10 10 10 10 0.00 0.00 0.00 0.00

Case 2 8.5 9.5 10.5 11.5 10 1.67 1.29 1.25 1.12

Case 3 9 10 10 11 10 0.67 0.82 0.50 0.71

Case 4 9 9 11 11 10 1.33 1.15 1.00 1.00

Case 5 -10 0 20 30 10 333.33 18.26 250.00 15.81

0.00 0.00 0.00 0.00 0

1.50 0.50 0.50 1.50 1

1.00 0.00 0.00 1.00 0.5

1.00 1.00 1.00 1.00 1

20.00 10.00 10.00 20.00 15

0 0 0 0 0

-1.5 -0.5 0.5 1.5 0

-1 0 0 1 0

-1 -1 1 1 0

-20 -10 10 20 0

___________________________________________________________ 5th ISTA seminar on statistics S Grégoire August 1999 2WEBVARI

used used

would have been nice

not appropriate

2

To quantify distance from the mean we must sum distances in the common sense, which are always positive, as if we measure a distance with a wooden rule. If we use the difference between the values and the mean we would obtain a value which would indicate the fact that values are rather lower or rather bigger the mean. (see not appropriate of above table) The easier way would have been to use the average distance from the mean. This is the value, which is related to our common sense, which is equivalent to use a meter and measure distances between points (see would have been nice from above table) This approach was proposed among others to describe variability, but was not kept by mathematicians, it would have simplified our understanding if it had been kept.

From the Web: A demonstration of how is computed a variance

http://ouvaxa.cats.ohiou.edu/~wallace/class/standrd.html

Variance value 1 value 2 value 3

case 1 10 10 10

mean 10 variance 0 std deviation 0

case 2 9 10 11

Case 3 8 10 12

Case 4 4 10 16

Case 5 -10 10 30

10 1 1

10 4 2

10 36 6

10 400 20

the 3 values the average the average the average the average are equal to distance to distance to distance to distance to the mean the the mean is 1 the mean is 2 the mean is 6 the mean is distance from 20 them to the mean is 0

NB: the values had been chosen for the purpose of the simplification of the understanding

___________________________________________________________ 5th ISTA seminar on statistics S Grégoire August 1999 2WEBVARI

3

How does the scale of the values affects variance and mean? Variance

2

3

add 3

multiply 3

90

4

6

80

5

9

6

12

7

15

60

8

18

50

9

21

10

24

11

27

12

30

sum 55 mean 5.5 Variance variance P 8.25 ratio mean ratio variance

75 7.5 9.17 8.25 1.36 1

165 16.5 82.50 74.25 3 9

std deviation 3.03

3.03

9.08

Values

1 2 3 4 5 6 7 8 9 10

70

mean

40

variance

30 20 10 0 Values

add

multiply

• •

If we add a constant to values, the mean will increase of this constant. If we multiply our values by a constant , the mean will be multiplied by this constant.



If we add a constant to values, the dispersion of the values from the mean is not changed, so the variance is not affected and remains the same. If we multiply our values by a constant, the standard deviation is multiplied by this constant, the variance is multiplied by the square of this constant



Example about salaries: Not everyone have the same salary in our laboratory. We can compute a mean salary for the laboratory and a variance of the salary in the laboratory. If every one get a special bonus of 10 000 FF, each salary will be increased by 10 000 FF, as well as the mean salary. The variance will not change, the disparities between persons are not affected if every one has the same bonus.

If we change the currency from FF to Euros, we still have exactly the same amount of money, but both mean and variance are different. This shows the unit of measure has an influence on mean and variance, we can not compare them for objects having different measure units. Usually in germination, purity,… the units used are always the same; but in pathology tests for instance if we compare methods the units of measurements might be different, be aware. ___________________________________________________________ 5th ISTA seminar on statistics S Grégoire August 1999 2WEBVARI

4

From the Web: Put data on a histogram and see how mean and standard deviation are affected. http://www.ruf.rice.edu/~lane/stat_sim/descriptive/index.html

look at effect of sample size on mean and variance http://www.ruf.rice.edu/~lane/stat_sim/sampling_dist/index.html

___________________________________________________________ 5th ISTA seminar on statistics S Grégoire August 1999 2WEBVARI

5