Random Variables and Random Vectors Good Review Materials ...

31 downloads 106695 Views 240KB Size Report
A random variable is associated with a probability distribution over these real ... Only finitely many possible values for the random variable: X ∈ {a1, a2, …, an}.
Good Review Materials http://www.imageprocessingbook.com/DIP2E/dip2e_downloads/review_material_downloads.htm

Random Variables and Random Vectors

• (Gonzales & Woods review materials) • Chapt. 1: Linear Algebra Review • Chapt. 2: Probability, Random Variables, Random Vectors

Tim Marks, Cognitive Science Department

Tim Marks, Cognitive Science Department

Random variables

Discrete random variables

• Samples from a random variable are real numbers

• Discrete random variables have a pmf (probability mass function), P P(X = a) = P(a) • Example: Coin flip 1

– A random variable is associated with a probability distribution over these real values – Two types of random variables • Discrete – Only finitely many possible values for the random variable: X ∈ {a1, a2, …, an} – (Could also have a countable infinity of possible values) » e.g., the random variable could take any positive integer value – Each possible value has a finite probability of occurring.

X = 0 if heads X = 1 if tails – What is the pmf of this random variable?

• Continuous – Infinitely many possible values for the random variable – E.g., X ∈ {Real numbers}

P (a )

0.5

0 0

Tim Marks, Cognitive Science Department

Continuous random variables

• Discrete random variables have a pmf (probability mass function), P 1 P(X = a) = P(a) P (a ) • Example: Die roll

• Continuous random variables have a pdf (probability density function), p • Example: Uniform distribution p(x)

p(1.3) = ?

0 0 1

2

3

4

5

6

a

p(2.4) = ?

What is the probability that X = 1.3 exactly: P(X = 1.3) = ?

0.5

0.5

Tim Marks, Cognitive Science Department

a

Tim Marks, Cognitive Science Department

Discrete random variables

X ∈ {1, 2, 3, 4, 5, 6} – What is the pmf of this random variable?

1

1

2

x

Probability corresponds to area under the pdf. 1.5 P(1 < X < 1.5) = ∫? p(x) dx = 0.25

Tim Marks, Cognitive Science Department

1

–1

Continuous random variables

Random variables

• What is the total area under any pdf ?

• How much change do you have on you?



– Asking a person (chosen at random) that question can be thought of as sampling from a random variable.

∫ p( x)dx = 1?

−∞

• Example continuous random variable: Human heights p(h)

p(h)

• Is the random variable “Amount of change people carry” discrete or continuous?

p(h)

h

h

h

Tim Marks, Cognitive Science Department

Tim Marks, Cognitive Science Department

Random variables: Mean & Variance

An important type of random variable

• These formulas can be used to find the mean and variance of a random variable when its true probability distribution is known. Definition Mean µ

µ = E(X)

Variance

E ((X − µ) 2 )

Var(X)

Discrete r.v.

Continuous r.v.

µ = ∑ ai P (ai )

µ=

2

i

∫ x p(x) dx

−∞

i

∑ (a − µ)





P(ai )

∫ (x − µ)

2

p(x) dx

−∞

i

Tim Marks, Cognitive Science Department

Tim Marks, Cognitive Science Department

The Gaussian distribution p(x) =

1 2π σ

e

Estimating the Mean & Variance −

( x − µ )2 2σ

2

X ~ N(µ, σ 2)

Tim Marks, Cognitive Science Department

– After sampling from a random variable n times, these formulas can be used to estimate the mean and variance of the random variable. • Samples x1, x2, x3, …, xn Estimated mean:

m=

1 n ∑ xi n i=1

Estimated variance:

σ2 =

1 n ∑ (x i − m)2 n i=1

σ2 =

1 ∑ (x i − m)2 ← unbiased estimate n −1 i=1

← maximum likelihood estimate

n

Tim Marks, Cognitive Science Department

–2

Finding mean, variance in Matlab

Example continuous random variable

– Samples x = [x1 x 2 x 3 L x n ] – Mean >> m = (1/n)*sum(x)

• People’s heights (made up)

– Variance 1 σ = [x1 − m n 2

Method 1: Method 2:

 x1 − m    x − m x 2 − m L x n − m]  2  M    x n − m

>> v = (1/n)*(x-m)*(x-m)’ >> z = x-m >> v = (1/n)*z*z’

– Gaussian µ = 67 , σ2 = 20 • What if you went to a planet where heights Gaussian µ = 75 , σ2 = 5 – How would they be different from us?

Tim Marks, Cognitive Science Department

Tim Marks, Cognitive Science Department

Example continuous random variable • Time people woke up this morning – Gaussian µ = 9 , σ2 = 1

Random vectors – An n-dimensional random vector consists of n random variables that are all associated with the same events. – Example 2-D random vector: X  V =  Y 

where X is random variable of human heights Y is random variable of wake-up times

– Sample n times from V. v1 v 2 L v n x1 x 2 L x n    y1 y 2 L y n  Tim Marks, Cognitive Science Department

Random Vectors • What will the graph of V look like?

67 m =  10

Tim Marks, Cognitive Science Department

x (heights)

Mean of a random vector • Estimating the mean of a random vector

Mean – Mean of X is 67 – Mean of Y is 10

y (wake-up times)

Tim Marks, Cognitive Science Department

– n samples from V

• What is mean of V ?

Let’s collect some samples and graph them:

m=

v1 v 2 L v n x1 x 2 L x n    y1 y 2 L y n 

n n 1 1 x  m x  v i = ∑ i  =   ∑ n i=1 n i=1 y i  m y 

– To estimate mean of V in Matlab >> (1/n)*sum(v,2) Tim Marks, Cognitive Science Department

–3

Random vector – Example 2-D random vector: X  V =  Y 

where X is random variable of human heights Y is random variable of human weights v1 v 2 L v n x1 x 2 L x n    y1 y 2 L y n 

• Sample n times from V

– What will graph look like?

Covariance of two random variables • Height and wake-up time are uncorrelated, but height and weight are correlated. • Covariance Cov(X, Y) = 0 for X = height, Y = wake-up times Cov(X, Y) > 0 for X = height, Y = weight – Definition: Cov(X,Y ) = E ((X − µx )(Y − µy )) If Cov(X, Y) < 0 for two random variables X, Y , what would a scatterplot of samples from X, Y look like?

Tim Marks, Cognitive Science Department

Tim Marks, Cognitive Science Department

Estimating covariance from samples • Sample n times: Cov(X,Y ) = Cov(X,Y ) =

x1  y1

Estimating covariance in Matlab – Samples

x2 L xn   y2 L yn 

– Means mx ← m_x my ← m_y

x = [x1 x 2 x 3 L x n ] y = [y1 y 2 y 3 L y n ]

1 n ∑ (x i − mx )(y i − my ) n i=1

← maximum likelihood estimate

1 n ∑ (x i − mx )(y i − my ) n −1 i=1

← unbiased estimate

– Covariance 1 Cov(X,Y ) = [x1 − m x n

x 2 − mx

 y1 − m y    y 2 − my   L x n − mx ]  M    y n − my 

• Cov(X, X) = ?Var(X)

Method 1: >> v = (1/n)*(x-m_x)*(y-m_y)’

• How are Cov(X, Y) and Cov(Y, X) related?

Method 2: >> w = x-m_x >> z = y-m_y >> v = (1/n)*w*z’

Cov(X, Y) = Cov(Y, X) Tim Marks, Cognitive Science Department

Covariance matrix of a D-dimensional random vector

(

X − µX  = E [X − µX Y − µY 

Example covariance matrix • People’s heights (made up)

• In 2 dimensions X  V =  Y  T Cov(V) = E (V − µ )(V − µ )

Tim Marks, Cognitive Science Department

X ~ N(67, 20)

)   Var(X ) Cov(X,Y ) Y − µY ] =   Var(Y)   Cov(X,Y )

• In D dimensions T Cov(V) = E ((V − µ )(V − µ ) ) • When is a covariance matrix symmetric? A. always, B. sometimes, or C. never Tim Marks, Cognitive Science Department

• Time people woke up this morning Y ~ N(9, 1)

• What is the covariance matrix of

X  V =  Y 

?

20 0    0 1

Tim Marks, Cognitive Science Department

–4

Estimating the covariance matrix from samples (including Matlab code) – Sample n times and find mean of samples v1 v 2 L v n

x V = 1 y1

x2 L xn   y2 L yn 

1 Cov(V ) = n

 x1 − m x   y1 − m y

• 1-dimensional Gaussian is completely determined by its mean, µ, and variance, σ 2 :

mx  m=  my 

– Find the covariance matrix

 x1 − mx  x 2 − mx L x n − mx   x 2 − mx  y 2 − my L y n − my   M  x n − mx

>> m = (1/n)*sum(v,2) >> z = v - repmat(m,1,n) >> v = (1/n)*z*z’ Tim Marks, Cognitive Science Department

Gaussian distribution in D dimensions

X ~ N(µ, σ 2) y1 − m y   y 2 − my  M   y n − my 

p(x) =

1 2π σ

e



(x− µ )2 2σ

2

• D-dimensional Gaussian is completely determined by its mean, µ, and covariance matrix, Σ : X ~ N(µ, Σ)

p(x) =

1 (2π )

D/2

Σ

1/ 2

e

− 12 (x−µ )T Σ −1 (x−µ )

–What happens when D = 1 in the Gaussian formula? Tim Marks, Cognitive Science Department

–5