An extension of the skew-generalized normal distribution and ... - Core

1 downloads 0 Views 242KB Size Report
tation, moments and an extension of this new model. Key Words : Asymmetry; Skew-Generalized Normal Distribu- tion; Skew-Normal Distribution.
Proyecciones Journal of Mathematics Vol. 30, No 3, pp. 401-413, December 2011. Universidad Cat´olica del Norte Antofagasta - Chile

An extension of the skew-generalized normal distribution and its derivation OSVALDO VENEGAS ´ UNIVERSIDAD CATOLICA DE TEMUCO, CHILE ANTONIO I. SANHUEZA UNIVERSIDAD DE LA FRONTERA, CHILE and ´ ´ HECTOR W. GOMEZ UNIVERSIDAD DE ANTOFAGASTA, CHILE Received : October 2010. Accepted : September 2011 Abstract In this paper, we introduce a new class of skew-symmetric distributions which are formulated based on cumulative distributions of skew-symmetric densities. This new class is an extension of other skew-symmetric distributions that have already been studied. We give special attention to a family from this class that could be seen as an extension of the skew-generalized-normal model introduced by ArellanoValle et al.(2004). We study the main properties, stochastic representation, moments and an extension of this new model. Key Words : Asymmetry; Skew-Generalized Normal Distribution; Skew-Normal Distribution.

402

Osvaldo Venegas, Antonio I. Sanhueza and H´ector W. G´ omez

1. Introduction It is known that the normal distribution is very useful in different applications in statistics. However, it is also known that this distribution is not appropriate in those cases where symmetry is absent. In this respect the skew-normal family has been studied among others by Azzalini (1985), Henze (1986), Pewsey (2000) and Martinez et al. (2008), and reviewed in the book by Genton (2004). The univariate skew-normal model has the following density function: µ



2 x−µ x−µ )Φ λ , x ∈ R, λ ∈ R, µ ∈ R, σ > 0, φλ (x, µ, σ) = φ( σ σ σ (1.1)

where φ(x) and Φ(x) denote the density function and cumulative distribution function of the standard normal distribution, respectively. We denote the distribution of Azzalini as X ∼ SN (µ, σ, λ). A better family of distributions that shows a better behavior, particularly on the side with smaller mass, than the skew-normal distribution is the Skew-Generalized Normal (SGN) Distribution introduced by Arellano-Valle et al. (2004), which has the following density function: µ



Ã

x−µ λ1 (x − µ) 2 Φ p 2 φλ1 ,λ2 (x) = φ σ σ σ + λ2 (x − µ)2

!

, x ∈ R, λ1 ∈ R, λ2 ≥ 0,

µ ∈ R, σ > 0. (1.2)

We denote this distribution by X ∼ SGN (µ, σ, λ1 , λ2 ). The main objective of this paper is to introduce a new class of asymmetric distributions that contain an unlimited number of skew-symmetric models. We give special attention to a skew model that is an extension of the skew-generalized normal distribution and the skew normal distribution. In Section 2, we define this new extension and call it the Extended Skew Generalized Normal Distribution (ESGN), and also study its basic properties. In Section 3, we develop important probabilistic properties of the new

An extension of the skew-generalized normal distribution and ... 403 family of distribution, including the stochastic representation. In Section 4, we obtain the moments of the random variable with ESGN distribution and finally, Section 5 deals with a location-scale extension of the ESGN distribution and derives a generalization of the ESGN distribution.

2. A Class of Skew Distribution The following Lemma was presented by Azzalini (2005), and is a fundamental result for generating skew-symmetric distributions. Lemma 1. Let g be a probability density function symmetric about zero, and G is a distribution function such that G0 there exist and is a density function symmetric about zero, then (2.1)

f (x) = 2g(x)G(w(x)), −∞ < x < ∞,

is a density function for any odd function w(·). Some subclass of skew-symmetric distributions generated from this lemma have been studied among others by Gupta et al. (2002), Nadarajah and Kotz (2003) and G´ omez et al. (2007). In the following proposition we introduce a new subclass of skew-symmetric models. Proposition 1. Let g be a probability density function symmetric about zero, and Hn(x) be a distribution function, from a skew density function, of the form Hn(x) (m(x)) =

Z m(x)

h(t) dt, with h(t) = 2g(t)G(n(x)t) as

−∞

defined in (3) where n(x) is any odd function, then (2.2)

ϕ(x) = 2g(x)Hn(x) (m(x)), −∞ < x < ∞,

is a density function for any odd function m(·). Proof: Let R(x) = Hn(x) (m(x)) =

Z m(x)

2g(t)G(n(x)t) dt.

−∞

Now, since n(x) and m(x) are odd functions, we have R(−x) =

Z m(−x) −∞

2g(t)G(n(−x)t) dt =

Z ∞

2g(t)G(n(x)t) dt.

m(x)

Then R(x) + R(−x) = 1 and R0 (−x) = R0 (x), which shows that R0 is symmetric about zero. Thus, by using Lemma 1 we prove this proposition. 2

404

Osvaldo Venegas, Antonio I. Sanhueza and H´ector W. G´ omez

Example 1. If we consider Hn(x) (m(x)) to be an extension of the cumulative distribution function from the symmetric density function g, then we obtain an extension of the family studied by Gupta et al. (2002). Example 2. If we consider Hn(x) (m(x)) to be the cumulative distribution function of a skew-symmetric density and g = φ, then we obtain an extension of the family studied by Nadarajah and Kotz (2003). Example 3. If we consider Hn(x) (m(x)) = Φn(x) (m(x)) and g to be any symmetric density function, then we obtain an extension of the family studied by G´ omez et al. (2007). The following example is the main interest of this paper and we present it as a definition. Definition 1. A random variable X is said to have a Extended Skew Generalized Normal Distribution if its density function is given by µ



λ1 x , x ∈ R, λ1 ∈ R, λ2 ≥ 0, λ ∈ R, φλ1 ,λ2 ,λ (x) = 2φ(x)Φµ(x,λ2 ,λ) √ 1 + λ2 x2 (2.3) where

(2.4) Φµ(x,λ2 ,λ)

µ

λ x √ 1 1 + λ2 x2



=

Z √ λ1 x

1+λ2 x2

−∞

2φ(t)Φ(µ(x, λ2 , λ)t) dt,

and √ − λ2 λx (2.5) . µ(x, λ2 , λ) = √ 1 + λ2 x2 + λ2 We denote the Extended Skew Generalized Normal Distribution as X ∼ ESGN (λ1 , λ2 , λ). Remark 1. 1. Let it be considered that g = φ, H = Φ, n(x) = µ(x, λ2 , λ), m(x) = √ λ1 x 2 and by applying Proposition 1 we can prove that 1+λ2 x

(5) is a density function. 2. We can also prove that (5) is a density function by using the result shown by Chiogna (1998), since if Z ∼ SN (λ), then the random variable Y = Φ(hZ + k) has first moment:

An extension of the skew-generalized normal distribution and ... 405

p hλ Eλ [Φ(hZ+k)] = Φµ(h,λ) (k/ 1 + h2 ), with µ(h, λ) = − √ . 1 + h2 + λ2 (2.6)

We show several of the possible shapes obtained from (5) under various choices of (λ1 , λ2 , λ).

Figure 1 : The ESGN (5, 20, −2) (solid line), SGN (5, 20) (dotted line) and SN(5) (dashed line) densities.

Figure 2 : The ESGN (−5, 20, 2) (solid line), SGN (−5, 20) (dotted line) and SN (−5) (dashed line) densities. Remark 2. It should be noted that the mechanism of asymmetry in our model, is presented by the joint action of the parameters λ1 , λ2 and λ. Phenomenon already observed in the model SGN introduced by ArellanoValle et al. (2004). Proposition 2. (a) Let X be a random variable with X ∼ ESGN (λ1 , λ2 , λ). Then

406

Osvaldo Venegas, Antonio I. Sanhueza and H´ector W. G´ omez 1. 2. 3. 4. 5. 6.

φλ1 ,λ2 ,λ=0 (x) = φλ1 ,λ2 (x), i.e. ESGN (λ1 , λ2 , λ = 0) = SGN (λ1 , λ2 ). φλ1 ,λ2 =0,λ (x) = φλ (x), i.e. ESGN (λ1 , λ2 = 0, λ) = SN (λ1 ). φλ1 =0,λ2 ,λ=0 (x) = φ(x), i.e. ESGN (λ1 = 0, λ2 , λ = 0) = N (0, 1). φ−λ1 ,λ2 ,−λ (x) + φ−λ1 ,λ2 ,−λ (x) = 2φ(x), x ∈ R. −X ∼ ESGN (−λ1 , λ2 , −λ). lim φλ1 ,λ2 ,λ (x) = 2φ(x)I{x ≥ 0} and λ1 →+∞

lim φλ1 ,λ2 ,λ (x) = 2φ(x)I{x ≤ 0}, for all λ2 , λ.

λ1 →∞

7.

lim φλ1 ,λ2 ,λ (x) = 2φ(x)Φµ(x,λ) (±sgn(x)), where

λ1 →±∞

1

µ(x, λ) = ∓λ sgn(x). (b) Let Φλ1 ,λ2 ,λ (x) be the CDF of the random variable X ∼ ESGN (λ1 , λ2 , λ). Then: 1. Φλ1 ,λ2 ,λ=0 (x) = Φλ1 ,λ2 (x) 2. Φλ1 ,λ2 =0,λ (x) = Φλ1 (x) 3. Φλ1 =0,λ2 ,λ=0 (x) = Φ(x) 4. Φλ1 ,λ2 ,λ (−x) = 1 − 2Φ(x) + Φλ1 ,λ2 ,λ (x) Proof: The proofs of the items (a) and (b) are obtained directly by using the functions given in (5) and (6) of Definition 1. 2 Remark 3. When λ = 0 in part 4 of Proposition 2(b), we have that: Φλ1 ,λ2 ,0 (−x) = 1 − 2Φ(x) + Φλ1 ,λ2 ,0 (x) = 1 − Φ−λ1 ,λ2 (−x), where Φλ1 ,λ2 (x) is the CDF of the Skew-Generalized Normal Distribution.

3. Some Important Properties We derive important probabilistic properties from the ESGN distribution. The following two propositions show similar results to those obtained from the SGN distribution, where the distribution of the absolute value of a ESGN distribution is half-normal, and the square of the ESGN distribution is a chi-square with one degree of freedom. The Proposition 4 in this section shows the stochastic representation of the ESGN distribution. Proposition 3. If X ∼ ESGN (λ1 , λ2 , λ) and Y ∼ N(0, 1), then the random variables |X| and |Y | are identically distributed Standard Half-Normal.

An extension of the skew-generalized normal distribution and ... 407 Proof: Let W = |X|, then the density function of W is given by: fW (w) = φλ1 ,λ2 ,λ (w) + φλ1 ,λ2 ,λ (−w) ½ µ ¶ µ ¶¾ λ1 w −λ1 w = 2φ(w) Φµ(w,λ2 ,λ) √ + Φ−µ(w,λ2 ,λ) √ 1 + λ2 w2 1 + λ2 w2 = 2φ(w), w > 0, which has the same distribution as |Y |. 2 Corollary 1. If X ∼ ESGN (λ1 , λ2 , λ), then Y = X 2 has a chi-square distribution with 1 degree of freedom, i. e. Y ∼ χ21 . Proposition 4. If X|Y = y ∼ SN (y) and Y ∼ SN (λ1 , λ2 , λ), then X ∼ ESGN (λ1 , λ2 , λ). Proof: fX (x) =

Z ∞

µ



µ µ

y − λ1 y − λ1 2 2φ(x)Φ(yx) √ φ √ Φ λ √ λ2 λ2 λ2 −∞

= 2φ(x)

Z ∞

−∞

p

¶¶

dy

Φ(xλ1 + x λ2 u)2φ(u)Φ(λu) du p

= 2φ(x) Eλ {Φ(xλ1 + x λ2 U )} µ ¶ λ1 x , = 2φ(x) Φµ(x,λ2 ,λ) √ 1 + λ2 x2 √ λ2 λx . 1+λ2 x2 +λ2

by using the expression in (2.6), where µ(x, λ2 , λ) = √ −

2

4. Moments In this section we obtain the n-th moment of the random variable X ∼ ESGN (λ1 , λ2 , λ). In the case that n is even, the moments are derived from the chi-square distribution with 1 degree of freedom. When n is odd, we can derive an implicit expression for the n-th moment of X. In this respect, we denote:

(4.1) an (λ1 , λ2 , λ) = and

Z ∞ n u 0

u √ exp {− }Φµ(√u,λ2 ,λ) 2 2π

Ã

! √ λ1 u √ du, 1 + λ2 u

408

Osvaldo Venegas, Antonio I. Sanhueza and H´ector W. G´ omez

(4.2)

µ

Z ∞ n u

bn (λ1 , λ2 , λ) =

0



λ u √ exp {− }Φ−µ(√u,λ2 ,λ) √ 1 du. 2 λ2 2π

We can show that for all λ2 ≥ 0: 2n 2n an (0, λ2 , 0) = an (0, 0, λ) = √ Γ(n + 1) and bn (0, λ2 , 0) = √ Γ(n + 1). 2π 2π Proposition 5. Let X be a random variable with X ∼ ESGN (λ1 , λ2 , λ). Then we have that: E(X 2k+1 ) = 2{ak (λ1 , λ2 , λ) − ak (0, 0, λ)},

with k = 0, 1, 2, ...

Proof: µ

Z ∞



λ1 x x2k+1 2φ(x)Φµ(x,λ2 ,λ) √ dx 2 −∞ µ1 + λ2 x ¶ Z ∞ −λ1 x 2k+1 = (−x) 2φ(x)Φ−µ(x,λ2 ,λ) √ dx + 2 1+ µ ¶ λ2 x Z ∞0 λ1 x x2k+1 2φ(x)Φµ(x,λ2 ,λ) √ dx 1µ + λ2 x2 0 ∙Z ¶ ∞ λ1 x 2k+1 x 2φ(x)Φµ(x,λ2 ,λ) √ dx =2 1 + λ2 x2 0

E(X2k+1 ) =

=

Z ∞

2k+1

x

¸

φ(x) dx

0 2 [ak (λ1 , λ2 , λ) − ak (0, 0, λ)] .

2

Lemma 2. For any λ1 > 0, λ2 > 0, λ ∈ R and n = 0, 1, 2, ... we have: λ1 an (0, λ2 , λ) ≤ an (λ1 , λ2 , λ) ≤ 4an (0, 0, λ)φ( √ ) − bn (λ1 , λ2 , λ), λ2 where an (λ1 , λ2 , λ) and bn (λ1 , λ2 , λ) are given in (9) and (10), respectively. Also, if λ > 0, we have an (0, 0, λ) ≤ an (0, λ2 , λ). Proof: Using the fact that Φµ(√u,λ2 ,λ) (t) is an increasing function of t, we have ³ ´ ³ ´ λ1 √ √λ1 ≤ Φ , Φµ(√u,λ2 ,λ) (0) ≤ Φµ(√u,λ2 ,λ) √1+λ µ( u,λ ,λ) 2 λ2 2u and then:

An extension of the skew-generalized normal distribution and ... 409 Z ∞ n u 0

u √ exp {− }Φµ(√u,λ2 ,λ) (0) du 2 2π √ Z ∞ n u u λ1 u √ √ √ exp {− }Φµ( u,λ2 ,λ) ( ) du ≤ 2 1 + λ2 u 2π 0 Z ∞ n u u λ √ exp {− }Φµ(√u,λ2 ,λ) ( √ 1 ) du ≤ 2 λ2 2π 0

where Z ∞ n u 0

u λ √ exp {− }Φµ(√u,λ2 ,λ) ( √ 1 ) du 2 λ2 2π Z ∞ n λ u u √ exp {− }Φ( √ 1 ) du =2 2 λ2 2π 0 Z ∞ n u u λ √ exp {− }Φ−µ(√u,λ2 ,λ) ( √ 1 )du, 2 λ2 2π 0

which shows the first part. On Z ∞thenother hand, when λ > 0, u u √ e− 2 Φµ(√u,λ2 ,λ) (0) du 2π 0 µZ 0 ¶ Z ∞ n √ u u √ exp {− } = 2φ(t)Φ(µ( u, λ2 , λ)t) dt du 2 2π 0 −∞ µZ 0 ¶ Z ∞ n u u √ exp {− } 2φ(t)Φ(0) dt du ≥ 2 2π 0 −∞ ½ ¾ Z ∞ n 1 u u √ exp − = du, 2 0 2 2π which shows the second part. 2 Proposition 6. Let X be a random variable with X ∼ ESGN (λ1 , λ2 , λ). Then the moment generating function of X is given by: "

t2 MX (t) = 2 exp{ } E Φµ(y+t,λ2 ,λ) 2

Ã

λ1 (Y + t) p 1 + λ2 (Y + t)2

!#

, Y ∼ N (0, 1).

5. An Extension Class of Densities In this section we extend the definition of (5) to the location-scale family. Also, we introduce a new family of asymmetric normal distributions that contains the Extended Skew Generalized Normal Distribution.

410

Osvaldo Venegas, Antonio I. Sanhueza and H´ector W. G´ omez

Definition 2. A random variable W = µ+σX, where X ∼ ESGN (λ1 , λ2 , λ) is said to have the Location-Scale Extended Skew Generalized Normal Distribution, with density function given by: µ

Ã



w−µ 2 Φµ( w−µ ,λ2 ,λ) fW (w) = φ σ σ σ (5.1)

λ1 (w − µ) p 2 σ + λ2 (w − µ)2

!

, w ∈ R,

λ1 ∈ R, λ2 ≥ 0, λ ∈ R, µ ∈ R, σ > 0, where µ Φµ( w−µ ,λ2 ,λ) √

λ1 (w−µ) σ2 +λ2 (w−µ)2

σ



and µ( w−µ σ , λ2 , λ) are given in (6) and (7), respectively. We denote this location-scale distribution by W ∼ ESGN (µ, σ; λ1 , λ2 , λ). Remark 4. Note that the distribution (11) is a model with five parameters, estimating they can be a problem. But today, due to computing resources, particularly the statistical software, are becoming increasingly efficient, we believe it is necessary support when desired perform this task. Corollary 2. If W |Y = y ∼ SN (µ, σ, y) and Y ∼ SN (λ1 , λ2 , λ), then W ∼ ESGN (µ, σ, λ1 , λ2 , λ). Definition 3. The random variable X with density function

φλ1 ,λ2 ,λ,α (x) = 2φ(x)Φµ(x,λ2 ,λ,α) λ2 ≥ 0, λ ∈ R, α ≥ 0, (5.2) where

Φµ(x,λ2 ,λ,α) (5.3)

Ã

λ1 x p 1 + (α + λ2 )x2

!

=

Ã

λ1 x p 1 + (α + λ2 )x2

Z √

, x ∈ R, λ1 ∈ R,

λ1 x

1+(α+λ2 )x2

−∞

!

2φ(t)Φ(µ(x, λ2 , λ, α)t) dt,

An extension of the skew-generalized normal distribution and ... 411 and √ − λ2 λx µ(x, λ2 , λ, α) = p 1 + (λ2 + α)x2 + λ2 (1 + αx2 )

(5.4)

has a Beyond Skew-Generalized Normal (BSGN) Distribution, which is a generalization of the random variable with the density function given in (5). We denote this generalization as X ∼ BSGN (α, λ1 , λ2 , λ). Remark 5. 1. We note that for α = 0 we reproduce the Extended Skew Generalized Normal Distribution. 2. We may consider g = φ, H = Φ, n(x) = µ(x, α, λ2 , λ), m(x) = √ λ1 x and by applying Proposition 1 we can prove that (12) is 2 1+(α+λ2 )x

a density function. Proposition 7. If X|Y = y ∼ SGN (y, α) and Y ∼ SN (λ1 , λ2 , λ), then X ∼ BSGN (α, λ1 , λ2 , λ). Proof: Z ∞

µ



µ



µ µ

yx 2 y−λ y−λ √ φ √ 1 Φ λ √ 1 2φ(x)Φ √ 2 λ λ λ2 1 + αx −∞ 2 !2 √ Z ∞ Ã (λ1 + λ2 u)x √ Φ 2φ(u)Φ(λu) du = 2φ(x) 1 + αx2 −∞ ( Ã !) √ λ1 x + λ2 xU √ = 2φ(x) Eλ Φ 1 + αx2

fX (x) =

= 2φ(x) Φµ(x,α,λ2 ,λ)

Ã

λ1 x p 1 + (α + λ2 )x2

!

¶¶

dy

, √

− λ2 λx . by using the expression in (2.6), where µ(x, α, λ2 , λ) = √ 1+(λ2 +α)x2 +λ2 (1+αx2 ) 2

Acknowledgments The authors acknowledge helpful comments and suggestions the referee which substantially improved the presentation. The research of O. Venegas was supported by DGIP 2009|2|1. The research of A. I. Sanhueza was supported by FONDECYT 1090265. The research of H. W. G´omez was supported by FONDECYT 1090411.

412

Osvaldo Venegas, Antonio I. Sanhueza and H´ector W. G´ omez

References [1] Arellano-Valle, R. B., G´ omez, H. W. and Quintana, F. A. A new class of skew-normal distributions. Communications in Statistics : Theory and Methods, 33(7), pp. 1465—1480, (2004). [2] Azzalini, A. A class of distributions which includes the normal ones. Scandinavian Journal Statistics, 12, pp. 171—178, (1985). [3] Azzalini, A. The skew-normal distribution and related multivariate families (with discussion). Scandinavian Journal Statistics, 32, pp. 159—188, (2005). [4] Chiogna, M. Some results on the scalar skew-normal distribution. J. Ital. Statist. Soc., 7, pp. 1—13, (1998). [5] Genton, M. G., Ed. Skew-elliptical distributions and their applications: a journey beyond normality. Chapman & Hall/CRC, (2004). [6] G´ omez, H. W., Venegas, O. and Bolfarine H. Skew-symmetric distributions generated by the distribution function of the normal distribution. Environmetrics, 18, pp. 395—407, (2007). [7] Gupta, A. K., Chang, F. C. and Huang, W. J. Some skew-symmetric model. Randon Operators Stochastic Equations, 10, pp. 133—140, (2002). [8] Henze, N. A probabilistic representation of the skew-normal distribution. Scandinavian Journal Statistics, 13, pp. 271—275, (1986). [9] Martinez, E., Varela, H., G´omez, H. W. and Bolfarine, H. A note on the likelihood and moments of the skew-normal distribution. Statistics and Operations Research Transactions (SORT). 32(1), pp. 57-66, (2008). [10] Nadarajah, S., Kotz, S. Skewed distributions generated by the normal kernel. Statistics Probability Letters, 65, pp. 269—277, (2003). [11] Pewsey, A. Problems of inference for Azzalini’s skew-normal distribution. Journal of Applied Statistics, 27(7), pp. 859—870, (2000).

An extension of the skew-generalized normal distribution and ... 413 Osvaldo Venegas Departamento de Cs. Matem´aticas y F´ısicas, Facultad de Ingenier´ıa, Universidad Cat´olica de Temuco, Chile e-mail : [email protected] Antonio I. Sanhueza Departamento de Matem´ atica y Estad´ıstica, Facultad de Ingenier´ıa, Ciencias y Administraci´on, Universidad de La Frontera, Chile e-mail : [email protected] and H´ ector W. G´ omez Departamento de Matem´ aticas, Facultad de Ciencias B´ asicas, Universidad de Antofagasta, Chile e-mail: [email protected]