Local Polynomial Regression Estimator of the Finite ...

3 downloads 0 Views 890KB Size Report
Jan 13, 2017 - Syengo Charles Kilunda (PAUISTI). Thesis Defense. Jan 13, 2017. 1 / 32 ..... Sine: m2(x) = 2 + sin(2πx). Syengo Charles Kilunda (PAUISTI).
Local Polynomial Regression Estimator of the Finite Population Total under Stratified Random Sampling: A Model-Based Approach Syengo Charles Kilunda Supervised by: 1. Prof. R. Odhiambo 2. Dr. G. Orwa MS300-0006/2015

Jan 13, 2017

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

1 / 32

Overview 1

Introduction Background Information Problem Statement Objectives of the Study Main Objective Specific Objectives

2 3

Literature Review Methodology Introduction Local Polynomial Regression Estimation of the unknown function m(.)

The Proposed Estimator Properties of the Proposed Estimator ˆLP is Asymptotically Model-Unbiased Y ˆLP Mean Square Error (MSE) of Y 4

5

Simulation Study Description of population Performance Criteria Results Discussion

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

2 / 32

Introduction

Background Information

Background Information

In sample surveys, the objective is to obtain information about the population, then use this information to make inference about some population quantities. Such information can be collected by either sampling methods or census. In sample surveys, can use auxiliary information at the estimation stage of finite population quantities (totals or mean) to increase precision of estimators of such finite population quantities (Dorfman, 1992). With availability of auxiliary information, nonparametric regression may be used in estimation of finite population totals or means. Nonparametric-based estimation is often more robust and more flexible than inference based on parametric regression models or design probabilities (i.e. design-based inference) (Dorfman, 1992).

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

3 / 32

Introduction

Background Information

Background Information Ctd... Approaches to construction of more efficient estimators:- Model-based and design-based methods. - Model-based approach :- The regression model of y on x is used to predict the non-sampled y 0 s and, consequently their total (or other functions of interest). - Design-based approach :- Here, the probability structure of the procedure by which the sample s is selected serves as the basis for inference. Advantages: Model-based estimators generally ignore sampling probabilities. It also ignores stratum boundaries and also an automatic estimator. In this thesis, focus is on ”Model-based approach” which is based on superpopulation models ξ.

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

4 / 32

Introduction

Problem Statement

Problem statement Over years, scientists have used auxiliary information on a character x to estimate the finite population total or mean of character y under study ´ (e.g. Montanari and Ranalli 2003, 2005; Sanchez-Borrego and Rueda 2009; Rady and Ziedan 2014b,a; Orwa et al. 2010). Most scientists have used estimators based on simple random sampling to estimate population totals Y in a model-based approach in the case of local polynomial regression. From the literature review, no work has been found concerning the development of model-based local polynomial regression estimators of a finite population total Y under stratified random sampling. Recent works by Orwa et al. (2010) and Ngesa et al. (2012) have shown that estimators developed under stratified sampling are better than those based on simple random sampling. Hence need for this particular study.

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

5 / 32

Introduction

Objectives of the Study

Main Objective

To determine and investigate the theoretical properties of a local polynomial regression estimator of the finite population total under stratified random sampling in a model-based approach.

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

6 / 32

Introduction

Objectives of the Study

Specific Objectives

1

To determine a model-based estimator of the finite population total using local polynomial regression under stratified random sampling.

2

To investigate the asymptotic properties (asymptotic model unbiasedness and consistency) of the proposed estimator.

3

To conduct a simulation study to compare the performance of the proposed estimator against existing ones namely Horvitz-Thompson estimator, the Linear regression estimator (Cochran, 1977, p. 200), and the Mixed ratio estimator of Orwa et al. (2010).

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

7 / 32

Literature Review

Literature Review Neyman (1938) work thought as the first attempt to use auxiliary information Cochran (1940), in developing a ratio estimator, also used auxiliary information. Later, Cochran (1977) used auxiliary information to propose regression estimators under stratified random sampling. In most cases, a linear model is assumed, then employed to incorporate auxiliary information in construction of estimators. Sometimes the linear model is inappropriate, and hence the resulting estimators do not beat the purely design-based estimators. Due to this, several researchers have considered nonparametric models. E.g. Nadaraya (1964), Watson (1964), Dorfman (1992) and Chambers et al. (1993) who introduced model-based nonparametric kernel regression estimators. In model-based approach, the model ξ is used to predict the non-sampled values ¯ or total Y . of the population, and hence the population quantities mean Y Breidt and Opsomer (2000) considered nonparametric models within a model-assisted approach and obtained a local polynomial regression estimator as a generalization of the ordinary generalized regression estimator. Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

8 / 32

Literature Review

Literature Review Ctd... ´ Sanchez-Borrego and Rueda (2009) extended Breidt and Opsomer (2000) idea and considered a general working model through a nonparametric class of models ξ within model-based approach to inference. Orwa et al. (2010) proposed a mixed ratio estimator of finite population total under stratified sampling in a model-based approach. Their estimator yields better results as compared to the usual ratio estimator, which is design-based. Their estimator is both asymptotically unbiased and consistent. Most recently, Rady and Ziedan (2014a) has developed a model-based local linear regression estimator of the finite population total in a bivariate case but under simple random sampling. Their estimator beats the classical regression estimator when the model is misspecified, a proof of robustness. In this study, focus is on the estimation of finite population total using local polynomial regression in the presence of one auxiliary variable under stratified random sampling; And this is in the context of model-based approach to inference. Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

9 / 32

Methodology

Introduction

Methodology: Introduction Suppose we have a population of N units i.e. U = 1, 2, . . . . . . ., N. Let s be a random sample of size n selected according to some sampling design, say d, with the first order inclusion probability πi . Let yi be the value of the study variable y , for the i th population element. Let also xi be the auxiliary variate which is available for all population units in U and associated to yi . We define a finite population total as follows Y =

N X i=1

X X yi = yi + yi i∈s

(1)

i∈r

where r is the non-sampled yi0 s. P P The first component yi in Equation (1) is known while the second yi i∈s

i∈r

requires prediction which is of interest in this particular study.

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

10 / 32

Methodology

Local Polynomial Regression

Methodology: Local Polynomial Regression Various nonparametric methods exist for the prediction of the P nonsampled component yi . In this thesis, we focus on local i∈r

polynomial regression, a kernel-based technique, in the context of stratified random sampling. Let the superpopulation model describing the relationship between the survey variable yi and the auxiliary variable xi be defined as yi = m(xi ) + ei

(2)

where the unknown function m(xi ) is assumed to be a smooth function. The error terms ei0 s are assumed to be independently distributed random variables with mean 0 and variance σ 2 (x). Model (2) treats the proxy values yi0 = m(xi ) as the predicted values of the unobserved values yi , i ∈ r .

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

11 / 32

Methodology

Local Polynomial Regression

Estimating m(.) using Local polynomial regression

Let Kb (u) = b−1 K (u/b), where K denotes a continuous kernel function and b is the bandwidth. Consider a Taylor series expansion of the unknown function m(.) i.e. 0

m(xi ) ≈ m(x0 ) + m (x0 )(xi − x0 ) + ..... + m(p) (x0 )(xi − x0 )p

1 p!

(3)

for xi in the neighbourhood of a point of interest, x0 . Note that Taylor’s theorem says that any k−times differentiable or continuous function can be approximated with a polynomial.

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

12 / 32

Methodology

Local Polynomial Regression

Estimating m(.) using Local polynomial regression Define the local kernel weights as wi = Kb (xi − x0 ) At a target value, x0 , the local weighted sum of squares is given by n X

 wi yi − β0 −

p X

2 βj (xi − x0 )

j

(4)

j=1

i=1

In this case, the estimator is ˆ 0 ) = βˆ0 m(x

(5)

T



where βˆ = βˆ0 , βˆ1 , βˆ2 , ....., βˆp is the value of β = (β0 , β1 , β2 , ....., βp )T that minimizes the local weighted sum of squares in equation (4) The solution ˆ β(x) = (XsjT W Tsj Xsj )−1 X Tsj WsjT Ys exists as long as

0

Xsj Wsj0 Xsj

Syengo Charles Kilunda (PAUISTI)

(6)

is invertible.

Thesis Defense

Jan 13, 2017

13 / 32

Methodology

Local Polynomial Regression

Estimating m(.) using Local polynomial regression From Equation (6), a consistent prediction of the unknown m(.) based on the whole finite population is given by ˆ 0 ) = βˆ0 (x) = eT1 (XsjT W Tsj Xsj )−1 X Tsj WsjT Ys = wsjT Ys m(x

(7)

where e1 = (1, 0, 0, ....., 0)T is a column vector of length p + 1; Ys = [yi ]i∈s ; Wsj = diag{Kb (xi − x0 )}i∈s and Xsj = [1, (xi − x0 ), ......, (xi − x0 )p ]i∈s . Equation (7) holds as long as XsjT W Tsj Xsj is a nonsingular matrix. Consequently, the model-based local polynomial regression estimator for Y is given by X X ˆlp = ˆi Y yi + m (8) i∈s

i∈r

where r is the nonsampled set. Now the estimator in Equation (8) is extended to stratified random sampling using one auxiliary variable.

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

14 / 32

Methodology

The Proposed Estimator

The Proposed Estimator Consider a population consisting of N units. Suppose this population is divided into H disjoint strata, each of size Nh , h = 1, 2, ...., H. Let yhj , j = 1, 2, ...., Nh be the survey measurement for the j th unit in the hth stratum. Further, let xhj , j = 1, 2, ...., Nh be the auxiliary measurement positively correlated with yhi . From each stratum, a simple random sample of size nh is selected without replacement, where nh is sufficiently large with respect to Nh and fh = nh /Nh −→ 0. Let sh be the sample in the hth stratum and rh be the nonsampled set in the hth stratum. Define the finite population total as Y =

H X

yhs +

h=1

where yhs =

nh P

yhj and yhr =

j=1 Syengo Charles Kilunda (PAUISTI)

Nh P

H X

yhr

(9)

h=1

yhj .

j=nh +1 Thesis Defense

Jan 13, 2017

15 / 32

Methodology

The Proposed Estimator

yhj0 s

Suppose the distribution generating is given by the superpopulation model, ξ in which yhj = m(xhj ) + ehj (10) where ehj0 s are independently distributed random variables with mean 0 and variance σ 2 (xhj ). Accordingly, E(yhj ) =m(xhj ) (11) ( Cov (yhj , yh0 j 0 ) =

σ 2 (xhj ), 0,

for otherwise

h = h0

and

j = j0

(12)

where σ 2 (x) and m(x) are assumed to be continuous and twice differentiable functions of x, and σ 2 (x) > 0 Adopting Equations (6), (7) and (8), a model-based local polynomial regression estimator of the nonsampled yhj0 s in the hth stratum is therefore given by  −1 ˆ hj = e1T XhjT Whj Xhj m XhjT WhjT y = whjT y (13) where e1 = (1, 0, 0, ....., 0)T is a column vector of length p + 1; y = [yhj ]j∈s ;   h Whj = diag {Kb (xhj − xhi )}j∈s and Xhj = 1, (xhj − xhi ) , ......, (xhj − xhi )p j∈s . h

Equation (13) holds as long as XhjT Whj Xhj is a nonsingular matrix. Syengo Charles Kilunda (PAUISTI)

Thesis Defense

h

Jan 13, 2017

16 / 32

Methodology

The Proposed Estimator

ˆLP and the Denoting the estimator for the finite population total by Y th ˆ estimator within the h stratum by YLPh . Therefore, in stratum h, the estimator of the population total based on local polynomial regression is Nh X

ˆLP = yh + Y s h

ˆ hj m

(14)

j=nh +1

and the estimator for the finite population total is   Nh H H X X X ˆLP = ˆLP = yhs + ˆ hj  Y Y m h h=1

with yhs =

nh P

h=1

(15)

j=nh +1

yhj .

j=1

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

17 / 32

Methodology

Properties of the Proposed Estimator

Properties of the Proposed Estimator

Assumptions made are: (i) (ii) (iii) (iv)

The regression function m(x) has a bounded second derivative. The marginal density, fX (x) is continuous and fX (x) > 0. The conditional variance σ 2 (x) =var(Y /X = x) is bounded and continuous. The kernel density function K (x) is bounded and continuous satisfying: ∞ ∞ ∞ ´ ´ ´ 2 K (x)dx = 1, xK (x)dx = 0, x K (x)dx > 0 and −∞ ∞ ´

−∞

−∞

2t

x K (x)dx < ∞ for t = 1, 2, .......

−∞

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

18 / 32

Methodology

Properties of the Proposed Estimator

ˆLP is Asymptotically Model-Unbiased 1. Y Consider the difference:     H H X H H X X X X X ˆ ˆ hj  −  YLP − Y =  yhs + m yhs + yhj  h=1

h=1 j∈rh

=

H X X

h=1

(16)

h=1 j∈rh

ˆ hj − yhj ) (m

(17)

h=1 j∈rh H X X ˆ hj − mhj ) + (mhj − yhj )) = ((m

(18)

h=1 j∈rh

and taking expectation yields H X H X  X  X ˆLP − Y = ˆ hj − mhj ) + Eξ (m Eξ (mhj − yhj ) Eξ Y h=1 j∈rh

=

(19)

h=1 j∈rh

H X X ˆ hj − mhj ) Eξ (m

(20)

h=1 j∈rh

since Eξ (yhj ) = mhj Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

19 / 32

Methodology

Properties of the Proposed Estimator

Approximating mhj by Taylor series expansion about a point xhj and assuming further that nh −→ ∞ and b −→ 0, then observe that 0

00

ˆ hj ≈ mhj + mhj (xhj − xhi ) + (1/2) mhj (xhj − xhi )2 + ...... m

(21)

Letting u = (xhj − xhi ) /b =⇒ ub = xhj − xhi , then 0

00

ˆ hj ≈ mhj + mhj (ub) + (1/2) mhj (ub)2 + O(b2 ) m 0

(22)

00

ˆ hj − mhj ≈ mhj (ub) + (1/2) mhj (ub)2 + O(b2 ) =⇒ m

(23)

and applying expectations then  0  00 ˆ hj − mhj ) = Eξ (mhj (ub) + (1/2) mhj (ub)2 ) + O(b2 ) E ξ (m

(24)

Theorem 3 of Fan and Gijbels (1996) allows that under conditions (1) − (4) if b −→ 0 and  0 nh b −→ ∞,  00 Eξ mhj (ub) + (1/2) mhj (ub)2 + O(b2 ) −→ 0

ˆ

mhj b Syengo Charles Kilunda (PAUISTI)

00

uKb (u)du + (1/2) mhj b2 Thesis Defense

ˆ u 2 Kb (u)du + O(b2 )

(25) Jan 13, 2017

20 / 32

Methodology

00

= (1/2) mhj b

Properties of the Proposed Estimator

ˆ 2

u 2 Kb (u)du + O(b2 )

So that 

00

ˆ hj − mhj = (1/2) mhj b Eξ m

ˆLP Y

(26)

ˆ 2

u 2 Kb (u)du + O(b2 )

(27)

 ˆ hj − mhj −→ 0 provided that b −→ 0 and nh −→ ∞, and thus =⇒ Eξ m is asymptotically model-unbiased.

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

21 / 32

Methodology

Properties of the Proposed Estimator

ˆLP 2. MSE of Y The estimator (15) has the MSE  2 ˆLP − Y ˆLP ) = Eξ Y MSE(Y

(28)

which can be decomposed as h  i2   ˆLP ) = Bias Y ˆLP ˆLP MSE(Y + Var Y

(29)

Theorem 1 of Fan (1993) allows that under Condition (1), if b = dnh−γ , 0 < γ < 1, then !2 ∞ H P P 00 ´ 4 2 ˆ MSE(YLP ) = b /4 m u Kb (u)du + hj

−∞

h=1 j∈rh

b−1

H X X nh−1 f −1 (xhj )σ 2 (xhj ) h=1 j∈rh

ˆ∞

  Kb2 (u) + O b4 + (nh b)−1

(30)

−∞

Observe that Equation (30) tends to zero if b −→ 0 and nh b −→ ∞ and thus ˆLP ) −→ 0. MSE(Y ˆLP is statistically consistent. =⇒ Y Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

22 / 32

Simulation Study

Description of population

Simulation Study: Description of population Four estimators are compared in this section: ˆHT Y ˆ YREG ˆPE Y ˆLP Y

Horvitz-Thompson Linear regression Mixed Ratio Local polynomial with p = 1

Horvitz and Thompson (1952) (Cochran, 1977, p. 200) Orwa et al. (2010) Equation (15)

Table: Estimators being compared in the Simulation study Simulated populations are used. Working model is taken to be E(yhj ) = m(xhj ), Cov (yhj , yh0 j 0 ) = σ 2 . Four populations are considered in this study; They are generated from the regression model below: yi = m(xi ) + ei 1 ≤ i ≤ 2, 000 with the following mean functions

(31)

Linear: m1 (x) = 1 + 2(x − 0.5) Sine: m2 (x) = 2 + sin(2πx) Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

23 / 32

Simulation Study

Description of population

Bump: m3 (x) = 1 + 2(x − 0.5) + exp(−200(x − 0.5)2 ) Jump: m4 (x) = 1 + 2(x − 0.5)I{x≤0.65} + 0.65I{x>0.65} with x ∈ [0, 1]. The four models represent a class of correct and incorrect model specifications for the estimators being considered. These populations are plotted in Fig. 1. Sine

2.5

0.0

1.0

1.5

2.0

m2

1.0 0.5

m1

1.5

3.0

2.0

Linear

0.2

0.4

0.6

0.8

1.0

0.0

0.2

0.4

0.6

x

x

Bump

Jump

0.8

1.0

0.8

1.0

m4

0.0

1.3

1.4

0.5

1.5

1.6

1.0

m3

1.7

1.8

1.5

1.9

2.0

2.0

0.0

0.0

0.2

0.4

0.6 x

0.8

1.0

0.0

0.2

0.4

0.6 x

Thesis Bump Defense and Jump populations Figure: Plot of Linear, Sine,

Syengo Charles Kilunda (PAUISTI)

Jan 13, 2017

24 / 32

Simulation Study

ei0 s ∼ i.i.d N 0, 0.12

Description of population



The populations yi0 s are generated from the mean functions by adding the errors ei0 s in each of the cases. R Software is used in Data simulations and computations. Each of the populations is partitioned into 10 equal, disjoint and mutually exclusive strata. A sample of size, n = 200 is then taken with each stratum contributing a sample size of nh = 20, (h = 1, 2, ...., 10). Epanechnikov kernel, K (u) =

 3 1 − u 2 I{|u|≤1} , 4

is used for kernel smoothing. Bandwidth values b = n−1/5 (with n = 200), b = 0.4, b = 1 and b = 2 are considered.

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

25 / 32

Simulation Study

Performance Criteria

Performance criteria Relative absolute bias (RAB) is computed as   R θ(s X ˆ i) − Y ˆ RAB(θ) = Y i=1

(32)

and Relative efficiency (RE) with respect to the Horvitz-Thompson (HT ) estimator is computed as 2 R  P ˆ i) − Y θ(s ˆ = i=1 RE(θ) (33) 2 ) R  P ˆHT (si ) − Y Y i=1

θˆ is the estimator of the finite population total being considered; Y is the true population total and R is the number of replications. RE is meant to test the robustness of the various estimators. Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

26 / 32

Simulation Study

Results

Simulation study: Results Yˆ HT

ˆ Y REG

Yˆ PE

Yˆ LP

Population

b

RAB

RE

RAB

RE

RAB

RE

RAB

Linear

0.3465724 0.4 1 2

0.03212401 0.03212401 0.03212401 0.03212401

1 1 1 1

0.005778929 0.005778929 0.005778929 0.005778929

0.03155733 0.03155733 0.03155733 0.03155733

0.03321496 0.0335352 0.03434122 0.03272264

1.067811 1.089573 1.144951 1.037753

0.03201888 0.0320533 0.03210449 0.03212023

0.995 0.996 0.999 0.999

Sine

0.3465724 0.4 1 2

0.01855193 0.01855193 0.01855193 0.01855193

1 1 1 1

0.03836453 0.03836453 0.03836453 0.03836453

4.286723 4.286723 4.286723 4.286723

0.02072086 0.02082649 0.0201947 0.01895357

1.243534 1.255919 1.183826 1.043951

0.01657321 0.01685303 0.01810882 0.0184607

0.799 0.826 0.957 0.990

Bump

0.3465724 0.4 1 2

0.03109618 0.03109618 0.03109618 0.03109618

1 1 1 1

0.01449569 0.01449569 0.01449569 0.01449569

0.2130984 0.2130984 0.2130984 0.2130984

0.03243536 0.03289121 0.03357809 0.03165829

1.085912 1.116063 1.165075 1.036739

0.03100986 0.03319303 0.0321397 0.03106365

0.993 1.123 1.061 0.998

Jump

0.3465724 0.4 1 2

0.004845022 0.004845022 0.004845022 0.004845022

1 1 1 1

0.02483609 0.02483609 0.02483609 0.02483609

26.07389 26.07389 26.07389 26.07389

0.005616896 0.0056205 0.005181882 0.004852543

1.353566 1.35023 1.155266 1.006773

0.007676967 0.007750974 0.005505162 0.004872778

2.274 2.329 1.259 1.006

R

Table: Relative absolute bias (RAB) and Relative efficiency (RE) based on 1000 replications of simple random sampling within strata from four fixed populations of size N = 2000. Sample size is n = 200. Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

27 / 32

Discussion

Discussion of Results ˆREG , performs best when the model is The parametric estimator, Y correctly specified i.e. In Linear and the Bump populations (In Linear, a strong linear relationship holds between the variables while in Bump, the function is linear over most of its range despite a “bump” for a small part of the range of xhi0 s). When the model is completely misspecified (i.e. in Sine and Jump), a greater efficiency can be achieved by the nonparametric regression estimators and Horvitz-Thompson estimator. ˆLP emerges as the best performing among the nonparametric Y estimators being considered. As the bandwidth size increases, some amount of efficiency is lost in the nonparametric estimators. ˆLP , with smaller values of A good overall performance is achieved with Y ˆPE for every population RAB and RE than the model-based competitor Y and fixed bandwidth under consideration. ˆLP is consistently good as long as the The overall performance of Y bandwidth remains small in this particular study. Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

28 / 32

Conclusion

Conclusion

Generally, use of the proposed estimator leads to relatively smaller values of RE compared to other estimators. Nonparametric regression approach under stratified random sampling using the proposed estimator yields good results.

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

29 / 32

Recommendations

Recommendations

(i) Bandwidths used in this study were predetermined; Need to investigate the performance of the proposed estimator under optimal data-driven bandwidths. (ii) One auxiliary variable was used; Estimation with two or more auxiliary variables under stratified sampling need to be investigated and the performance of the resulting estimator determined against other rival estimators under stratified random sampling. (iii) Error variances were treated as homoscedastic; Need to investigate the performance of the estimators when the error variances are functions of an auxiliary variable (i.e. heteroscedastic).

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

30 / 32

References

References Breidt, F. J. and Opsomer, J. D. (2000). Local polynomial regression estimators in survey sampling. The Annals Of Statistics, 28:1026–1053. Chambers, R. L., Dorfman, A. H., and Wehrly, T. E. (1993). Bias robust estimation in finite populations using nonparametric calibration. Journal of American Statistical Association, 88:268–277. Cochran, W. G. (1940). The estimation of the yields of the cereal experiments by sampling for the ratio of grain to total produce. Journal of Agricultural science, 30:262–275. Cochran, W. G. (1977). Sampling Techniques. J. Wiley, New York. Dorfman, A. H. (1992). Nonparametric regression for estimating totals in finite population. in section on survey research methods. Journal of American Statistical Association, pages 622–625. Fan, J. (1993). Local linear regression smoothers and their minimax efficiencies. The Annals Of Statistics, 21(1):196–216. Fan, J. and Gijbels, I. (1996). Local Polynomial Modelling and Its Applications. Chapman and Hall, London. Horvitz, D. G. and Thompson, D. J. (1952). A generalization of sampling without replacement from a finite Thesis universe. Journal of American Statistical Syengo Charles Kilunda (PAUISTI) Defense Jan 13, 2017 31 / 32

THANK YOU

Syengo Charles Kilunda (PAUISTI)

Thesis Defense

Jan 13, 2017

32 / 32

Suggest Documents