• Rezultati Niso Bili Najdeni

University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination September 3

N/A
N/A
Protected

Academic year: 2022

Share "University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination September 3"

Copied!
7
0
0

Celotno besedilo

(1)University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination September 3rd , 2020. ID number:. io n. Instructions. s. Name and surname:. Read carefully the wording of the problem before you start. There are four problems altogeher. You may use a A4 sheet of paper and a mathematical handbook. Please write all the answers on the sheets provided. You have two hours.. a.. b.. c.. So. lu t. Problem 1. 2. 3. 4. Total. •. d. • • •.

(2) Methodology of Statistical Research, 2019/2020, M. Perman. M. Pohar-Perme. 1. (25) Assume that every unit in a population of size N has two values of statistical variables. Denote these pairs of values by (x1 , y1 ), (x2 , y2 ), . . . , (xN , yN ). The average of all the values N 1 X λ= (xk + yk ) . 2N k=1 is to be estimated. If the k-th unit is selected, she responds with the value xk with probability 21 , and with value yk with probability 12 independently of other units and independently of the sampling procedure. The pollsters do not know which of the two values is given. Assume that a simple random sample of size n is selected from the population. The quantity λ is estimated by the sample average. The estimator is expressed as N. 1X Ik (xk Jk + yk (1 − Jk )) , λ̂ = n k=1 where Ik is the indicator that the k-th unit is selected, and Jk is the indicator that the k-th unit’s response is xk . The assumptions imply that the vectors (I1 , . . . , IN ) and (J1 , . . . , JN ) are independent, and that the indicators J1 , . . . , Jn are independent. a. (5) Show that the estimator λ̂ is unbiased. Solution: Use independence and linearity of the expected value to get N. 1X E(λ̂) = E(Ik ) (xk E(Jk ) + yk E(1 − Jk )) = λ . n k=1 b. (10) Show that for k = 1, 2, . . . , N n var (Ik (xk Jk + yk (1 − Jk ))) = N. . x2k + yk2 2. . n2 − 2 N. . xk + yk 2. Solution: From simple random sampling we know that E(Ik ) = that   n xk + y k E [Ik (xk Jk + yk (1 − Jk ))] = . N 2. n . N. 2. This implies. Using the facts that Ik2 = Ik , Jk2 = Jk and Jk (1 − Jk ) = 0 we get     E Ik2 (xk Jk + yk (1 − Jk ))2 = E Ik x2k Jk + yk2 (1 − Jk )   n x2k + yk2 . = N 2 The formula for the variance follows. 2. ..

(3) Methodology of Statistical Research, 2019/2020, M. Perman. M. Pohar-Perme. c. (10) Show that for k 6= l cov (Ik (xk Jk + yk (1 − Jk )), Il (xl Jl + yl (1 − Jl ))) =. n(n − 1) (xk + yk )(xl + yl ) . 4N (N − 1). Solution: From simple random sampling we know that cov(Ik , Il ) = −. n(N − n) . N 2 (N − 1). This implies that E(Ik Il ) = −. n(N − n) n2 n(n − 1) + = . N 2 (N − 1) N 2 N (N − 1). Use the linearity of expected value and independence assumptions to compute E [(Ik (xk Jk + yk (1 − Jk )) (Il (xl Jl + yl (1 − Jl ))] xk yl xl y k yk yl xk xl E(Ik Il ) + E(Ik Il ) + E(Ik Il ) + E(Ik Il ) = 4 4 4 4 n(n − 1) = (xk + yk )(xl + yl ) . 4N (N − 1). 3.

(4) Methodology of Statistical Research, 2019/2020, M. Perman. M. Pohar-Perme. 2. (25) Let the observed values x1 , x2 , . . . , xn be generated as independent, identically distributed random variables X1 , X2 , . . . , Xn with distribution (θ − 1)x−1 θx. P (X1 = x) = for x = 1, 2, 3, . . . and θ > 1.. a. (10) Find the MLE estimate of θ based on the observations. Solution: We find `(θ, x) =. n X. ! xk − n log(θ − 1) −. k=1. n X. ! xk. log θ .. k=1. Taking the derivative we have 0. Pn. ` (θ, x) =. xk − n − θ−1. k=1. Pn. k=1. xk. θ. = 0.. It follows that n. 1X θ̂ = xk = x̄ . n k=1 b. (15) Write an approximate 99%-confidence interval for θ based on the observations. Assume as known that ∞ X 1 xax−1 = (1 − a)2 x=1 for |a| < 1. Solution: We have. x x−1 + 2. 2 (θ − 1) θ To find the Fisher information we need ∞ X (θ − 1)x−1 x E(X1 ) = . x θ x=1 `00 (θ, x) = −. Using the hint we get  −2 1 θ−1 E(X1 ) = · 1 − = θ. θ θ We have. 1 . θ(θ − 1) An approximate 99%-confidence interval is s θ̂(θ̂ − 1) θ̂ ± 2.56 · . n I(θ) =. 4.

(5) Methodology of Statistical Research, 2019/2020, M. Perman. M. Pohar-Perme. 3. (25) Assume that the observed values x1 , x2 , . . . , xm and y1 , y2 , . . . , yn were created as independent random variables X1 , X2 , . . . , Xm and Y1 , Y2 , . . . , Yn with Xk ∼ exp(µ) for k = 1, 2, . . . , m and Yk ∼ exp(ν) for k = 1, 2, . . . , n. The hypothesis H0 : µ = ν. H1 : µ 6= ν. versus. is to be tested. Assume that µ, ν > 0. a. (15) Find the Wilks likelihood ratio statistics λ for this testing problem. Solution: The log-likelihood functions is `(µ, ν|x, y) = m log µ − µ. m X. xk + n log ν − ν. k=1. n X. yk .. k=1. If µ and ν can vary freely the maximum is attained at m µ̂ = Pm. k=1. xk. =. 1 x̄. and. n ν̂ = Pn. k=1. yk. =. 1 . ȳ. Evaluating the log-likelihood function at the MLE estimates gives `(ν̂, µ̂|x, y) = m log µ̂ − m + n log ν̂ − n . If ν = µ the MLE turns out to be Pn µ̃ = ν̃ =. k=1. P xk + nk=1 yk m+n. and `(µ̃, ν̃|x, y) = (m + n) log µ̃ − m − n . It follows that λ = 2m log µ̂ + 2n log ν̂ − 2(m + n) log µ̃ .. b. (5) What is the approximate distribution of the Wilk’s likelihood statistics? Solution: Bt Wilks’ theorem the approximate distribution is χ2 (1).. 5.

(6) Methodology of Statistical Research, 2019/2020, M. Perman. M. Pohar-Perme. 4. (25) Assume the following regression model Yi1 = βxi1 + i Yi2 = βxi2 + ηi for i = 1, 2, . . . , n. Assume that the pairs (1 , η1 ), . . . , (n , ηn ) are independent and identically distributed with E(i ) = E(ηi ) = 0, var(i ) = var(ηi ) = σ 2 and corr(i , ηi ) = ρ. Assume that ρ is known. a. (5) Let. Pn (Y x + Yi2 xi2 ) Pn i1 i1 . β̂ = i=1 2 2 i=1 (xi1 + xi2 ). Is this estimator unbiased? Compute its standard error. Solution: All the estimators in the sequel are of the form n X β̂ = (ai Yi1 + bi Yi2 ) i=1. for suitable ai and bi . We have E(β̂) = β. n X. (ai xi1 + bi xi2 ). i=1. and var(β̂) =. n X. var(ai Yi1 + bi Yi2 ) = σ 2. i=1. n X. (a2i + b2i + 2ρai bi ) .. i=1. Plugging in the respective ai and bi we find that all the estimators are unbiased and we derive the formulae for standard errors. b. (5) Adding we get Yi1 + Yi2 = β(xi1 + xi2 ) + ξi , where ξi = i + ηi . The terms ξ1 , . . . , ξn are uncorrelated with E(ξi ) = 0 and var(ξi ) = σ 2 (2 + ρ). The parameter β can be estimated as Pn (Yi1 + Yi2 )(xi1 + xi2 ) β̂ = i=1Pn . 2 i=1 (xi1 + xi2 ) Is this estimator unbiased? Compute ist standard error. Solution: See a. c. (5) Replace for each i = 1, 2, . . . , n the second equation by     Yi2 − ρYi1 xi2 − ρxi1 ηi − ρi =β + . 2(1 − ρ) 2(1 − ρ) 2(1 − ρ) 6.

(7) Methodology of Statistical Research, 2019/2020, M. Perman. M. Pohar-Perme. Denote Ỹi2 =. Yi2 − ρYi1 2(1 − ρ). in x̃i2 =. xi2 − ρxi1 . 2(1 − ρ). Estimate β by Pn (Y x + Ỹi2 x̃i2 ) Pn i1 2i1 β̂ = i=1 . 2 i=1 (xi1 + x̃i2 ) Is this estimate unbiased? Compute its standard error. Solution: See a. d. (10) Which of the above estimators has the smallest standard error? Explain. Solution: Let η̃i =. ηi − ρi . 2(1 − ρ). This random variable is uncorrelated with i and E(η̃i ) = 0 and var(η̃i ) = σ 2 . The model in c. satisfies all the assumptions of the Gauss-Markov theorem which means that the estimator in c. is the best linear unbiased estimator of the parameters.. 7.

(8)

Reference

POVEZANI DOKUMENTI

University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination June 29th , 2021.. Name

Let Ik be the indicator of the event that the k-th unit is selected and Ik,1 the indicator that the k-th unit will respond YES,YES.. Let Ik,3 be the indicator of the event that the

Solution: By independence the likelihood function is equal to Lθ = 2n1 θ2n0 +n1 1 − θn1 +2n2 where nk is the number of occurences of k among the observed values.. We

11: Thunder Cave, Cathedral Cave, Wellington Caves, New South Wales, Australia a pair of hemispherical cupolas with an elliptical conical cupola the “Politicianʼs Ear”

Rekli bi lahko, da če transfer poveže delo subjekta, za katerega se predpostavlja, da ve, s fiksacijo simptoma, potem končna identifikacija s sim - ptomom razveže simptom od

The investigated alloy is a ferromagnetic material, exhibiting soft magnetic properties, with a relatively high value of the saturation magnetization and a small value of

In connection with the generally insufficiency of experimental data for the statistical analysis, as in case of creep static loading, it is expedient to evaluate the value of

The goal of the research: after adaptation of the model of integration of intercultural compe- tence in the processes of enterprise international- ization, to prepare the