• Rezultati Niso Bili Najdeni

University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination January 28

N/A
N/A
Protected

Academic year: 2022

Share "University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination January 28"

Copied!
9
0
0

Celotno besedilo

(1)University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination January 28th , 2021. ID number:. io n. Instructions. s. Name and surname:. Read carefully the wording of the problem before you start. There are four problems altogether. You may use a A4 sheet of paper and a mathematical handbook. Please write all the answers on the sheets provided. You have two hours.. a.. b.. c. •. d. •. • •. • •. So. lu t. Problem 1. 2. 3. 4. Total.

(2) Methodology of Statistical Research, 2020/2021, M. Perman, M. Pohar-Perme. 1. (25) For purposes of sampling the population is divided into K strata of sizes N1 , N2 , . . . , NK . The sampling procedure is as follows: first a simple random sample of size k ≤ K of strata is selected. The selection procedure is independent of the sizes of strata. The second step is then to select a simple random sample in each of the selected strata. If stratum i is selected then we choose a simple random sample of size ni in this stratum for i = 1, 2, . . . , K. Assume the selection process on the second step is independent of the selection process on the first step. a. (10) Find an unbiased estimator of the population mean. Explain why it is unbiased. Hint: let Ii be the indicator that the i-th stratum is selected, and let X̄i be the sample average for the simple random sample selected in the i-the stratum. The estimator can be written using these random variables. From the description of the sampling procedure we have that the vector (I1 , I2 , . . . , IK ) is independent of all X̄i , and the variables X̄1 , X̄2 , . . . , X̄K are independent. Solution: Define  Ii =. 1 if stratum i is chosen, 0 else.. From the above it follows that E(Ii ) = P (Ii = 1) = k/K for all i. Let Ȳi be the sample average for the sample chosen in stratum i. We have E(Ii Ȳi ) = E(Ii )E(Ȳi ) =. k · µi . K. If we put Ȳ =. K X. wi ·. i=1. we have E(Ȳ ) =. K X. K · Ii Ȳi k. w i µi = µ .. i=1. b. (15) Find the standard error of your unbiased estimator. Solution: We have " K # X K2 X 2 var(Ȳ ) = 2 wi var(Ii Ȳi ) + 2 wi wj cov(Ii Ȳi , Ij Ȳj ) . k i=1 i<j By independence of Ii and Ȳi we have var(Ii Ȳi ) = E(Ii )E(Ȳi2 ) − E(Ii )2 E(Ȳi )2 .. 2.

(3) Methodology of Statistical Research, 2020/2021, M. Perman, M. Pohar-Perme. We have E(Ȳi2 ) = var(Ȳi ) + E(Ȳi )2 =. σi2 Ni − ni · + µ2i . ni Ni − 1. By independence of (Ii , Ij ) and (Ȳi , Ȳj ) we have cov(Ii Ȳi , Ij Ȳj ) = E(Ii Ij )E(Ȳi )E(Ȳj ) −. k2 µi µj . K2. By definition E(Ii Ij ) = P (Ii = 1, Ij = 1) =. k k−1 · . K K −1. It follows that k cov(Ii Ȳi , Ij Ȳj ) = µi µj K. . k−1 k − K −1 K. . Simplifying we find cov(Ii Ȳi , Ij Ȳj ) = −. (K − k)k µi µj . (K − 1)K 2. Putting all the pieces together gives the standard error.. 3. ..

(4) Methodology of Statistical Research, 2020/2021, M. Perman, M. Pohar-Perme. 2. (20) The Birnbaum-Saunders distribution has the density 2 !    1 1 1 √ 1 1 f (x) = exp − 2 x− √ + 2γ x1/2 x3/2 2γ x for x > 0 and γ > 0. Assume that the observed values x1 , . . . , xn are an i.i.d. sample from the density f (x). a. (5) Find the MLE estimate for the paramater γ. Solution: The log-likelihood function is `(γ, x) = −n log 2 − n log γ +. n X. 1. k=1. xk. 1/2. +. 1 3/2. xk. !. n 2 1 X  1/2 −1/2 − 2 x − xk . 2γ k=1 k. Take the derivative to get n 2 ∂` n 1 X  1/2 −1/2 . xk − xk =− + 3 ∂γ γ γ k=1. Set the derivative to zero and solve for γ to get v u n  2 u 1 X 1/2 −1/2 t γ̂ = x − xk . n k=1 k b. (5) Assume as known that    1 √ 1 P (X ≤ x) = Φ x− √ , γ x where Φ(x) is the distribution function of the standard normal distribution. Show that the variable Y defined as √ 1 Y = X−√ X has the N (0, γ 2 ) distribution. √ √ Solution: Denote f (x) = x − 1/ x. The function f (x) is increasing and P (Y ≤ y) = P (f (X) ≤ y)  = P X ≤ f −1 (y)    1 −1 = Φ f f (y) γ   y = Φ . γ. 4.

(5) Methodology of Statistical Research, 2020/2021, M. Perman, M. Pohar-Perme. c. (5) Is 2 n  1 X p 1 γ̂ = Xk − √ n k=1 Xk 2. an unbiased estimator of γ 2 ? Rešitev: Using part b. compute p  1 E Xk − √ = γ2 . Xk It follows that γ̂ 2 is an unbiased estimate of γ 2 . d. (10) Compute the standard error for γ̂. Solution: Compute the second derivative of the log-likelihood function for n = 1.   ∂ 2` 1 3 √ 1 =− 2 + 4 x− √ . ∂γ 2 γ γ x It follows  −E hence. ∂ 2` ∂γ 2.  =. 2 . γ2. γ se(γ̂) = √ . 2n. 5.

(6) Methodology of Statistical Research, 2020/2021, M. Perman, M. Pohar-Perme. 3. (25) Assume the observed values are pairs (x1 , y1 ), . . . , (xn , yn ). We assume that the pairs are an i.i.d. sample from the bivariate normal density given by f (x, y) =. 2 2 1 − bx −2cxy+ay 2(ab−c2 ) √ e 2π ab − c2. where a, b > 0 and ab − c2 > 0. We would like to test the hypothesis H0 : c = 0. versus. H1 : c 6= 0 .. a. (15) Assume as known that the unrestricted maximum likelihood estimates of the parameters are given by     1 Pn Pn 1 2 â ĉ x x y k k k k=1 k=1 n P = 1 nPn n 1 2 ĉ b̂ k=1 xk yk k=1 yk n n Find the likelihood ratio statistic λ for the testing problem. Solution: The log-likelihood function is given by n. X n 1 ` (a, b, c|x, y) = −n log 2π − log(ab − c2 ) − (bx2k − 2cxk yk + ayk2 ) . 2 2 2(ab − c ) k=1 Using the known unrestricted maximum likelihood estimates we get n   X n 1 2 ` â, b̂, ĉ|x, y = −n log 2π − log(âb̂ − ĉ ) − (b̂x2k − 2ĉxk yk + âyk2 ) . 2 2 2(âb̂ − ĉ ) k=1 We need to simplify the last expression. Summing up we get n X. (b̂x2k − 2ĉxk yk + âyk2 ) = b̂nâ − 2ĉnĉ + ânb̂ .. k=1. It follows that   n ` â, b̂, ĉ|x, y = −n log 2π − log(âb̂ − ĉ2 ) − n . 2 In the restricted case we need to maximize n n n n 1 X 2 1 X 2 ` (a, b|x, y) = −n log 2π − log a − log b − xk − y . 2 2 2a k=1 2b k=1 k The above expression is maximized when the terms containing a and b are maximized. We get n n 1X 2 1X 2 ã = xk and b̃ = y . n k=1 n k=1 k It follows We have.   n n ` ã, b̃, 0|x, y = −n log 2π − log ã − log b̃ − n . 2 2   λ = n − log(âb̂ − ĉ2 ) + log ã + log b̃ .. 6.

(7) Methodology of Statistical Research, 2020/2021, M. Perman, M. Pohar-Perme. b. (10) What is the approximate distribution of λ under H0 ? Solution: By Wilks’s theorem λ ∼ χ2 (r) where r = 3 − 2 = 1.. 7.

(8) Methodology of Statistical Research, 2020/2021, M. Perman, M. Pohar-Perme. 4. (25) Assume the following linear regression model: Y = Xβ +  with E() = 0 and var() = σ 2 V , where vij =. ρ|i−j| . 1 − ρ2. Assume that σ 2 is an unknown constant, and ρ ∈ (−1, 1) is known. a. (10) Let the components Z1 , Z2 , . . . , Zn of the vector Z be given by the CochranOrcutt transformation p Z1 = 1 − ρ2 Y1 in Zi = Yi − ρYi−1 for i = 2, 3, . . . , n. Compute var(Zi ), cov(Zi , Zj ) for i 6= j. Solution: Compute var(Z1 ) = σ 2 , and for i = 2, 3, . . . n p 1 − ρ2 cov (Y1 , Yi − ρYi−1 ) p  σ 2 1 − ρ2 i−1 = ρ − ρ · ρi−2 2 1−ρ = 0.. cov(Z1 , Zi ) =. Continue to compute 1 < i ≤ n: var(Zi ) = var(Yi − ρYi−1 ) = var(Yi ) − 2ρ cov(Yi , Yi−1 ) + ρ2 var(Yi−1 ) ρ2 σ ρ2 σ 2 σ2 = − 2 + 1 − ρ2 1 − ρ2 1 − ρ2 2 =σ , and cov(Zi , Zj ) = cov(Yi − ρYi−1 , Yj − ρYj−1 )  σ2 = ρj−i − ρj−i+2 − ρj−i + ρj−i+2 2 1−ρ = 0.. 8.

(9) Methodology of Statistical Research, 2020/2021, M. Perman, M. Pohar-Perme. b. (15) Find the best unbiased linear estimator of β. Solution: Define a new matrix X̃ by changing rows Xi of X into p X̃1 = 1 − ρ2 X1 and X̃i = Xi − ρXi−1 . Change the error terms into p η1 = 1 − ρ2 1. and. ηi = i − ρi−1 .. The model Z = X̃β + η satisfies the assumptions of the Gauss-Markov theorem. The BLUE β is −1 β̂ = X̃T X̃ X̃T Z .. 9.

(10)

Reference

POVEZANI DOKUMENTI

University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination June 29th , 2021.. Name

Let Ik be the indicator of the event that the k-th unit is selected and Ik,1 the indicator that the k-th unit will respond YES,YES.. Let Ik,3 be the indicator of the event that the

Solution: By independence the likelihood function is equal to Lθ = 2n1 θ2n0 +n1 1 − θn1 +2n2 where nk is the number of occurences of k among the observed values.. We

• We then select m units in each group selected on the first step by simple random sampling.. • The estimate of the population mean is the average ¯ Y of

• We then select m units in each group selected on the first step by simple random sample with replacement.. • The estimate of the population mean is the average ¯ Y of

The goal of the research: after adaptation of the model of integration of intercultural compe- tence in the processes of enterprise international- ization, to prepare the

The research attempts to reveal which type of organisational culture is present within the enterprise, and whether the culture influences successful business performance.. Therefore,

– Traditional language training education, in which the language of in- struction is Hungarian; instruction of the minority language and litera- ture shall be conducted within