• Rezultati Niso Bili Najdeni

University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination January 31

N/A
N/A
Protected

Academic year: 2022

Share "University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination January 31"

Copied!
7
0
0

Celotno besedilo

(1)University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination January 31st , 2019. ID number:. io n. Instructions. s. Name and surname:. Read carefully the wording of the problem before you start. There are four problems altogeher. You may use a A4 sheet of paper and a mathematical handbook. Please write all the answers on the sheets provided. You have two hours.. a.. b.. c.. So. lu t. Problem 1. 2. 3. 4. Total. •. d.. • • •.

(2) Methodology of Statistical Research, 2018/2019, M. Perman, M. Pohar-Perme. 1. (25) In a population od size N there are two types of units: A and B. If a unit is selected into a sample and is asked about its type the response is not necessarily truthful. A unit of type A will say that it is of type A with probability pAA and will say that it is of type B with probability pAB and similarly for units of type B. Assume that units choose their reponses indpendently of each other and independently of the sampling procedure. Assume all the probabilities pXY are known. Suppose a simple random sample of size n is chosen from the population. Denote the proportion of units of type A by a and the proportion of units of type B by b. We would like to estimate a and b. a. (5) Let NA be the number of units in the sample of type A and NB the number of units in the sample of type B. Let MX be the number of units in the sample who respond A, and MB the number of units in the sample who respond B. Calculate E (MA |NA = nA , NB = nB ) . Solution: The units of type A in the sample respond A with probability pAA . By assumption their responses are independent so the conditional distribution of truthful responses is binomial with parameters nA and pAA . A similar argument hold for units of type B. It follows E (MA |NA = nA , NB = nB ) = nA pAA + nB pBA .. b. (5) Suggest unbiased estimates for the proportions a and b. When is it possible to estimate a and b? Solution: By the formula for total expectation we have E(MA ) = napAA + nbpBA and E(MB ) = n − E(MA ) = napAB + nbpBB . If pAA pBB − pAB pBA 6= 0 we can solve for a to get (PBB + pBA ) n1 E(MA ) − pBA . a= PAA pBB − pAB pBA The quantity MA is observed. By linearity of expectation we have that â =. (PBB + pBA ) n1 MA − pBA PAA pBB − pAB pBA. is an unbiased estimate of a, and b̂ = 1 − â is an unbiased estimate of b.. 2.

(3) Methodology of Statistical Research, 2018/2019, M. Perman, M. Pohar-Perme. c. (5) Compute var (MA |NA = nA , NB = nB ). Use this to compute var(MA ). Hint: NA ∼ HiperGeom(n, N a, N ). Solution: Conditionally on {NA = nA , NB = nB } the random variable MA is the sum of two independent binomial random variable. We have var (MA |NA = nA , NB = nB ) = nA pAA (1 − pAA ) + nB pBA (1 − pBA ) . Consequently  E MA2 |NA = nA , NB = nB = nA pAA (1−pAA )+nB pBA (1−pBA )+n2A p2AA +n2B p2BA and E(NA ) = na, E(NB ) = nb and E(NA2 ) =. nab(N − n) + n2 a2 N −1. and. E(NB2 ) =. nab(N − n) + n2 b2 . N −1. Unconditioning we get E(MA2 ) = napAA (1 − pAA ) + nbpBA (1 − pBA )     nab(N − n) nab(N − n) 2 2 2 2 2 + + n a pAA + + n b p2BA . N −1 N −1 We have E(MA ) = napAA + nbpBA and consequently   (N − n)(p2AA + p2BA ) − 2npAA pBA . var(MA ) = n(a + b)pAA (1 − pAA ) + nab N −1. d. (10) Compute the standard error of the unbiased estimate â. Solution: We have var(â) =. p2BA var(MA ) . (pAA pBB − pAB pBA )2. 3.

(4) Methodology of Statistical Research, 2018/2019, M. Perman, M. Pohar-Perme. 2. (25) Assume that the data x1 , x2 , . . . , xn are an i.i.d. sample from the discrete distribution given by P (X1 = x) = (x − 1)p2 (1 − p)x−2 for x = 2, 3, . . .. a. (10) Find the MLE estimate for the parameter p. Solution: The log-likelihood function is n n X X `(p, x) = log(xk − 1) + 2n log p + log(1 − p) (xk − 2) . k=1. The derivative is. k=1. 2n ∂` = − ∂p p. Pn. − 2) . 1−p. k=1 (xk. Equating to zero we get 2n p̂ = Pn. k=1. xk. .. b. (10) Compute the Fisher information I(p). Solution: We have. 2 x1 − 2 ∂ 2` =− 2 − . 2 ∂p p (1 − p)2 Replacing x1 by X1 and computing the expectation we get I(p) =. 2 E(X1 ) − 2 − . 2 p (1 − p)2. To compute E(X1 ) note that X1 ∼ NegBin(2, p) and hence E(X1 ) = 2/p. Alternatively ∞ X k(k − 1)p2 (1 − p)k−2 E(X1 ) = k=2. and using ∞ X. k(k − 1)xk−2. k=2. d2 = 2 dx. ∞ X. ! xk. k=0. =. 2 , (1 − x)3. we get I(p) =. 2 . − p). p2 (1. c. (5) Write the 95%-confidence interval for the parameter p based on x1 , x2 , . . . , xn . Solution: The interval is. where z0,975. √ p̂ 1 − p̂ p̂ ± z0,975 √ , 2n . = 1,96 is the 0,975-quantile of the standard normal distribution.. 4.

(5) Methodology of Statistical Research, 2018/2019, M. Perman, M. Pohar-Perme. 2. (25) Suppose that the random variables X1 , X2 , . . . , Xn are independent equally distributed random variables taking only values 0, 1 and 2 with probabilities P (X1 = 0) = θ2 ,. P (X1 = 1) = 2θ(1 − θ) and P (X1 = 2) = (1 − θ)2. where 0 < θ < 1. We would like to test H0 : θ = 1/2. H1 : θ 6= 1/2 .. against. a. (15) Find the likelihood ratio statistic given the observed values x1 , x2 , . . . , xn from the above distribution. Hint: Express the likelihood function with n0 , n1 and n2 where nk is the number of occurences of k among the observed values for k = 0, 1, 2. Solution: By independence the likelihood function is equal to L(θ) = 2n1 θ2n0 +n1 (1 − θ)n1 +2n2 where nk is the number of occurences of k among the observed values. The MLE estimate of θ is the solution of 2n0 + n1 n1 + 2n2 − = 0. θ 1−θ We have θ̂ =. 2n0 + n1 . 2n. We have Λ=. θ̂2n0 +n1 (1 − θ̂)n1 +2n2 .  1 2n 2. It follows λ = 2 log Λ = 2n log 2 + (2n0 + n1 ) log θ̂ + (n1 + 2n2 ) log(1 − θ̂) .. b. (10) How would you determine the critical value if the confidence level is α. Solution: By Wilks’s theorem under H0 the distribution of λ is χ2 (1). The critical level is the number cα such that P (χ2 (1) > cα ) = α .. 5.

(6) Methodology of Statistical Research, 2018/2019, M. Perman, M. Pohar-Perme. 4. (25) Let the regression model be given by Yk1 = α + βxk1 + k1 Yk2 = α + βxk2 + k2 for k = 1, 2, . . . , n. Assume that E(k1 ) = E(k2 ) = 0, var(k1 ) = var(k2 ) = σ 2 + τ 2 and cov(k1 , k2 ) = σ 2 . For k 6= l we have cov(ki , lj ) = 0 for i, j = 1, 2. Assume that the ratio ρ2 = τ 2 /σ 2 is known. Let p p a = ρ + 2 + ρ2 and b = ρ − 2 + ρ2 . a. (10) Compute var (ak1 + bk2 ) , var (bk1 + ak2 ) and cov (ak1 + bk2 , bk1 + ak2 ) . Solution: We compute var (Vk )   2 σ + τ2 σ2 VT = V σ2 σ2 + τ 2     2 2 2 1 1 + ρ 0 1 + ρ 1 2 1+ρ p p = σ 1 1 + ρ2 0 ρ 2 + ρ2 −1 ρ 2 + ρ2   p 1 0 2 2 2 = σ ρ(1 + ρ ) 2 + ρ 0 1 b. (10) Find explicitly the best unbiased linear estimates of the parameters α and β. Solution: For k = 1, 2, . . . , n define Ỹk1 = (1 + ρ2 )Y1k − Yk2 p Ỹk2 = ρ 2 + ρ2 Yk2 , and .  Ỹ11  Ỹ12     Ỹ21      Ỹ =  Ỹ22   ..   .    Ỹn1  Ỹn2.  2 (1 + ρ2 )x11 − x12 pρ √ ρ 2 + ρ 2 ρ 2 + ρx12    2  (1 + ρ2 )x21 − x22    pρ √  ρ 2 + ρ2 ρ 2 + ρx 22 X̃ =     . . . .   . .   2 2   ρ (1 + ρ )x − x n1 n2 p √ ρ 2 + ρ2 ρ 2 + ρxn2 . and. 6.

(7) Methodology of Statistical Research, 2018/2019, M. Perman, M. Pohar-Perme. and ˜k1 = (1 + ρ2 )1k − k2 p ˜k2 = ρ 2 + ρ2 k2 .. and. .  ˜11  ˜12     ˜21      ˜ =  ˜22   ..   .    ˜n1  ˜n2. For the model.   α Ỹ = X̃ + ˜ β. the usual assumptions for linear regression models are met. By Gauss-Markov theorem we have that    −1 α̂ = X̃T X̃ X̃T Ỹ β̂ is tha BLUE for α and β. c. (5) Give the standard error for the BLUE of β. Solution: In the new model the ˜ki are uncorrelated with variance p p var(˜ki ) = σ 2 ρ 1 + ρ2 2 + ρ2 .  −1 Denote C̃ = X̃T X̃ . By known formulae we have se(β̂) = σ. p p √ 1 + ρ2 4 2 + ρ2 c22 .. 7.

(8)

Reference

POVEZANI DOKUMENTI

The sampling procedure is as follows: first a simple random sample of size k ≤ K of strata is selected.. The selection procedure is independent of the sizes

University of Ljubljana Doctoral Programme in Statistics Methodology of Statistical Research Written examination June 29th , 2021.. Name

Let Ik be the indicator of the event that the k-th unit is selected and Ik,1 the indicator that the k-th unit will respond YES,YES.. Let Ik,3 be the indicator of the event that the

Matej Černe from the Faculty of Economics of the University of Ljubljana who ac- cepted the position of the new editor of the Dy- namic Relationships Management Journal, and

The goal of the research: after adaptation of the model of integration of intercultural compe- tence in the processes of enterprise international- ization, to prepare the

Such criteria are the success of the managed enterprises (e.g. profitabil- ity, social responsibility) as we claim that it is the ut- most responsibility of managers; the attainment

The research attempts to reveal which type of organisational culture is present within the enterprise, and whether the culture influences successful business performance.. Therefore,

– Traditional language training education, in which the language of in- struction is Hungarian; instruction of the minority language and litera- ture shall be conducted within