• Rezultati Niso Bili Najdeni

View of The distribution of the ratio of jointly normal variables

N/A
N/A
Protected

Academic year: 2022

Share "View of The distribution of the ratio of jointly normal variables"

Copied!
10
0
0

Celotno besedilo

(1)

The Distribution of the Ratio of Jointly Normal Variables

Anton Cedilnik

1

, Katarina Košmelj

2

, and Andrej Blejec

3

Abstract

We derive the probability density of the ratio of components of the bivariate normal distribution with arbitrary parameters. The density is a product of two factors, the first is a Cauchy density, the second a very complicated function. We show that the distribution under study does not possess an expected value or other moments of higher order. Our particular interest is focused on the shape of the density. We introduce a shape parameter and show that according to its sign the densities are classified into three main groups. As an example, we derive the distribution of the ratio Z =−Bm1/(mBm) for a polynomial regression of order m. For m=1, Z is the estimator for the zero of a linear regression, for m=2, an estimator for the abscissa of the extreme of a quadratic regression, and for

=3

m , an estimator for the abscissa of the inflection point of a cubic regression.

1 Introduction

The ratio of two normally distributed random variables occurs frequently in statistical analysis. For example, in linear regression, E(Y |x)=β01x, the value x for which the expected response 0 E(Y) has a given value y is often of interest. 0 The estimator for x , the random variable 0 X0 =

(

y0B0

)

/ B1 , is under the standard regression assumption expressed as the ratio of two normally distributed and dependent random variables B and 0 B , which are the estimators for 1 β0 and β1 and whose distributions and dependence are known from regression theory.

1 Biotechnical Faculty, University of Ljubljana, Jamnikarjeva 101, 1000 Ljubljana, Slovenia;

Anton.Cedilnik@bf.uni-lj.si

2Biotechnical Faculty, University of Ljubljana Jamnikarjeva 101, 1000 Ljubljana, Slovenia;

Katarina.Kosmelj@bf.uni-lj.si

3 National Institute of Biology, University of Ljubljana, Večna pot 111, 1000 Ljubljana, Slovenia; Andrej.Blejec@uni-lj.si

(2)

Similar to the example above is the situation of a quadratic regression,

2 2 1

) 0

|

(Y x x x

E =β +β +β , where the value sought is the x for which 0 E(Y) reaches its extreme value. At this point, the first derivative must be zero. Hence,

2 1

0 B /2B

X =− is expressed as the ratio of two normally distributed and dependent variables as well.

From the literature it is known that the distribution of the ratio Z =X Y , when X and Y are independent, is Cauchy. The probability density function for a Cauchy variable U:C ,

( )

a b is pU(x)=π

(

(xab)2 +b2

)

, where the location parameter a is the median, while the quartiles are obtained from the location parameter a and the positive scale parameter b, q1,3 =amb. This density function pU(x) has ‘fat tails’, hence U does not possess an expected value or moments of higher order (Johnson et al., 1994).

Some results about the ratio from the literature are:

(a) The ratio Z of two centred normal variables is a Cauchy variable (Jamnik, 1971: 149):

(

0, , , 1

)

: = = ≠±



 

N µX µY σX σY ρ Y

X

Y :

Z = X 

 

 = , =σσ 1−ρ2

σ ρσ

Y X Y

X b

a C

The simplest case is the ratio of two independent standardised normal variables which is a ‘standard’ Cauchy variable C

( )

0,1 .

(b) The ratio Z of two non-centred independent normal variables is a particular Cauchy-like distribution. This result is shown in Kamerud (1978).

(c) The ratio of two arbitrary normal variables is discussed in Marsaglia (1965) and leads again to a Cauchy-like distribution.

The case considered in (b) is not general and the result in the cited article is presented in a very implicit way. Marsaglia dealt with the ratio of two independent normal variables, having shown previously, however that any case could be transformed into this setting.

The objective of our work is to derive the probability density for the ratio of components of the bivariate normal distribution for a general setting. Let the vector W=[X Y]T: N

(

µX,µY,σX >0,σY >0,ρ

)

be distributed normally, with the density (for ρ ≠±1):









 − − − − + −

− −

− ⋅

= 2 2 2 2 2

2

) (

) )(

( 2 ) (

) 1 ( 2 exp 1 1

2 ) 1 , (

Y Y Y

X

Y X

X X Y

X

y y

x y x

x

p σ

µ σ

σ

µ µ

ρ σ

µ ρ ρ

σ

W πσ

and with the expected value and the variance-covariance matrix

(3)



 

=

Y

E X

µ ) µ

(W , 

 

= 2 2

) var(

Y Y

X

Y X X

σ σ

ρσ

σ ρσ

W σ .

Our aim is to express the density function of the ratio Z =X /Y explicitly, in terms of the parameters of the bivariate normal distribution. We shall also discuss the degenerate situation, ρ=±1.

2 Probability density for the ratio

The following theorem is the basis for our derivation of the probability density for the ratio (Jamnik, 1971: 148).

Theorem 1. Let W=[X Y]T be a continuously distributed random vector with a probability density function pW(x,y). Then Z = X Y is a continuously distributed random variable with the probability density function

∫ ∫ ∫

=

=

0

0

) , ( )

, ( )

(z y p zy y dy y p zy y dy

pZ W W . (2.1)

For the derivation of pZ(z) for the ratio of the components of a bivariate normal vector we calculated the integral (2.1) using formulae in the Appendix. A long but straightforward calculation gives the next theorem.

Theorem 2. The probability density for Z = X Y, where

(

, , , , 1

)

: ]

[X Y T N µX µY σX σY ρ≠± is expressed as a product of two terms:

=

 

 

 

 + ⋅Φ



 

− ⋅ + ⋅

= −

) (

) 1 (

2 sup exp 1 ) 2

( ) 1

( 2 2 2 2

2

R R R R

z z z

p

X Y X Y

Y X

Z π σ ρσ σ σ ϕ

ρ σ

σ

[ ]

 

 

 

− ⋅ −

⋅ Φ

⋅ +



 

− ⋅ + ⋅

= 2 22 2 2 sup 2 2

2 exp 1 ) ( 2

2 sup exp 1 ) 2

(

1 R R R R R

z

z X Y X

Y Y

X π

σ σ ρσ σ

π

ρ σ σ

(2.2) where:

2 2

2 2

2 2 2

2 2

2 1

2 1

) ) (

(



 

 + 



 

 −

 −

 

 −

+ =

+

= −

=

Y X Y

X

Y X Y Y X X Y

Y X X

X Y X Y

Y X

Y X X Y X Y

Y X X Y

z z

z z

z z z

R R

σ σ σ

ρσ ρ

σ σ σµ σµ σµ ρ

σµ ρ σ

σ ρσ σ

ρ σ σ

µ σ µ σ ρσ µ

σ ρσ µ

σ (2.2a)

2

2 2

2 2 2

2 2 2

2 2

1 2 )

1 ( sup 2

ρ

σµ σµ

σµ σµ ρ

ρ σ

σρσ σ µ µ σ µ µ

σ



 

 + 

 −

 

− = +

= − Y

Y Y

Y X X X

X

Y X

Y X Y X Y X X

R Y , (2.2b)

(4)

2 2

2

2 2

2

2 2

2

2 2

) sup (



 

 + 



 

 −

+ =

= −

Y X Y

X Y Y Y X X X

X Y X Y

Y X

z z

z z

z R z

R

σ σ σ

ρσ

σµ σ σ σµ σ

σ ρσ

σ µ µ

. (2.2c)

The first factor in (2.2), the standard part, is the density for a non-centred

Cauchy variable, 

 

 = , = 1−ρ2

σ σ σ

ρσ

Y X Y

X b

a

C . We have to stress that this factor is independent of the expected values µX and µY.

The second factor, the deviant part, is a complicated function of z, including also the error function Φ(.) (in Gauss form; see Appendix). We need four parameters: ρ,

X X

σµ ,

Y Y

σµ

and

Y X

σ

σ , to fully describe the distribution. It is strictly positive and asymptotically constant – it has the same positive value for both

±∞

=

z , due to the fact that

1 2

)

( σ σ ρ

µ ρσ µ

σ

± −

=

±∞

Y X

Y X X

R Y . Therefore, the asymptotic

behaviour of pZ(z) is the same as that of the Cauchy density, so E(Z) and other moments do not exist.

We wrote the deviant part in (2.2) in two forms. The first form is nicer and can also be found in Marsaglia (1965), but the second form is better for numerical purposes.

A more detailed analysis of pZ(z) led us to the definition of the shape parameter ω:



 

 −

=

Y Y X

X Y Y

σµ σµ ρ

σµ

ω , (2.3)

based on R(±∞) and

( )

(

2 2 2

)

3/2

2

2 1

X Y X Y

Y X Y

X

z z

z dz

dR

σ σ ρσ σ

µ µ ρ σ

σ

+

= − . The sign of ω separates

three different types of shape of pZ(z): I. ω >0

II. ω <0

III. ω =0 which occurs in three variants:

a. µY ≠0, b. µY =0≠µX, c. µY =0=µX .

(5)

The derivative of the deviant part led us to the definitions of two quantities for types I and II:

Y X

Y Y X X

Y

u X

σ σ σµ σ µ µ

µ =

= and

Y X

Y Y X

X

Y Y X X

d σσ

σµ σµ ρσ µ σ ρ µ

− ⋅

= . u is the abscissa of

the local maximum and d the abscissa of local minimum of the deviant part. For type I: d <a<u, and for type II: u<a<d ; as previously,

Y

a X

σ ρσ

= , the centre of the standard part (see Figure 1).

(X,Y) ~ N( 2 , 1 , 1 , 1 , 0 )

-4 -2 0 2 4

0

Density

Type I

-4 -2 0 2 4

0.00.10.20.3

(X,Y) ~ N( -2 , 0.25 , 1 , 1 , 0.5 )

-10 -5 0 5 10

0

Density

Type II

-10 -5 0 5 10

0.00.050.100.15

Figure 1: A case with a positive shape parameter (Type I) and with a negative shape parameter (Type II). On the left, the standard Cauchy part (thick line) and the deviant part (thin line) are presented; the functions are on different scales in order to depict the shapes of both functions on one plot. The vertical dashed lines indicate the abscissas of the local extremes of the deviant part, the horizontal dashed line is its asymptote. The

right plot presents the graph of the density pZ(z).

(6)

(X,Y) ~ N( 1 , 1 , 4 , 2 , 0.5 )

-4 -2 0 2 4

0

Density

Type IIIa

-4 -2 0 2 4

0.00.050.100.150.20

(X,Y) ~ N( 2 , 0 , 1 , 1 , 0.5 )

-10 -5 0 5 10

0

Density

Type IIIb

-10 -5 0 5 10

0.00.040.080.12

(X,Y) ~ N( 0 , 0 , 2 , 1 , 0.5 )

-10 -5 0 5 10

0

Density

Type IIIc

-10 -5 0 5 10

0.00.050.100.15

Figure 2: Three cases having zero value of the shape parameter (Type III). On the left, the standard Cauchy part (thick line) and the deviant part (thin line) are presented; the functions are on different scales in order to depict the shapes of both functions on one plot. The vertical dashed line indicates the abscissa of the local extreme of the deviant part, the horizontal dashed line is its asymptote. The right plot presents the graph of the

density pZ(z).

(7)

Type III describes the marginal case, not likely to occur in practice. In variant IIIa (resp. IIIb), the deviant part has only a maximum (resp. a minimum) at z=a. In variant IIIc, the deviant part is equal to constant 1 (see Figure 2).

The median M(Z) and mode(s) can not be obtained analytically for the general case; further numerical calculations have to be done for each particular case. But we have derived some partial results. For type I :

Y

Z X

M( )>ρσσ , for type II:

Y

Z X

M( )<ρσσ , for type III: pZ(z) is symmetric and

Y

Z X

M( )=ρσσ . Variants IIIa and IIIc are unimodal; generally, pZ(z) may be uni- or bimodal.

The distribution function and quantiles require numerical integration.

3 Degenerate situation

Now, let us consider the case ρ =±1, but still withσX >0, σY >0. Then, the distribution of W=[X Y]T is degenerate, and with probability 1, it holds

X X Y

Y X

Y

σ µ σ−µ =ρ⋅ −

; hence:

Y Y

Z X

Y Y X X

Y X

σ µ ρσ µ σ

ρσ +

=

= . Since the marginal

distribution Y:NYY) is the usual normal distribution, it is easy to find the probability density for Z from the following theorem.

Theorem 3. If Y:NYY) and

Y a c

Z = + , c≠0, then Z has the density given by









− −

=

2 2

2

2 exp 1 ) 2 (

)

( Y

Y Y

Z z a

a c c z

z

p µ

σ π

σ .

The function pZ(z) from this theorem is much simpler than (2) and it is rather easy to find its characteristics, including quantiles and distribution function. Also, there are two modes that can be found explicitly, and between them there is a removable singularity pZ(a)=0. The expected value, as in non-degenerate cases, does not exist.

It is worth noting that in the degenerate case the shape parameter (3) is zero precisely when pZ(z) is symmetric, as in the non-degenerate case. According to the sign of the shape parameter, the relations between the median M(Z) and the quantity

Y

a X

σ ρσ

= remain the same, as well.

(8)

4 Examples

Now, let us discuss the two problems presented in the Introduction. First, we will consider a linear regression E(Y |x)=β01x. We shall be interested in the x-axis intercept: X0 =−B0/ B1 , where B0 and B1 denote the estimators for β0 and β1. Under the assumption that Y |x: N01x, σreg) , the variable X is expressed 0 as the ratio of two normally distributed and dependent random variables −B0 and B1. Given the data

{

(xi,yi),i=1,...,n

}

(x1 <xn), we denote: x =n1

xi ,

= 1 2

xi

w n and

) (w2 x2 n

q reg

= σ −

. Then: 

 

 −



 

w q x qw B N

B : 0, 1, , ,

1

0 β β ,



 

−



 

−

w q x qw B N

B : 0, 1, , ,

1

0 β β . Hence, X has a distribution with density function 0

(2) on making the substitution: µX →−β0 , µY →β1 , σXqs , σYq , w

x

ρ .

Now we shall be concerned with a general polynomial regression

m mx x

x Y

E =β +β +K+β

1

) 0

|

( , m≥1. Let us define Z =−Bm1/(mBm). For m=1, X0

Z = from the first example, the estimator for the zero of a linear regression.

For m=2, Z is an estimator for the abscissa of the extreme of a quadratic regression, and for m=3, Z is an estimator for the abscissa of the inflection point of a cubic regression.

Introduce the following two data matrices:

) 1

, (

, 0

, , 1

+

×



 

 

 

=

= =

m n k

i k m

n x i

K K

v , the

matrix of powers of x-s , and Y=

[

yi (i=1,K,n)

]

n×1. The regularity condition, that there are at least m+1 distinct x-s, implies that the rank of v is precisely m+1. Hence, vTv is invertible and

[ ( ) ]

( 1) ( 1)

1

T ) , 0, ,

( ⋅ = = + × +

= v v djk j k K m m m

d . Let

[

( =0, , )

]

( +1)×1

= βk k K m m

β be the column of the regression coefficients, and

[

( =0, , )

]

( +1)×1

= Bk k K m m

B the column of their estimators. The normal system of equations in matrix form is then vTvB=vTY , and its solution is

B=dvTY (4.1) As usual, we shall suppose that they-s are independent normally distributed random variables with E yi =β +β xi +K+βmxim

1

) 0

( , var(yi)=σreg2 . Hence, the

(9)

vector Y is normally distributed with E(Y)=vβ, var(Y)=σreg2 I. According to (4), B is also normally distributed, E(B)=(dvT)⋅E(Y)=β,

d v

d Y v

d

B) ( T) var( ) ( T)T 2

var( = ⋅ ⋅ ⋅ ⋅ =σreg .

Introduce two matrices:

) 1 (

0 2

0 0

0 1 0 0

+

×



 

 −

=

m m

L

u L and

1 2 1

×

 

=−

=

m m

mB B B

u

W .

W is also a normal variable with

 

=−

=

=

m m

E m

E β

β 1

) ( )

(W u B u β and



 

= −

=

=

m m m

m

m m m

m reg

reg md m d

md d

, 2 ,

1

, 1 1

, 2 1

T 2

) T

var(

)

var(W u B u σ u d u σ . Therefore, the

distribution of W is 



− −

m m m m

m m m

m reg m

m reg m

m d d

d d m

d m

N

, 1 , 1

, 1 ,

1 , 1

1, β ,σ , σ ,

β .

Hence, Z has a distribution with density function (2) with the exchange:

1

m

X β

µ , µYmβm , σX σreg dm1,m1 , σYmσreg dm,m ,

m m m m

m m

d d

d

, 1 , 1

, 1

ρ→ .

References

[1] Jamnik, R. (1971): Verjetnostni račun. Mladinska knjiga, Ljubljana.

[2] Johnson N.L, Kotz, S., and Balakrishnan, N. (1994): Continuous Univariate Distributions. 1. John Wiley and Sons.

[3] Kamerud D. (1978): The random variable X/Y, X, Y normal. The American Mathematical Monthly, 85, 207.

[4] Marsaglia, G. (1965): Ratios of normal variables and ratios of sums of uniforms variables. JASA, 60, 163-204.

(10)

Appendix

+ + =

> ⇒

m

dt c bt at t

a

0

2 )

exp(

0

[ ]

 

 

 

 −

Φ +



 

 Φ

⋅

 

⋅ 

⋅ +

= a

a b a m

b a

b a

e b a am

a bm

ec c

2 2 4 2

2 exp )

exp(

2 1

2

2 π



 

 + ⋅Φ

= +

+

⋅ +

= + +

∫ ∫

−∞

( )

) 1 (

) exp(

) exp(

0

2 0

2

r r r a dt e c bt at t

dt c bt at t

c

ϕ ,

where:

a r b

= 2 

 

= 

= Φ

= 21 ( )

( ) 21erf 2

) (

0 2

2 r

dx x r

e r

r r

π ϕ

ϕ .

Reference

POVEZANI DOKUMENTI

The article focuses on how Covid-19, its consequences and the respective measures (e.g. border closure in the spring of 2020 that prevented cross-border contacts and cooperation

A single statutory guideline (section 9 of the Act) for all public bodies in Wales deals with the following: a bilingual scheme; approach to service provision (in line with

If the number of native speakers is still relatively high (for example, Gaelic, Breton, Occitan), in addition to fruitful coexistence with revitalizing activists, they may

We analyze how six political parties, currently represented in the National Assembly of the Republic of Slovenia (Party of Modern Centre, Slovenian Democratic Party, Democratic

Roma activity in mainstream politics in Slovenia is very weak, practically non- existent. As in other European countries, Roma candidates in Slovenia very rarely appear on the lists

Several elected representatives of the Slovene national community can be found in provincial and municipal councils of the provinces of Trieste (Trst), Gorizia (Gorica) and

We can see from the texts that the term mother tongue always occurs in one possible combination of meanings that derive from the above-mentioned options (the language that

The comparison of the three regional laws is based on the texts of Regional Norms Concerning the Protection of Slovene Linguistic Minority (Law 26/2007), Regional Norms Concerning