Estratto del documento

2 - REVIEW OF PROBABILITY

2.1 Random Variables and Probability Distributions

The set of all possible outcomes is called the sample space. An event is a subset of the sample

space, that is, an event is a set of one or more outcomes. The event “my computer will crash no

more than once” is the set consisting of two outcomes: “no crashes” and “one crash.”

A random variable is a numerical summary of a random outcome.

A formula giving the probabilities for different values of the r.v. X is called probability

distribution in the case of discrete r.vs (can take a finite or countably infinite set of values), and

probability density function for continuous r.vs (can assume an infinite number of different

outcomes, f.e. any value in [0,1]).

Bernoulli distribution. It’s the distribution of a discrete random binary variable, whose outcomes

are 0 or 1. One outcome has probability p, the other 1-p.E(G)=p. Var(G)=p(1-p)

Expected Value: E(Y) or μY, is the long-run average value of a random variable over many

repeated trials.

Variance: var(Y), is a measure of the dispersion or spread of a probability distribution.

2.3 Two Random Variables

Joint probability distribution of two discrete random variables, say X and Y, is the probability

that the random variables simultaneously take on certain values, say x and y.

Conditional distribution of Y given X, is the distribution of a random variable Y conditional on

another random variable X taking a specific value. Pr(Y=y|X=x).

-law of iterated expectations: the expectation of Y is the expectation of the conditional expectation

of Y given X, E(Y)=E[E(Y|X)]

where the inner expectation on the right-hand side is computed using the conditional distribution of

Y given X and the outer expectation is computed using the marginal distribution of X. For example,

the mean height of adults is the weighted average of the mean height of men and the mean height of

women, weighted by the propor- tions of men and women. The law of iterated expectations implies

that if the conditional mean of Y given X is zero, then the mean of Y is zero.

Two random variables are independent if knowing the value of one of them provides no

information about the other. Pr(Y=y|X=x)=Pr(Y=y)

A first measure of the extent to which two random variables move together is the covariance.

Another measure of the dependence between X and Y is the correlation, which, differently from

the covariance, is unitless.

2.4 The Normal, Chi-Squared, Student t and F Distributions

The Normal Distribution

A continuous random variable with a normal distribution has the familiar bell-shaped probability

62 62).”

density. The normal distribution with mean μ and variance is expressed concisely as “N(μ,

62

The standard normal distribution is the normal distribution with mean m = 0 and variance = 1

and is denoted N(0, 1). Random variables that have a N(0, 1) distribution are often denoted Z, and

the standard normal cumulative distribution function is denoted by the Greek letter Φ; Pr(Z ... c) =

Φ(c).

The Chi-squared Distribution

The chi-squared distribution is used when testing certain types of hypotheses in statistics and

econometrics.

The chi-squared distribution is the distribution of the sum of m squared independent standard

normal random variables. This distribution depends on m, which is called the degrees of freedom of

the chi-squared distribution. For example, let Z Z 2, and Z be independent standard normal

1, 3

21 2 23

random variables. Then Z + Z + Z has a chi-squared distribution with 3 degrees of freedom.

The name for this distribution derives from the Greek letter used to denote it: A chi- squared

2

distribution with m degrees of freedom is denoted χ .

m

The Student Distribution

The Student t distribution with m degrees of freedom is defined to be the distribution of the ratio

of a standard normal random variable, divided by the square root of an independently distributed

chi-squared random variable with m degrees of freedom divided by m. That is, let Z be a standard

normal random variable, let W be a random variable with a chi-squared distribution with m degrees

of freedom, and let Z and W be independently distributed. Then the random variable Z/√(W/m) has

Review of Probability

CHAPTER 2 a Student t distribution (also called t distribution) with m degrees of freedom; denoted with tm.

The Student t distribution has a bell shape similar to that of the normal distribution, but when m is

small (20 or less), it has more mass in the tails—that is, it is a “fatter” bell shape than the normal.

Simple Random Sampling and i.i.d. Random Variables

CONCEPT When m is 30 or more, the Student t distribution is well approximated by the standard normal

2.5 distribution and the t distribution equals the standard normal distribution.

In a simple random sample, objects are drawn at random from a population and

n

The F Distribution

each object is equally likely to be drawn. The value of the random variable for

Y

The F distribution with m and n degrees of freedom, denoted Fm,n , is defined to be the distribution

th

the randomly drawn object is denoted . Because each object is equally likely

i Y i

of the ratio of a chi-squared random variable with degrees of freedom m, divided by m, to an

to be drawn and the distribution of is the same for all the random variables

Y i,

i

c,

independently distributed chi-squared random variable with degrees of freedom n, divided by n. To

, are independently and identically distributed (i.i.d.); that is, the distri-

Y Y

1 n c,

state this mathematically, let W be a chi-squared random variable with m degrees of freedom and let

bution of is the same for all 1 , and is distributed independently

Y

Y i n

= 1

i

c,

V be a chi-squared random variable with n degrees of freedom, where W and V are independently

of , and so forth.

Y Y

2 n

distributed. Then (W/m)/(V/n) has an Fm,n distribution, that is an F distribution with numerator

degrees of freedom m and denominator degrees of freedom n.

In statistics and econometrics, an important special case of the F distribution arises when the

different members of the population are chosen, their values of will differ. Thus

Y

denominator degrees of freedom is large enough that the Fm,n distribution can be approximated by

c,

the act of random sampling means that , can be treated as random vari-

Y Y

1 n

the Fm,∞ distribution. In this limiting case, the denominator random variable V/n is the mean of

c,

ables. Before they are sampled, , can take on many possible values;

Y Y

1 n

infinitely many squared standard normal random variables, and that mean is 1 because the mean of

after they are sampled, a specific value is recorded for each observation.

a squared normal random variables is 1 Thus, the Fm,∞ distribution is the distribution of a chi-

squared random variable with m degrees of freedom, divided by m: W/m is distributed Fm,∞.

c,

i.i.d. draws. Because , are randomly drawn from the same population,

Y Y

1 n

2.5 Random Sampling and the Distribution of the sample Average

c,

the marginal distribution of is the same for each 1 , this marginal

Y i n;

=

i

The simple random sampling is a situation in which n objects are selected at random from a

distribution is the distribution of in the population being sampled. When has

Y Y i

population and each member of the population is equally likely to be included in the sample. The n

c, c,

the same marginal distribution for 1 , then , are said to be

i n, Y Y

= 1 n

observations in the sample are denoted Y , …, Y

identically distributed. 1 n.

When Yi has the same marginal distribution for i = 1, …, n, then Y 1, …, Yn are said to be

Under simple random sampling, knowing the value of provides no infor-

Y 1

identically distributed. Under simple random sampling, knowing the value of Y provides no

mation about , so the conditional distribution of given is the same as the

Y Y Y 1

2 2 1

information about Y , so the conditional distribution of Y given Y is the same as the marginal

marginal distribution of . In other words, under simple random sampling, is

Y Y

2 2 1

2 1

c,

distributed independently of , .

Y Y

distribution of Y . In other words, under simple random sampling, Y is distributed independently of

2 n

2 1

c,

When , are drawn from the same distribution and are indepen-

Y Y

Y , …, Y . 1 n

2 n independently and identically distributed

dently distributed, they are said to be

When Y 1, c, Yn are drawn from the same distribution and are independently distributed, they are

i.i.d.).

(or

said to be independently and identically distributed (or i.i.d.).

Simple random sampling and i.i.d. draws are summarized in Key Concept 2.5.

(However, please note: time series are not random samples. They are often autocorrelated (large

values follow large values, etc.) The textbook theory does not always apply!)

The Sampling Distribution of the Sample Average

The Sampling Distribution of the Sample Average c,

The sample average or sample mean, Y, of the n observations Y 1, …, Y is

sample average sample mean,

The or of the observations , is

Y, n Y Y n

1 n

a

n

1 1

g

+ + +Y ) . (2.43)

(Y

Y Y Y

= =

1 2 n i

n n 1

i =

An essential concept is that the act of drawing a random sample has the effect of

c, role in statistics and econometrics. We start our discu

samples , .

Y Y

For example, suppose our student commuter selected five days at random to

1 n

sampling distribution

is called the of because it is the probability distribution

Y bution of by computing its mean and variance und

Y

The sampling distribution of averages and weighted averages plays a central

record her commute times, then computed the average of those five times. Had

associated with possible values of that could be computed for different possible

Y population distribution of Y.

role in statistics and econometrics. We start our discussion of the sampling distri-

she chosen five different days, she would have recorded five different times—and

c,

samples , .

Y Y

1 n _

bution of by computing its mean and variance under general conditions on the

Y

thus would have computed a different value of the sample average.

The sampling distribution of averages and weighted averages plays a central

Mean and variance of Y

. Suppose that the observati

population distribution of Y.

Because is random, it has a probability distribution. The distribution of

Y Y

role in statistics and econometrics. We start our discussion of the sampling distri-

2

let and denote the mean and variance of (beca

Y

m s Y

Y i

_

sampling distribution

is called the of because it is the probability distribution

Y

bution of by computing its mean and variance under general conditions on the

Y c

the mean and variance is the same for all 1,

i

c, =

Mean and variance of Y

. Suppose that the observations , are i.i.d., and

Y Y

associated with possible values of that could be computed for different possible

Y 1 n

population distribution of Y. +

of the sum is given by applying Equation

Y Y

2

let and denote the mean and variance of (because the observations are i.i.d.

Y

c, 1 2

m s

samples , .

Y Y Y

Y i

_

1 n 2m . Thus the mean of the sample average is E

c,

=

An essential concept is that the act of drawing a random sample has the effect of making the sample

m

the mean and variance is the same for all 1, When 2, the mean

i n). n

= =

Y Y

c,

Mean and variance of Y

.

The sampling distribution of averages and weighted averages plays a central

Suppose that the observations , are i.i.d., and

Y Y

1 n

. In general,

m

average Y a random variable. Because Y is random, it has a probability distribution. The distribution

+ + +

of the sum is given by applying Equation (2.28): )

Y Y E(Y Y =

Y m

2

role in statistics and econometrics. We start our discussion of the sampling distri-

1 2 1 2 Y

let and denote the mean and variance of (because the observations are i.i.d.

Y

m s Y

Y i 1 1

+ *

2m . Thus the mean of the sample average is (Y )4 2m

E3 Y

= = =

of Y is called the sampling distribution of Y because it is the probability distribution associated a

c,

m

bution of by computing its mean and variance under general conditions on the

Y

the mean and variance is the same for all 1, When 2, the mean

1 2

i n). n

Y Y Y

= = 2 2 n

1

. In general, )

E(Y) E(Y

= =

with possible values of Y that could be computed for different possible samples Y , …, Yn .

m m

population distribution of Y. 1

+ + +

of the sum is given by applying Equation (2.28): )

Y Y E(Y Y

Y = i Y

n

m

1 2 1 2 Y 1

i =

12 12

+ *

2m . Thus the mean of the sample average is (Y )4 2m

E3 Y

= = =

_ a

Mean and variance of Y

m 1 2

Y Y Y

n

1

c, The variance of is found by applying Equatio

Y

Mean and variance of Y

. Suppose that the observations , are i.i.d., and

Y Y ) . (2.44)

E(Y) E(Y

= =

. In general, m

1 n

m

Suppose that the observations Y1,…,Yn are i.i.d., and let μY and 6^2Y denote the mean and

i Y

n

Y 2

+

2, var(Y ) 2s , so [by applying Equatio

Y

n

2 = =

let and denote the mean and variance of (because the observations are i.i.d.

Y 1

i =

m s 1 2 Y

Y

Y i

variance of Yi (because the observations are i.i.d. the mean and variance is the same for all i=1,

a 1

c, 2

4,

n cov(Y , ) 0 var(Y) . For general bec

Y n,

= =

1

the mean and variance is the same for all 1, When 2, the mean

i n). n

= = s

The variance of is found by applying Equation (2.37). For example, for

Y 1 2 Y

2

) . (2.44)

E(Y) E(Y

= =

…,n). m

i Y

n ≠

and are independently distributed for so co

Y i j,

+ + +

of the sum is given by applying Equation (2.28): )

Y Y E(Y Y = 1

2 m

+

2, var(Y ) 2s , so [by applying Equation (2.31) with and

Y

n a b

= = = =

j

1

i =

1 2 1 2 Y

1 2 Y 2

When n=2, the mean of the sum Y1+Y2 is given by applying equation 2.28: E(Y1Y2)= μY+ μY=2

1 1

+ *

2m . Thus the mean of the sample average is (Y )4 2m

E3 Y c,

= = =

1 2

m 4, a

cov(Y , ) 0 var(Y) . For general because , are i.i.d.,

Y n, Y Y Y

= = 1 2

Y Y Y

s

2 2

The variance of is found by applying Equation (2.37). For example, for

Y n

1 2 1

Y n i

2 1

μY. Thus the mean of the sample average is E[1/2(Y1+Y2)]=1/2*2 μY= μY. In general:

. In general, a b

var(Y ) var Y

=

m ≠

and are independently distributed for so cov(Y , ) 0 . Thus,

Y i j, Y =

Y 1

2 i

+

2, var(Y ) 2s , so [by applying Equation (2.31) with and n

Y

n a b

= = = =

j i j

1 2 Y 2 1

i =

c,

12 a

2

4,

cov(Y , ) 0 var(Y) . For general because , are i.i.d.,

Y n, Y Y Y

= = a a a a

s

n

1

1 2 1

Y n i

n n n n

1 1 1

) . (2.44)

E(Y) E(Y

= = a b

var(Y ) var Y

m

≠ +

= var(Y )

and are independently distributed for so cov(Y , ) 0 . Thus,

Y i j, Y =

=

i Y

n i

n i

j i j 2 2

n n

1

i = 1

i ≠

= 1 1 1,

i i j j i

= = =

a a a a

n

1 n n n

The variance of is found by applying Equation (2.37). For example, for

Y 1 1 2

The variance is: s

a b

var(Y ) var Y

= Y

+

var(Y ) cov(Y ,Y )

= .

=

i

n 1

i i j

2

+

2, var(Y ) 2s , so [by applying Equation (2.31) with and

2 2

Y

n a b

= = = =

n n n

1

i =

1 2 Y ≠

1 1 1,

i i j j i

2

= = =

c,

a a a

12 2

4,

cov(Y , ) 0 var(Y) . For general because , are i.i.d.,

Y n, Y Y Y

= = s n n n

1 1

1 2 1

Y n i

2 The standard deviation of is the square root of the

Y

s

+

var(Y ) cov(Y ,Y )

= Y

and are independently distributed for so cov(Y , ) 0 . Thus,

Y i j, Y =

. (2.45)

=

i i j

2 2

n n

j i j

n

These result hold whatever the distribution of Y is; that is, the distribution of Y does not need to

1 1 1,

i i j j i

= = =

i i

a

n 2

take on a specific form, such as the normal distribution, for these results to hold.

1 2n.

The standard deviation of is the square root of the variance,

Y

s s

Y

a b Y

var(Y ) var Y

= . (2.45)

= i

n n

1

i =

a a a

2.6 Large Sample Approximations to Sampling Distributions

n n n 2n.

The standard deviation of is the square root of the variance,

Y

1 1 s

Y

+

var(Y ) cov(Y ,Y )

= i i j

2 2

It’s important to know what the sampling distribution of Y is. There are two approaches to

n n ≠

1 1 1,

i i j j i

= = =

characterizing sampling distributions: an “exact” approach and an “approximate” approach.

2 M02_STOC1312_Ch02_pp060-110.indd 91

s Y . (2.45)

=

The exact approach entails deriving a formula for the sampling distribution that holds exactly for

n

any value of n. 2n.

The standard deviation of is the square root of the variance,

Y s

M02_STOC1312_Ch02_pp060-110.indd 91 29/07/14 2:06 PM

Y

The “approximate” approach uses approximations to the sampling distribution that rely on the

sample size being large. The large-sample approximation to the samplin

Anteprima
Vedrai una selezione di 16 pagine su 86
Applied Econometrics Pag. 1 Applied Econometrics Pag. 2
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 6
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 11
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 16
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 21
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 26
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 31
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 36
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 41
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 46
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 51
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 56
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 61
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 66
Anteprima di 16 pagg. su 86.
Scarica il documento per vederlo tutto.
Applied Econometrics Pag. 71
1 su 86
D/illustrazione/soddisfatti o rimborsati
Acquista con carta o PayPal
Scarica i documenti tutte le volte che vuoi
Dettagli
SSD
Scienze economiche e statistiche SECS-P/05 Econometria

I contenuti di questa pagina costituiscono rielaborazioni personali del Publisher mane15 di informazioni apprese con la frequenza delle lezioni di Applied Econometrics e studio autonomo di eventuali libri di riferimento in preparazione dell'esame finale o della tesi. Non devono intendersi come materiale ufficiale dell'università Università Cattolica del "Sacro Cuore" o del prof Monticini Andrea.
Appunti correlati Invia appunti e guadagna

Domande e risposte

Hai bisogno di aiuto?
Chiedi alla community