Schemes - Statistics
Introduction
In Statistics, we always start from data and try to extract information in order to learn something about the phenomenon ⇒ we want to make inference
- (z1, z2, ..., zn), n ≥ 1 ⇒ Sample (realizations)
- (X1, X2, ..., Xn), n ≥ 1 ⇒ Random sample (random variables)
We assume that the random sample comes from the model Xn ~ FX(., θ), θ∈Θ
It means that we assume that the random variables X1, X2, ..., Xn are independent and identically distributed, following a model parameterized by θ.
Notice that if I fix θ I know everything about the model.
What do we want to do? → Starting from data we want to estimate the parameter θ, in order to be able to make inference on the population.
Example:
Xn ~ N(μ, σ²), (μ, σ²) ∈ ℝ × ℝ+
In this example there are two parameters: θ = (μ, σ²). So θ is bi-dimensional (and in general it can be k-dimensional)
Now, let's consider a random sample (X1, ..., Xn) from Xn ~ FX(., θ), θ∈Θ, and suppose that the sample mean X̄n = (1/n) Σ Xi is an estimator for θ.
But is it a good estimator? Could we have a better one? We'll try to answer this kind of question.
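As a concrete illustration (a minimal sketch in Python, with made-up values μ = 2, σ = 2 and n = 100 that are not from the notes), we can simulate one realization of a Gaussian random sample and compute the sample mean:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical "true" parameters of the model Xn ~ N(mu, sigma^2)
mu, sigma = 2.0, 2.0
n = 100

# One realization (z1, ..., zn) of the random sample (X1, ..., Xn)
z = rng.normal(mu, sigma, size=n)

# The sample mean as a point estimate of mu
x_bar = z.mean()
print(f"sample mean = {x_bar:.3f}  (true mu = {mu})")
```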
First of all: What is an estimator?
- An Estimator Ŷ = f(X1, ..., Xn) is a function of the random variables X1, ..., Xn
- We want to study its distributional properties
- Examples of estimators: 1) Sample mean 2) Sample variance
The best we can do is to find the exact distribution of the estimator, but in general this is almost impossible. Exception: the Gaussian model is a special case in which we are able to find the distribution of the sample mean and of the sample variance.
So, let's assume that we know the model Xn ~ FX(., θ), θ∈Θ and the estimator Ŷ = f(X1, ..., Xn).
We need to know whether the estimator is good, that is, whether it has some specific desirable properties. In particular, we'll introduce tools and methods that allow us to construct estimators with good properties.
NOTATION:
- fX(z;θ) : density function (continuous)
- pX(z;θ) : probability mass function (discrete)
- FX(z;θ) : cumulative distribution function
- z = (z1,...,zn) : observed sample
- X = (X1,...,Xn) : random sample
- Xn∼f(.,θ), θ∈Θ : the random sample X comes from a model parameterized by θ
- Xi are independent and identically distributed as X
From the definition of random sample, we can find the distribution of X :
fX(z;θ) = ∏i=1n fXi(zi;θ) ⇒ JOINT DENSITY FUNCTION
We have used INDEPENDENCE : joint = product of marginals
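For instance, here is a minimal sketch (my choice of a Gaussian model and an arbitrary observed sample, not from the notes) of how the joint density factorizes; working on the log scale turns the product into a sum, which is numerically safer:

```python
import numpy as np
from scipy.stats import norm

# Joint density of an i.i.d. sample = product of the marginal densities.
# On the log scale the product becomes a sum of log-marginals.
def log_joint_density(z, mu, sigma):
    return norm.logpdf(z, loc=mu, scale=sigma).sum()

z = np.array([1.2, -0.3, 0.8, 2.1])  # arbitrary observed sample
print(log_joint_density(z, mu=1.0, sigma=1.0))
```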
In general, we are not interested in X = (X1,...,Xn) and in its distribution
Instead, we are interested in some functions T such that :
T : Rn → Rm
(X1,...,Xn) → T(X1,...,Xn)
This kind of function T is called STATISTIC.
In point estimation theory T is an ESTIMATOR, while in testing theory T is a TEST STATISTIC.
COMMON EXAMPLES:
- Sample mean : X̄n = (1/n) ∑i=1n Xi
- Sample variance : S²n = (1/n) ∑i=1n (Xi − X̄n)²
- The corrected (unbiased) version has 1/(n−1) instead of 1/n
- Sample moment of order r > 1 : X̄r,n = (1/n) ∑i=1n Xi^r
- Centered sample moment of order r > 1 : M̄r,n = (1/n) ∑i=1n (Xi − X̄n)^r
- Sample minimum : X(1) = min { X1,...,Xn }
- Sample maximum : X(n) = max { X1,...,Xn }
These are all examples of estimators T : Rn → Rm with m = 1
In general m ≤ n since we want to summarize data and the information that they contain.
If m = 1 (or at most m = 2), the statistic T gives a more synthetic summary of the data.
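All of these statistics are straightforward to compute from an observed sample; a minimal sketch (arbitrary sample z and order r = 3, my choices):

```python
import numpy as np

z = np.array([3.1, 0.4, 2.7, 1.9, 4.2, 2.0])  # arbitrary observed sample
x_bar = z.mean()                      # sample mean
s2 = ((z - x_bar) ** 2).mean()        # sample variance (1/n version)
s2_corr = z.var(ddof=1)               # corrected version, 1/(n-1)
m3 = (z ** 3).mean()                  # sample moment of order r = 3
m3_c = ((z - x_bar) ** 3).mean()      # centered sample moment, r = 3
z_min, z_max = z.min(), z.max()       # sample minimum and maximum
print(x_bar, s2, s2_corr, m3, m3_c, z_min, z_max)
```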
Theorem (Fisher-Cochran)
If Q, Q1, Q2 are random variables such that Q = Q1 + Q2 and if Q1 ∼ χ²(g1), Q ∼ χ²(g), then:
Q2 ∼ χ²(g2), where g2 = g − g1, and Q2 is independent of Q1.
Theorem
If X1, ..., Xn is a random sample from N(μ, σ2), then:
- X̄n and S²n are independent random variables.
- Conversely, if X̄n and S²n are independent random variables, then (X1, ..., Xn) is a random sample from a Gaussian model.
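A quick Monte Carlo illustration of the independence claim (a sketch with arbitrary simulation settings; near-zero correlation is consistent with independence, though not a proof of it):

```python
import numpy as np

rng = np.random.default_rng(1)
reps, n = 20_000, 10  # arbitrary simulation settings

# Many Gaussian samples of size n; one sample per row
samples = rng.normal(0.0, 1.0, size=(reps, n))
x_bar = samples.mean(axis=1)
s2 = samples.var(axis=1)  # 1/n version of the sample variance

# Under the Gaussian model X_bar and S^2 are independent,
# so their empirical correlation should be close to 0.
print(np.corrcoef(x_bar, s2)[0, 1])
```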
So, now we can find the distribution of S²n (**)
S²n = (1/n) ∑ (Xi − X̄n)²
Intuition: we expect a sort of chi-squared distribution, since (Xi − X̄n) is Gaussian and so (Xi − X̄n)² is the square of a Gaussian → chi-squared.
First of all, let's consider S*n = (1/n) ∑ (Xi − μ)² and multiply it by n/σ²:
(n/σ²) S*n = (1/σ²) ∑ (Xi − μ)² = (1/σ²) ∑ (Xi − X̄n + X̄n − μ)²
= ∑ ((Xi − X̄n)/σ)² + 2 ∑ ((Xi − X̄n)/σ)((X̄n − μ)/σ) + n ((X̄n − μ)/σ)²
The cross term vanishes because ∑ (Xi − X̄n) = 0, so:
(n/σ²) S*n = ∑ ((Xi − X̄n)/σ)² + ((X̄n − μ)/(σ/√n))²
Notice that:
- (Xi − μ)/σ is a standard Gaussian since Xi is Gaussian, so (n/σ²) S*n = ∑ ((Xi − μ)/σ)² ∼ χ²(n)
- (Xi − X̄n)/σ is Gaussian since Xi and X̄n are Gaussian
- (X̄n − μ)/(σ/√n) is a standard Gaussian since X̄n ∼ N(μ, σ²/n), so ((X̄n − μ)/(σ/√n))² ∼ χ²(1)
We can now apply the Fisher-Cochran theorem with Q = (n/σ²) S*n ∼ χ²(n) and Q1 = ((X̄n − μ)/(σ/√n))² ∼ χ²(1), so we have:
∑ ((Xi − X̄n)/σ)² = (n/σ²) S²n ∼ χ²(n−1) (**)
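We can check this result by simulation (a sketch with arbitrary settings of my choosing), comparing the empirical mean and variance of ∑ ((Xi − X̄n)/σ)² with those of a χ²(n−1), namely n−1 and 2(n−1):

```python
import numpy as np

rng = np.random.default_rng(2)
reps, n, sigma = 50_000, 8, 1.5  # arbitrary simulation settings

samples = rng.normal(0.0, sigma, size=(reps, n))
x_bar = samples.mean(axis=1, keepdims=True)
q1 = ((samples - x_bar) ** 2).sum(axis=1) / sigma**2

# If q1 ~ chi2(n-1), its mean is n-1 and its variance is 2(n-1)
print(q1.mean(), n - 1)       # both ≈ 7
print(q1.var(), 2 * (n - 1))  # both ≈ 14
```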
Condition of Identifiability
We say that a statistical model is identifiable if for all θ1, θ2 ∈ Θ with θ1 ≠ θ2
there exists at least one event E such that:
P[X ∈ E; θ1] ≠ P[X ∈ E; θ2]
Let's consider X = (X1,...,Xn) from a regular model Xn ∼ fX(⋅;θ), θ∈Θ
Let's call Vn(θ) = log L(θ;X) = log ∏i=1n fXi(Xi;θ) = ∑i=1n log fXi(Xi;θ)
We define the score function as the first derivative of the log-likelihood random variable Vn(θ) :
V'n(θ) = d/dθ log L(θ; X) = L'(θ; X)/L(θ; X)
Under the regularity conditions it can be proved that:
- E[V'n(θ)] = 0
- In(θ) = Var[V'n(θ)] = E[(V'n(θ))²] − (E[V'n(θ)])² = E[(V'n(θ))²] = −E[V''n(θ)] ⇒ FISHER INFORMATION
Remark:
All the notions presented above are valid for a parametric space of dimension k:
- Vn(θ) is a k-dimensional vector
- In(θ) is a k×k matrix
Now, let's prove that E[V'n(θ)] = 0:
E[V'n(θ)] = ∫Rn V'n(θ) ⋅ fX(z;θ) dz = ∫Rn (d/dθ fX(z;θ)) / fX(z;θ) ⋅ fX(z;θ) dz = ∫Rn d/dθ fX(z;θ) dz
By the Leibniz theorem we can swap integral and derivative:
= d/dθ ∫Rn fX(z;θ) dz = d/dθ (1) = 0 ⇒ E[V'n(θ)] = 0
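These two properties are easy to check numerically. A minimal sketch for a Bernoulli(p) model (my choice of example, not from the notes; for this model V'n(p) = ∑Xi/p − (n − ∑Xi)/(1−p) and In(p) = n/(p(1−p))):

```python
import numpy as np

rng = np.random.default_rng(3)
reps, n, p = 100_000, 20, 0.3  # arbitrary simulation settings

x = rng.binomial(1, p, size=(reps, n))  # many Bernoulli(p) samples
s = x.sum(axis=1)

# Score of a Bernoulli(p) sample: V'(p) = sum(Xi)/p - (n - sum(Xi))/(1 - p)
score = s / p - (n - s) / (1 - p)

print(score.mean())                    # ≈ 0, i.e. E[V'n(p)] = 0
print(score.var(), n / (p * (1 - p)))  # both ≈ 95.2, i.e. Var = I_n(p)
```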
Recall:
V'n(θ) is a function of θ and also of the data z=(z1,...,zn)
V'n(θ) = d/dθ log L(θ; X) = (d/dθ L(θ; X)) / L(θ; X)