GENERAL CONCEPT
Statistical data:
- Observational data: no dataset planned in advance, and the source is unknown. The
research do not manipulate any variables, they just observe what’s happening.
- Experimental data: are totally different kind of data. Analysis of variance can be very
useful when you start studying a set of data.
RSM: Resource Surface Methodology.
Variable: variable could be categorical (qualitative) or quantified (continued or discrete).
Difference between variable and factor:
- Variable (2° step DoE, statistical modeling): they can be quantitative or qualitative
and can assume all the values contained in X.
- Factors (1° step DoE, planning of the experiment): they represent SoVs, they can be
quantitative or categorical and they can assume a specific set of values called levels, which
define the X. A factor could be quantitative or qualitative.
SoVs= Sources of variation, to indicate the variables that can influence the results of an
experiment.
If one factor is categorical:
a) RV is estimated for each level of F
b) Replace it with a quantitative factor, where each set of value identifies a level 1
estimation of RV.
Interaction: measure how the effect of a factor on the RV, depends/ is affected by changing
the level of another one. If it’s null or negligible, it means that the effect of a factor on the RV is
the some across different levels of another factor.
2 Fs: we may define AB as the average difference between the effect of B at the high level of
A and the effect of B at the low level of A.
Effect: effect is different from variable. Direct (or indirect) impact of a factor on the RV, is
measured through a coefficient. It can be:
- Fixed: variables whose values are known and fixed, because we control them.
- Random: variables whose values are not known in advance, they’re not fixed. They
cannot be controlled a priori (eg. Xz method).
Source of variability: a fraction of the RV variability captured by a factor.
Statistical model: it describes the relationship (expressed by the model parameters) between
variables (independent) and the response variable (dependent), with the goal of making
inference. You don’t fit all the data point, but you try to minimize residuals.
REGRESSION MODEL
LINEAR relationship
the independent
set
model which
LRM models
is of
anyway
a a
variable
reguessos) equation
RVI the
by
variables to
lineau
dependent
and fitting a
a
lobservational experimental
data au the
quantify
understand and
and relationship
lineau pretrations
GOAL make .
Assumptions :
linearify
· with
and distribuited and
Indipendently homeledastiy
Noumally
ein o
mean
· ,
Let To
linea
start illustrate
models
by consider
reguession
gifting example
We an
,
. the
to
c hemical temperature
yield reaction sataly
the and of mate
the feed
relating of a .
response
y : variables S
Xi e u ro
:
:
model is
The basic &
t
: ↳ imper a
paxe Mae
example Regression
of Lineau
CRM
+ ,
represents the sataly
the represents temperature of
represents
the
yieed
where and X
I In
, ,
two
flue voliables
refusou
calse
and
rate independent I
voliables du
geed name
X x
: a re
reguessaus because the
The term linear equation
is function
=> used the
linea
is unknown
of
a
Bo B1
havameteus B2
and
, .
Partial sofficient The
B
Br and measurement important
unit
reguession . B is
of of
The unit flue
measurament Bis unit of
of of measure
ment Vy/Ux
of .
The tool
variation fo
between
B
=> is
of ,
the
describes two
The The
dimensional parameteu defines
plane Bo
model in space
x x .
a , soggiorents
The the
intercent By nautical
the and B
because
plane reguession
B
of a re ,
. constant
the held
unit elvange the
is
expected change in When
in
measures neu Xe
X1
y ,
the clange
which unit
slarge
expected When
in
B in X2
you neu XI
same measures y
,
leef camstamt
is .
In between
the
mandom value
difference observed
and
LRM is response
asservance
e u row
&
MODEL REGRESSORS
K
WITH the
the to
In Then
general be general
related k variables
variable
response y may regresso
, .
model is
tex
: ab
Be
... +
that multiple The
regression
lineau variables
is parameters
With K
model Bj
.
regresso
a hypeuplane the
sofficients
the describes
called This model
j k in
regression
1
0
, a re a
= ,
.... , .
SxjY the
-Dimensional The parameter represents
By
variables
the expected
space of leguresso . the independent
unit all remaining
cange
clange When
in
in variables
xj
response per
y
/ifj) constant
leld
:
X are equation
the
flat interaction
than often
complex
Models consider adding
mave
are an
may fluis
two
term to change
however
the into
girst-ader implies
variables
model in a
,
the clavacteristies
variable .
ESTIMATION
PARAMETER FINING
= MODEL parameters
flan
to
When bigger
have number
need
apply model size
sample of
we
we a ,
to
want estimate
I . to
Four novameters
BEX low
BEX
Box Bx i
& need
formula
example in y ... many
=
estimate ? k 1
+ . typically the
method
The to
least is
ordinary estimate
OLS reguession
used
Squares
of Situation
standard Sie Without
lineau
multiple
soggioients model in
reguession
in a
a .
the that
beterosoluedastrictyI must
that sampling size
averdispension is
Suppose
an so we
i ,
. For
K
have will have
observations each observed yi we
> response
yz
y1
m ym
: , a
...,
, ,
. teum a
the the
that
observation
sauvespending variable in
also
We
XJ
of assume e ro
. Juif
E/2) VIC) that the
and
las unevelated
and
5) mondem variables
model 0 are
=
=
Nowmally distributed
Data assumed
a re . must that
estimation
apply method
=> weven mine residuals
I
as assume
we homesedasticity
expected that
asorming
lave
value
lave I the
and variance
o
= am
, .
constant fuel id
distributed
data
usually and
is is
this :
not a re
ave homeseluedastricity
*
22/
& Niidn/0 the
, is contrary
homeschedasticity assumption
> leteusschedasticity
of NO CONSTANT
↓ VARIATION
that yi
implies
This luctauescle
If lave tasticity
=> we
distribuited
normally
a re weiglited
(w
have WIS
we = the
have mot
If Mandam
With all have
variables variables
fixed
OLS apply
we can
we
we
.
LRM . limited apply
is model only normal
lave homesolvedativity
if
LRM
=> can
we we
a ,
only continuous
assumption iid voliables .
,
, the
the
teums
the estimates
write
We equation
motel following
I of in
as
may ,
formula &
: (3)
formula
include
=> we
m
alternatively
an Bexiz
Bexin Baxik
Bo
Yi if the
If lave in
I
+ +
+ +...
= must mot
formula i
Botixij
1 the
include sandem
2
1 ...
= = 1 e u ro
B's
to the
the estimates flat
obtain
The method allows
Os
of of
so sum
the not
mininizes that
Si
16s) This does
the imply
is
of e u ro rs yo
squares , .
best
the
lave clussem data
gifting four youve . /
-yi
yand i (y
to the
wants between min
minimizes :
difference
LRM
=> : .
* BETWEEN OBSERVATIONAL
DIFFERENCE AND DATA
EXPERIMENTAL :
example Grag1 50/100
M Gragz 1000/2000
2000 - nu experimental
dota
& If have
I
1000 - investigate this regiona
avea experimental
in
2
↑
↑
So 100 Instead limit
bata
observational I lave
have
I
if
=> no
,
-
Them the is
function
OLS :
The to to Br estimatous
minimized respect The
BK
with po
function I be OLS
is ,
, ...,
, .
M
B
Bo the
satiggy equations
must
BK following
say :
, ,
, ...,
tito
These called
a re
... gi
Bas least noumal
squares
equations
GL In
the 1)
Simplifying to
The
/P the
system
equations obtain equation solution
K
may +
we = . Br K
sefficiente
equations the
estimators
be the
will
normal reguession
OLS of o , .
, ...
re-write the matrix
equations
We starting
motation
in from
previous
now ,
may 131
model
equation : 2.
Bxxix
Baxiz
B1xin
Ba
* +
+ +
y: +
= K Bjxij Ei
Bo i 2
1
+ , ...,
+ m
=
= ,
,
:
↓
↓ ↓
matuix verta
vestau the of
levels ↓
of
of of
observations level Mantam
independent
the vertau
the
of of e u ro s
(m 1)
x (m 1)
x
variables reguession
(mxp) cefficients
1)
(px
B
the minimites
that
estimatous
to vector
The aim find
main is as
of :
,
,
&
-
transpose d
fluat be
Note expuessed
I as
may
i
&
11x1) /BX'y)
matuix transpose
because its
BIXy and
scalar yx3
is an
a a =
,
,
the
is scalar formulat
same . 2x x5
the
must satigy
The following +
a 0
y =
formulatty
to
This simpligies the equations
This mateix the
is naumal
form of OLS ,
the the equation
materix motation
which system
is you ,
the
To
formed pequations
by equations
normal
solve
. ,
the
by
both XX
multiply invense
sides this equation .
of this voually
formula
In
Bisexy * we
White to
Bos
the distinguist
Thus estrimatour
least of
squares
, with
obtainet
B
from methods
ten
·
matuix Xy
/pxpl
flat
Note px11
symmetrie In
vecteu bbition
XIX and
is is column
a .
a ,
a
the
must the
lighlighted sturture
it be matuix
special XX
of .
the (85) elements
diagonal elements
The the in
. XX
1 of of of
sums
are squares
the X
columns of i flue moss-products
elements the elements
off-diagonal of
The in
2 of
sums
a re
. the solums of X : the
Xy
The cross-uoducts
elements the
.
3 columns of X
of of
of
sums
a re
Gyif
the observations
and .
t
The is
gitted model
reguession in
Baxis
the
In filed model
salou gor , m
... value
the
The observation
actual the
between yi sovespanding
and filed
difference :
eizyi-yi bemoted by
mx1I
the besto
The is
residuale
resitual
named
is . of
say :
, tig
consider LRM
Weven
=> and cuvespending
lave Sot
response
we a y
we
a a
/deviance) component
Streetstres
into
divided to
be related
My beriance coveb Sot a
= ,
. component to
related resitual.
and
regression a test to
I low
related
is
that important
stauting
showed muche SREG
have is
a , greater
slowed than
respect the have Stres
<Scarica il documento per vederlo tutto.
Scarica il documento per vederlo tutto.
Scarica il documento per vederlo tutto.
Scarica il documento per vederlo tutto.
Scarica il documento per vederlo tutto.
Scarica il documento per vederlo tutto.
-
Appunti Statistica per la sperimentazione e le previsioni in ambito tecnologico (Parte 3)
-
Appunti Statistica per la sperimentazione e le previsioni in ambito tecnologico (Parte 2)
-
Prima parte Appunti di statistica per la sperimentazione e le previsioni in ambito tecnologico
-
Appunti + domande esame di Statistica per la sperimentazione e le previsioni in ambito tecnologico