Why is fitlm affected by variable scale?

Question

0 votes

Dear all,

My statistics is pretty solid, and my understanding is that if you fit a linear regression, the scale of the X and Y variables should not affect the resulting p-values. I am running fitlm on some data (see the attached demo and data), and changing the scale of the variables by transforming them to z-scores has a profound effect on the resulting p-values. In the attached code (Demo.m) I fit two models with the same model design to the same data (in the attached 'Data.mat' file). The only difference is that for model 1 the X and Y variables are normalised to z-scores and for model 2 they are not. I then scatter-plot the p-values of the two models. In the upper left corner you can see that two p-values that were not significant for model 1 become significant for model 2.

Sorry I cannot get the demo code embedded in this question, so I have attached it. If anyone has any insights into this that would be great :)

1 Comment

Devendra on 13 Apr 2024

Thank you very much for the detailed explanation. I am getting weird results from the fitlm function in my MATLAB code. I am attaching the code and the input data file; could you kindly have a look at the code and suggest how to get the correct results?

I would appreciate your kind cooperation.

Deva


Answer 1

Ive J on 1 Dec 2021

0 votes

Well, the real question would be why not?

You have introduced interaction terms into the model. The two models test different hypotheses (except for the interaction terms). You can find a good explanation here. Clearly, when you remove the interaction terms, all t-stats are the same for both models.
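To make the "different hypotheses" point concrete, here is a short derivation for the two-predictor case. The notation is an assumption added for illustration and does not come from the poster's Demo.m: $m_j$ and $s_j$ are the sample mean and standard deviation of predictor $x_j$, so that $x_j = m_j + s_j z_j$.

```latex
\begin{aligned}
y &= \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_1 x_2 + \varepsilon \\
  &= (\beta_0 + \beta_1 m_1 + \beta_2 m_2 + \beta_3 m_1 m_2)
     + s_1(\beta_1 + \beta_3 m_2)\, z_1
     + s_2(\beta_2 + \beta_3 m_1)\, z_2
     + s_1 s_2 \beta_3\, z_1 z_2 + \varepsilon
\end{aligned}
```

In the raw model the test on $x_1$ is a test of $\beta_1 = 0$, the effect of $x_1$ when $x_2 = 0$. In the z-scored model the test on $z_1$ is a test of $\beta_1 + \beta_3 m_2 = 0$, the effect of $x_1$ at the mean of $x_2$. The interaction coefficient is only rescaled by $s_1 s_2$, so its t-statistic does not change. Z-scoring $y$ as well shifts the intercept and divides the remaining coefficients and their standard errors by the same constant, which leaves the t-statistics untouched.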

1 Comment

Devendra on 13 Apr 2024

Edited: Devendra on 13 Apr 2024

Thanks for the valuable information.


Answer 2

Jeff Miller on 1 Dec 2021

0 votes

Your understanding is correct for a regression with only linear terms, but your model is nonlinear in the predictors because of the interaction terms. Consider:

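% X is the predictor matrix (assumed here to come from the attached Data.mat)
zX = zscore(X);
% A single predictor and its z-score are perfectly correlated
corr(X(:,1),zX(:,1))
ans =
       1
% The product of two raw predictors is not a linear function of the product of their z-scores
corr(X(:,1).*X(:,2),zX(:,1).*zX(:,2))
ans =
       0.2421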
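For anyone without the attached files, the same effect can be reproduced with a minimal sketch on synthetic data. Everything here is an assumption made for illustration: the data are random numbers rather than the poster's Data.mat, and the variable names (rawInt, zInt, tInt and so on) are hypothetical. It uses fitlm with the built-in 'interactions' and 'linear' model specifications and requires the Statistics and Machine Learning Toolbox.

```matlab
% Synthetic data (hypothetical; not the poster's Data.mat)
rng(0);                                  % reproducible random numbers
n  = 200;
X  = 5 + randn(n, 2);                    % predictors with non-zero means
y  = 1 + 0.5*X(:,1) - 0.3*X(:,2) + 0.2*X(:,1).*X(:,2) + randn(n, 1);

zX = zscore(X);                          % z-scored predictors
zy = zscore(y);                          % z-scored response

% Correlation between the raw product term and the z-scored product term
r = corr(X(:,1).*X(:,2), zX(:,1).*zX(:,2));

% Same design on raw and z-scored variables: main effects plus interaction
rawInt = fitlm(X,  y,  'interactions');
zInt   = fitlm(zX, zy, 'interactions');

% Same comparison without the interaction term
rawLin = fitlm(X,  y,  'linear');
zLin   = fitlm(zX, zy, 'linear');

% t-statistics without the intercept row; columns are [raw, z-scored]
tInt = [rawInt.Coefficients.tStat(2:end), zInt.Coefficients.tStat(2:end)];  % rows: x1, x2, x1:x2
tLin = [rawLin.Coefficients.tStat(2:end), zLin.Coefficients.tStat(2:end)];  % rows: x1, x2

disp(r)
disp(tInt)
disp(tLin)
```

In tLin the two columns should agree up to rounding. In tInt only the last row (the x1:x2 interaction) should agree, while the main-effect rows differ because the raw model evaluates each main effect where the other predictor equals zero and the z-scored model evaluates it at the other predictor's mean.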
