What is the difference between different ways to do least squares?

34 views (last 30 days)
I ran into a problem using different ways to do least squares: I got different results (some quite different), and I want to know why. Basically, I tried three methods to minimize ||Aθ - y||.
theta_train_5k = ((A_train_5k'*A_train_5k)^-1)*A_train_5k'*y_train_5k;
% Least squares via the normal equations with an explicit inverse
theta_train_5k_3 = A_train_5k\y_train_5k;
% Least squares via mldivide (backslash)
theta_train_5k_2 = lsqr(A_train_5k,y_train_5k);
% Least squares via the iterative solver lsqr
These gave different results. I then tried the same three methods with only 100 data points:
theta_train_100 = ((A_train_100'*A_train_100)^-1)*A_train_100'*y_train_100;
% Normal equations with an explicit inverse, now with 100 data points
theta_train_100_3 = A_train_100\y_train_100;
% mldivide for the 100-point system
theta_train_100_2 = lsqr(A_train_100,y_train_100);
% lsqr for the 100-point system
Here the results are even stranger: the entries of theta_train_100 are 1,000 to 100,000 times larger than those of theta_train_100_3 and theta_train_100_2. So I was wondering: when should I use which method? Does it have something to do with the condition number or the singular values of the matrix?
Please help. Thank you in advance.
Variables are in the attachment

Accepted Answer

Matt J on 13 Oct 2025 at 2:54
Edited: Matt J on 13 Oct 2025 at 14:56
The train_100 system is underdetermined, so of course you aren't going to get a unique solution.
For the 5k data, the only reason you see a significant disagreement with lsqr is that you ran lsqr with too few iterations and too loose a tolerance. You can see below that adjusting this reduces the disagreement. In any case, mldivide() is considered the efficient and stable method for small, nonsparse systems like yours, so there is no reason to be using lsqr.
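To see both points numerically, here is a quick diagnostic sketch (assuming the variables from the attachment are loaded):
load myvariable
size(A_train_100)  % if there are fewer rows than columns, the system is underdetermined
rank(A_train_100)  % rank below the column count means infinitely many least squares solutions
cond(A_train_5k)   % cond(A'*A) = cond(A)^2, so the explicit normal equations amplify rounding error the most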
load myvariable
theta_train_5k = ((A_train_5k'*A_train_5k)^-1)*A_train_5k'*y_train_5k;
% Least squares via the normal equations
theta_train_5k_3 = A_train_5k\y_train_5k;
% Least squares via mldivide (backslash)
theta_train_5k_2 = lsqr(A_train_5k,y_train_5k,1e-8,300);
% lsqr with a tighter tolerance (1e-8) and a higher iteration cap (300)
lsqr converged at iteration 183 to a solution with relative residual 0.36.
pdiff=@(a,b) norm(a-b)/norm(a)*100; % percent disagreement function
pdiff(theta_train_5k_3, theta_train_5k )
ans = 1.1997e-11
pdiff(theta_train_5k_3, theta_train_5k_2 )
ans = 7.7810e-04
3 Comments
Zeyuan on 13 Oct 2025 at 15:23
I am a bit confused. If we do not get a unique solution for the train_100 data, how can we still get results in theta_train_100, theta_train_100_2, and theta_train_100_3?
Also, I found that if we add 1e-8,300 to the code, it kind of overfits, so the testing accuracy goes down by 0.2%.
Matt J on 13 Oct 2025 at 15:31
Edited: Matt J on 13 Oct 2025 at 15:47
"I am a bit confused. If we do not get a unique solution for the train_100 data, how can we still get results in theta_train_100, theta_train_100_2, and theta_train_100_3?"
Least squares solutions still exist even when they are not unique (there will be infinitely many of them), but you cannot expect different methods to give you the same one.
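As a concrete sketch (assuming the attached variables are loaded): for the underdetermined train_100 system, backslash returns a basic solution with at most rank(A) nonzero entries, while pinv() returns the minimum-norm solution. The two vectors differ, yet both minimize the residual:
load myvariable
x_basic = A_train_100\y_train_100;         % basic solution (QR with column pivoting)
x_minnorm = pinv(A_train_100)*y_train_100; % minimum-norm least squares solution
norm(x_basic - x_minnorm)                  % the two solutions differ...
norm(A_train_100*x_basic - y_train_100)    % ...but their residuals...
norm(A_train_100*x_minnorm - y_train_100)  % ...are both minimal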
"Also, I found that if we add 1e-8,300 to the code, it kind of overfits, so the testing accuracy goes down by 0.2%."
That doesn't mean the least squares solver made a mistake. The equations you provided were still correctly solved, as we can see from the 3-way agreement between all the solver results.
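You can check this directly (a sketch, assuming the three 5k solutions were computed as above) by comparing residual norms rather than test accuracy:
% All three residual norms should agree to within the lsqr tolerance,
% i.e. every solver minimized ||A*theta - y|| as asked:
norm(A_train_5k*theta_train_5k   - y_train_5k)
norm(A_train_5k*theta_train_5k_3 - y_train_5k)
norm(A_train_5k*theta_train_5k_2 - y_train_5k)
Whether the resulting theta generalizes to the test set is a separate modeling question, not a solver error.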


More Answers (0)
