Why does filtering data before PCA improve results?

Question

Morgan Facchin on 2 Aug 2022

Direct link to this question

https://it.mathworks.com/matlabcentral/answers/1772560-why-does-filtering-data-before-pca-improve-results

Edited: Bruno Luong on 2 Aug 2022

I have a set of images that I want to discriminate using PCA. I noticed that applying a low-pass filter (using filter2) to the images before feeding them into PCA greatly improves the results: it increases the fraction of the total variance captured by the first PCs, and the output matches what I expect more closely. This leads to a more general question: why does filtering improve the results? I have two conflicting intuitions on this:

On the one hand, the performance is better simply because filtering reduces the noise in the images
On the other hand, filtering is only a linear transformation of the data, and the principal axes found by PCA should be "dragged" by this linear transformation and give the exact same results.

Would you have any clues to help me clarify this?

7 Comments

Matt J on 2 Aug 2022

Edited: Matt J on 2 Aug 2022

The spatial filtering is linear, but I don't know why you think PCA is invariant to linear transformations of the observations. The following simplified example shows that it is not.

X=rand(7,5); X=X-mean(X);
[U,S,V]=svd(X,0); PCA1=U*S
PCA1 = 7×5
   -0.5159    0.3722   -0.3550    0.0126   -0.0581
    0.6617    0.6046    0.1841    0.0151   -0.0022
    0.2549   -0.3709   -0.0693    0.2474    0.0109
    0.2677   -0.2852   -0.2614    0.1047   -0.0047
   -0.4143   -0.1693    0.5058    0.0784   -0.0438
    0.1698   -0.2888   -0.0159   -0.4318   -0.0113
   -0.4240    0.1374    0.0118   -0.0264    0.1091
[U,S,V]=svd(X*rand(5),0); PCA2=U*S
PCA2 = 7×5
   -0.5649    0.4691   -0.1119    0.0754   -0.0108
    1.1273    0.1412   -0.2793   -0.0462   -0.0124
    0.2503   -0.4315   -0.0264    0.0420    0.0115
   -0.2608   -0.3301   -0.1545    0.0349    0.0043
    0.5185    0.0112    0.4618    0.0151   -0.0156
   -1.0065   -0.1389    0.0153   -0.0824   -0.0166
   -0.0639    0.2791    0.0950   -0.0387    0.0396
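The numerical example above can also be read off from how the covariance matrix transforms. The following is a short derivation sketch, assuming (as in the example) that the rows of X are centered observations and that the transformation is a right-multiplication by a square matrix A; the symbols C, V, Λ and Y are introduced here for illustration only.

```latex
\begin{aligned}
C   &= \tfrac{1}{n-1}\, X^{\top} X = V \Lambda V^{\top}
      && \text{(covariance of the original data and its eigendecomposition)} \\
Y   &= X A
      && \text{(linearly transformed data, still centered)} \\
C_Y &= \tfrac{1}{n-1}\, Y^{\top} Y = A^{\top} C A = (A^{\top} V)\, \Lambda\, (A^{\top} V)^{\top} \\
\text{If } A A^{\top} = I:\quad
    & (A^{\top} V)^{\top} (A^{\top} V) = V^{\top} A A^{\top} V = I,
      \qquad Y (A^{\top} V) = X A A^{\top} V = X V
\end{aligned}
```

When A is orthogonal, the columns of A^T V are orthonormal, so the last expression for C_Y is a valid eigendecomposition with the same eigenvalues Λ and the same scores XV. For a general A the columns of A^T V are not orthonormal, so the eigenvalues and principal axes of C_Y have to be recomputed and generally differ from those of C.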

Bruno Luong on 2 Aug 2022

Convolution f*g is linear with respect to f and with respect to g.
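As a quick numerical check of this linearity for the filter2 function mentioned in the question, here is a minimal sketch. The image sizes, the 3-by-3 averaging kernel and the weights a and b are arbitrary assumptions chosen for illustration.

```matlab
rng(1);                                   % repeatable random inputs
A = rand(8);  B = rand(8);                % two hypothetical 8-by-8 "images"
h = ones(3)/9;                            % 3-by-3 averaging kernel (assumed choice)
a = 2;  b = -0.5;                         % arbitrary scalar weights

lhs = filter2(h, a*A + b*B);              % filter the weighted sum of the images
rhs = a*filter2(h, A) + b*filter2(h, B);  % weighted sum of the filtered images
linErr = max(abs(lhs(:) - rhs(:)))        % should be at round-off level
```

Linearity in the image only says that filtering acts as a fixed matrix on the vectorized pixels; it does not say that this matrix is orthogonal, which is the property PCA would need in order to be unaffected.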

Bruno Luong on 2 Aug 2022

Edited: Bruno Luong on 2 Aug 2022

@Morgan Facchin

Let me try to understand your question. I wrote this extremely simple code to get a feel for how filtering improves PCA, and my conclusion is quite the opposite:

M=diag([1,100]);
x=randn(2,1e6);
y=M*x;
% PCA of Non filtered data
[U,S,V]=svd(y',0);
PCA=V(:,1);
if PCA(2)<0
    PCA=-PCA;
end
nfiltererror = norm(PCA-[0;1])
nfiltererror = 1.8027e-05
% PCA of filtered data
xf = mean(x,2);
yf = M*xf;
[Uf,Sf,Vf]=svd(yf',0);
PCAf=Vf(:,1);
if PCAf(2)<0
    PCAf=-PCAf;
end
filtererror = norm(PCAf-[0;1])
filtererror = 0.0279
if filtererror < nfiltererror
    fprintf('filter is better\n');
else
    fprintf('non-filter is better\n');
end
non-filter is better

So what do you observe? Can you make an MWE (for example, with 2 pixels) to show it?
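For comparison, one possible minimal example in which low-pass filtering does concentrate variance in the first PC is sketched below. It is not the asker's data: it assumes each observation is a smooth pattern with a random amplitude plus white noise, and the sizes, the noise level and the 9-tap moving-average kernel are all hypothetical choices.

```matlab
rng(0);                                  % fixed seed so the run is repeatable
nImg = 200;  nPix = 64;                  % hypothetical: 200 observations of 64 samples
t = linspace(0, 1, nPix);
pattern = sin(2*pi*t);                   % one smooth (low-frequency) pattern
amp = randn(nImg, 1);                    % random amplitude per observation
X = amp*pattern + 2*randn(nImg, nPix);   % rows = observations, plus white noise

h = ones(1, 9)/9;                        % moving-average low-pass kernel (assumed)
Xf = filter2(h, X);                      % 1-by-9 kernel smooths along each row only

s2  = svd(X  - mean(X)).^2;              % PC variances of the raw data (up to 1/(n-1))
s2f = svd(Xf - mean(Xf)).^2;             % PC variances of the filtered data
fracRaw      = s2(1)  / sum(s2)          % fraction of variance in PC1, raw
fracFiltered = s2f(1) / sum(s2f)         % fraction of variance in PC1, filtered
```

In this setup the filter leaves the smooth pattern almost untouched while averaging down the white noise, so the noise contributes less to the total variance and the first PC accounts for a larger share. This is consistent with both intuitions in the question: the filter is linear, but it is not orthogonal, so it rescales different directions differently.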

Answer 1

Matt J on 2 Aug 2022

Direct link to this answer

https://it.mathworks.com/matlabcentral/answers/1772560-why-does-filtering-data-before-pca-improve-results#answer_1019725

Edited: Matt J on 2 Aug 2022

PCA applied to the transformed cluster should find PC1 close to L', and therefore the projections of the images on L' should be the same as they were on L (within a scaling factor).

That is true for a rotation, but it is not true for arbitrary linear transformations when the dimension of L is greater than 1. We can rework my example above to examine how the singular values change under an arbitrary transformation when L and L' are 2D:

X=rand(7,2); X=[X,X]; X=X-mean(X);
S1=svd(X,0)
S1 = 4×1
    1.4299
    1.0318
    0.0000
    0.0000
S2=svd(X*rand(4),0)
S2 = 4×1
    2.0355
    0.2737
    0.0000
    0.0000

Clearly, the change is also more than just a global scaling:

S1./S2.*[1 1 0 0]'
ans = 4×1
    0.7025
    3.7701
         0
         0
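To contrast the rotation case with the arbitrary case, the same rank-2 construction can be run through both kinds of transformation. This is a minimal sketch; the orthogonal matrix Q (taken from a QR factorization of a random matrix) and the variable names are assumptions introduced for illustration.

```matlab
rng(2);                                       % repeatable random inputs
X = rand(7,2);  X = [X, X];  X = X - mean(X); % same rank-2 construction as above
[Q, ~] = qr(randn(4));                        % random 4-by-4 orthogonal matrix
A = rand(4);                                  % arbitrary (non-orthogonal) matrix

sOrig = svd(X, 0);                            % singular values of the original data
sRot  = svd(X*Q, 0);                          % after an orthogonal transformation
sArb  = svd(X*A, 0);                          % after an arbitrary transformation

rotDiff = norm(sOrig - sRot)                  % round-off level: spectrum is preserved
ratio   = sOrig(1:2) ./ sArb(1:2)             % generally two different factors
```

The orthogonal case leaves the singular values unchanged up to round-off, while the arbitrary case rescales the two nonzero singular values by different factors, which is why the relative variance per PC changes.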
