What does sumd method in k-means clustering function exactly calculate?

Question

Onur Kapucu il 8 Mag 2018

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/399776-what-does-sumd-method-in-k-means-clustering-function-exactly-calculate

Commentato: Onur Kapucu il 8 Mag 2018

I am doing basic experiments with kmeans function. As a real simple example, say that I have a data set of 4 items with 1 attribute and this attribute is their value:

Data=[1;2;3;4];

If I want to split this data set into 2 clusters I should get one centroid in 1.5 and another in 3.5:

[idx,C,sumd]=kmeans(Data,2)
C =     
1.5000
3.5000

and I get it. However to my understanding sumd in this case should be:

abs(1-1.5)+abs(2-1.5) or  abs(3-3.5)+abs(4-3.5)
ans =
       1

but I am getting sumd as:

sumd =
      0.5000
      0.5000

for both clusters. Instead of getting 1's for both.

My question is what exactly does sumd calculate?

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Ameer Hamza il 8 Mag 2018

1
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/399776-what-does-sumd-method-in-k-means-clustering-function-exactly-calculate#answer_319322

Modificato: Ameer Hamza il 8 Mag 2018

Apri in MATLAB Online

If you look at the documentation of kmeans(), you will know that it uses the square of the Euclidean distance, by default. So you should calculate it like this

abs(1-1.5).^2+abs(2-1.5).^2 or  abs(3-3.5).^2+abs(4-3.5).^2
ans = 
  0.5 (both cases)

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Onur Kapucu il 8 Mag 2018

Thanks

Accedi per commentare.

Answer 2

the cyclist il 8 Mag 2018

1
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/399776-what-does-sumd-method-in-k-means-clustering-function-exactly-calculate#answer_319323

It's because the default distance metric used is the squared Euclidean distance (for minimization, and reporting). See the Distance input parameter.

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Onur Kapucu il 8 Mag 2018

Thanks

Accedi per commentare.

What does sumd method in k-means clustering function exactly calculate?

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Più risposte (1)

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Vedere anche

Categorie

Tag

Community Treasure Hunt

What does sumd method in k-means clustering function exactly calculate?

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

1 Commento Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Più risposte (1)

1 Commento Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Vedere anche

Categorie

Tag

Community Treasure Hunt

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti