Graph analysis question

Question

0 voti

I have this data:

X = 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21

With

Y = 0.5 1 1.5 2 2.5 3 3.5 4 1 1.5 3 3.5 4.5 5.5 6 2 4 4.5 5.5 6.5 8

This may be more of a math question rather than a matlab question. If you plot the above data you will see three distinct sets of data, is there any way I can get matlab to automatically split this data into the three seperate variables. The number of variables may change depending on the data set, the data I have provided is indicative for the real data the X axis is actually dates.

In short I need Matlab to detect how many separate data sets there are and split the data into different variables.

I've tried using a max and min point script but I didn't get very far with it.

Thanks in advance

3 Commenti
Mostra 1 commento meno recente Nascondi 1 commento meno recente

the cyclist il 3 Ago 2011

Robert, if you do plot(X,Y,'.'), you will understand better what he means. The data lie in three well defined linear groups.

Robert Cumming il 3 Ago 2011

yes I see now that I actually plotted it... Misinterpreted the question

Accedi per commentare.

Accedi per rispondere a questa domanda.

Accedi per seguire l’attività

Answer 1

the cyclist il 3 Ago 2011

Apri in MATLAB Online

1 voto

After I posted my answer about cluster analysis, I noticed the following:

Are your (X,Y) data always sorted as in your example, or can the pairs be jumbled? If they are always sorted, you can just look for negative jumps in Y, using the find() and diff() commands, and separate the data wherever there is such a jump downward.

X = [1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21];
Y = [0.5 1 1.5 2 2.5 3 3.5 4 1 1.5 3 3.5 4.5 5.5 6 2 4 4.5 5.5 6.5 8];
N = numel(Y);
negativeJumpIndex = find(diff([0,Y])<0);
numberNegativeJumps = numel(negativeJumpIndex);
numberClusters = numberNegativeJumps + 1;
indexToNewLine = [1 negativeJumpIndex N+1];
IDX = zeros(N,1);
for nc=1:numberClusters
    IDX(indexToNewLine(nc):indexToNewLine(nc+1)-1) = nc;
end
figure
gscatter(X,Y,IDX)

1 Commento
Mostra -1 commenti meno recenti Nascondi -1 commenti meno recenti

Matt il 3 Ago 2011

Many thanks for all of the responses, the data is always sorted so I will try to look for negative jumps.

Once again, many thanks

Accedi per commentare.

Answer 2

the cyclist il 3 Ago 2011

0 voti

What you are asking about is generically known as "cluster analysis", and MATLAB does have some functionality for it, at least in the Statistics Toolbox. If you have that toolbox, look at "doc kmeans" as a possible starting point.

1 Commento
Mostra -1 commenti meno recenti Nascondi -1 commenti meno recenti

the cyclist il 3 Ago 2011

I put this answer up before I plotted out your data. Not sure that kmeans() is going to help you, because I think it is always searching for a "nearest neighbors" sort of clustering, rather than the linear groups you have. It seems to me that you have a fairly specialized problem here, and you'll likely need to write custom code for it.

Accedi per commentare.

Answer 3

Wolfgang Schwanghart il 3 Ago 2011

Apri in MATLAB Online

0 voti

I just tried your example. While results of a kmeans clustering don't look too promising, the function clusterdata works quite well. What you should know a priori is the number of clusters in your data.

x = [1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21]';
y = [0.5 1 1.5 2 2.5 3 3.5 4 1 1.5 3 3.5 4.5 5.5 6 2 4 4.5 5.5 6.5 8]';
IDX = clusterdata([x y],'distance','chebychev','maxclust',3);
gscatter(x,y,IDX)

Regards, W.

2 Commenti
Mostra Nessuno Nascondi Nessuno

the cyclist il 3 Ago 2011

kmeans() seems to do better than clusterdata() to me, although neither very acceptably well. kmeans() at least gets most of the points attributed to the correct lines. clusterdata() combines almost all of the middle and right-hand lines into one cluster (at least with the syntax you provided).

Matt il 3 Ago 2011

Thank you for the help

Accedi per commentare.

Graph analysis question

3 Commenti
Mostra 1 commento meno recente Nascondi 1 commento meno recente

Risposta accettata

1 Commento
Mostra -1 commenti meno recenti Nascondi -1 commenti meno recenti

Più risposte (2)

1 Commento
Mostra -1 commenti meno recenti Nascondi -1 commenti meno recenti

2 Commenti
Mostra Nessuno Nascondi Nessuno

Categorie

Prodotti

Tag

Community Treasure Hunt

Graph analysis question

3 Commenti Mostra 1 commento meno recente Nascondi 1 commento meno recente

Risposta accettata

1 Commento Mostra -1 commenti meno recenti Nascondi -1 commenti meno recenti

Più risposte (2)

1 Commento Mostra -1 commenti meno recenti Nascondi -1 commenti meno recenti

2 Commenti Mostra Nessuno Nascondi Nessuno

Categorie

Prodotti

Tag

Vedere anche

Community Treasure Hunt

3 Commenti
Mostra 1 commento meno recente Nascondi 1 commento meno recente

1 Commento
Mostra -1 commenti meno recenti Nascondi -1 commenti meno recenti

1 Commento
Mostra -1 commenti meno recenti Nascondi -1 commenti meno recenti

2 Commenti
Mostra Nessuno Nascondi Nessuno