Compare values of a matrix

Question

0 voti

I have a n*n matrix and want to group the numbers together in different classes. I will explain it with a 5*5 matrix. The upper and lower diagonal values are same while the diagonal matrix is always 1. This is a correlation matrix of pairs where (1,2)=(2,1). I have a threshold of 0.9 so whichever value is greater than 0.9 then both the values are clustered in the same group. I will start with row 1. I want to select the values greater than 0.9 which in this case is only 4th. Now I will group (1,4) in group A. I will also in this same iteration find the least value which should be less than 0.9. Here it is 3rd entry which is 0.75. So for the column 3 I move to row 3 in the second iteration. Now I will repeat the same process but I have to only group remaining columns which are not yet grouped which is 2,3,5 by the same rule. In this iteration only 5th is greater than 0.9 so I have (3,5) as the pair and the least value is (3,1) but since I have to only check 2,3,5 (1,4 are already grouped in A) I will look for minimum out of these two (2,5) which is 2. Now I move to row 2. In my algorithm now 2 is the only last remaining factor I'd look for the maximum value in the second row which is (2,5) and will group this remaining 2 with (2,3,5). I will repeat this process till all the items are grouped in groups A, B, C.... and so on. So to group the last remaining row number I just group them with the best value they have row number.

There would not be any case where a factor is common i.e they are in two groups. For example after (1,4) so when we do iteration for 3rd row then 4th column would not have greater value than 0.9 (most likely) because the factors are from certain image features which associates to each other in a way that since 1 is not associated with 3 very well and good with 4 hence 3 would also be not very well associateed with 4.

Please let me know if you have any questions as I think this may confuse you pretty much. Please let me know if you can let me know a simple algorithm which works for this. Thanks in advance.

My final output group will be (1,4),(2,3,5)

0.88  0.75  0.91  0.79
88  0001  0.76  0.74  0.97
75  0.76  0001  0.76  0.99
91  0.74  0.76  0001  0.80
79  0.97  0.99  0.80  0001

3 Commenti
Mostra 1 commento meno recente Nascondi 1 commento meno recente

Guillaume il 6 Lug 2019

I didn't understand the meaning of So for the column 3 I move to row 3 in the second iteration. I think you need to explain what happens on the 2nd step as well.

It would also help if you showed what is the final result for the above matrix.

Abhishek Singh il 6 Lug 2019

@Rik, @Guillaume I have updated the question with including the output and also the second iteration. Please let me know if you still could not get the question. Thank you.

Accedi per commentare.

Accedi per rispondere a questa domanda.

Follow Question

Answer 1

Guillaume il 8 Lug 2019

Apri in MATLAB Online

0 voti

I'm still not entirely clear on your whole algorithm, I think the following do what you want:

function clusters = cluster(m, threshold)
    columns = 1:size(m, 2);   %keep track of column indices remaining. clustered columns are removed from m when they are clustered
    clusters = {};          
    currentrow = 1;          %start on 1st row
    while ~isempty(m)
        tocluster = m(currentrow, :) > threshold;   %cluster columns whose value is above threshold on current row. Note that since m(currentrow, currentrow) is 1, it's always included in the cluster
        if sum(~tocluster) == 1  %if there's only one column left to cluster afterward
            tocluster = true(size(tocluster));    %then include it in the current cluster
        end
        clusters{end + 1} = columns(tocluster); %#ok<AGROW>       %get actual index of columns to cluster
        [~, newrow] = min(m(currentrow, :));         %find next row to start clustering from
        currentrow = columns(newrow);
        m = m(:, ~tocluster);           %get rid of columns that have been clustered.
        columns = columns(~tocluster);
    end
end

I certainly didn't understand what to do when there's only one column/row left to cluster. In the above it's included in the last cluster.

14 Commenti
Mostra 12 commenti meno recenti Nascondi 12 commenti meno recenti

Abhishek Singh il 17 Lug 2019

Modificato: Abhishek Singh il 17 Lug 2019

Apri in MATLAB Online

Hi @Guillaume, I was running the code with big matrix and it works fine however I tried with a small 4*4 matrix to see if its working as anticipated. Look at this matrix for example.

0000    0.9185    0.7956    0.8323
9185    1.0000    0.8004    0.8340
7956    0.8004    1.0000    0.8401
8323    0.8340    0.8401    1.0000

If my threshold is 0.9 then in the first iteration Img1 and Img2 are paired together and since the lowest value is Img3 it will go to row 3 in the next iteration. From here col1 and col2 are already removed from iteration so it won't find any value greater than 0.9 and hence will go to the lowest which is Img4 while not pairing with it and Img3 would be loner. In the next and last iteration Img 4 is the only one left so my final clusters should be (1,2),(3),(4).

But by the code I am getting (1,2) and (3,4). Could you please help me here? I can see that your question was the same that what to do with the last left one and you include it in the last cluster but I'd like it to be clustered separately since it had none of the pairs where threshold was met. Hope you understand this example. If not, please let me know.

Guillaume il 18 Lug 2019

Apri in MATLAB Online

The %#ok<AGROW> suppresses a warning in the editor that the array cluster grows inside the loop. It is typically a valid warning, you can usually preallocate the array. In this case however, you don't in advance how many clusters there will be, so you do have to grow the array in the loop. It is a cell array however and the number of clusters is not likely to be big so the impact on performance will be negligible.

If there is going to be a significant amount of clusters such that the array growing is having an impact then the alternative is to preallocate an array large enough (max size it can be is the number of rows) and trim it at the end:

function clusters = cluster(m, threshold)
    columns = 1:size(m, 2);   %keep track of column indices remaining. clustered columns are removed from m when they are clustered
    clusters = cell(1, size(m, 1));     %preallocate cell array. Probably too big.    
    currentrow = 1;          %start on 1st row
    clusteridx = 0;
    while ~isempty(m)
        clusteridx = clusteridx + clusteridx + 1;
        tocluster = m(currentrow, :) > threshold;   %cluster columns whose value is above threshold on current row. Note that since m(currentrow, currentrow) is 1, it's always included in the cluster
        clusters{clusteridx} = columns(tocluster);   %get actual index of columns to cluster
        [~, newrow] = min(m(currentrow, :));         %find next row to start clustering from
        currentrow = columns(newrow);
        m = m(:, ~tocluster);           %get rid of columns that have been clustered.
        columns = columns(~tocluster);
    end
    cluster = cluset(1:clusteridx);   %trim cell array to portion used
end

This will temporarily use more memory for a possible marginal speed gain.

Abhishek Singh il 18 Lug 2019

Thanks a lot! In that case I will use the AGROW version of code if I may need it to be more faster for a lot of images. Thanks again.

Accedi per commentare.

Answer 2

Bruno Luong il 6 Lug 2019

0 voti

It seems like you you want to cluster indexes according to correlation. In this case why not threshold you matrix, that gives some sort of connectivity graph, then using some graph technique to get the connex components.

There a a bunch of such function in File Exchange.

0 Commenti
Mostra -2 commenti meno recenti Nascondi -2 commenti meno recenti

Accedi per commentare.

Compare values of a matrix

3 Commenti
Mostra 1 commento meno recente Nascondi 1 commento meno recente

Risposta accettata

14 Commenti
Mostra 12 commenti meno recenti Nascondi 12 commenti meno recenti

Più risposte (1)

0 Commenti
Mostra -2 commenti meno recenti Nascondi -2 commenti meno recenti

Categorie

Tag

Community Treasure Hunt

Compare values of a matrix

3 Commenti Mostra 1 commento meno recente Nascondi 1 commento meno recente

Risposta accettata

14 Commenti Mostra 12 commenti meno recenti Nascondi 12 commenti meno recenti

Più risposte (1)

0 Commenti Mostra -2 commenti meno recenti Nascondi -2 commenti meno recenti

Categorie

Tag

Vedere anche

Community Treasure Hunt

3 Commenti
Mostra 1 commento meno recente Nascondi 1 commento meno recente

14 Commenti
Mostra 12 commenti meno recenti Nascondi 12 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recenti Nascondi -2 commenti meno recenti