How can I separate data into multiple groups?

6 visualizzazioni (ultimi 30 giorni)
Jayden Yeo
Jayden Yeo il 3 Mar 2023
Commentato: Jayden Yeo il 8 Mar 2023
Hi,
I have a csv with more than 50,000 rows (an extract is provide in the attached csv file).
I need to group the data as highlighted in yellow in the attached file. The numbers in each group are either very close to each other (difference of less than 1) or they are multiple of the smaller number (with tolerance of +/- 0.3).
How can I write the code such that it can name the highlighted group as 1, 2, 3 and so on? For those number that don't belong to a group, 0 will be their default group number.
Thanks for the help in advance.
  2 Commenti
Jayden Yeo
Jayden Yeo il 3 Mar 2023
Hi,
The colour is only to help to explain my question. The csv file (with more data) that I am working on does not have any coloured elements.

Accedi per commentare.

Risposte (1)

Jan
Jan il 3 Mar 2023
Modificato: Jan il 3 Mar 2023
data = [2416.015, 127.402, 382.165, 127.425, 127.3387, 127.406, 637.001, 127.405, 2240.913, ...
2257.54, 241.801, 3064.636, 441.559, 220.805, 220.799, 1204.011, 1547.622, 322.37, ...
322.43, 6482.511, 558.603, 279.301, 2234.423, 279.307, 279.31, 279.295, 3901.168, ...
3595.353, 90.315];
m = [true, abs(diff(data)) < 1]; % Distance is small
ini = strfind(m, [0, 1]); % Index where blocks are starting
p = zeros(size(data));
p(ini) = 1;
p = cumsum(p); % Count starts
m(ini) = true;
result = m .* p; % Use m as mask
format long g
disp([data.', result.'])
2416.015 0 127.402 0 382.165 0 127.425 1 127.3387 1 127.406 1 637.001 0 127.405 0 2240.913 0 2257.54 0 241.801 0 3064.636 0 441.559 0 220.805 2 220.799 2 1204.011 0 1547.622 0 322.37 3 322.43 3 6482.511 0 558.603 0 279.301 0 2234.423 0 279.307 4 279.31 4 279.295 4 3901.168 0 3595.353 0 90.315 0
  8 Commenti
Jayden Yeo
Jayden Yeo il 8 Mar 2023
@Jan: I have to admit that the problem is tricky, but the data I have is what I have shown as an extract in the csv file. I think your answer above is the best that I have, and thanks a lot for your help. Once the groups are defined, I will remove those groups that are zero, and then plot groups 1, 2, 3...and so on.

Accedi per commentare.

Prodotti


Release

R2015b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by