How to find rank within group
7 visualizzazioni (ultimi 30 giorni)
Mostra commenti meno recenti
I have a dataset with 100 different groups, and between 8-11 entries in each group. I want to rank them, within a group, by the value of a variable y. How would I do this?
Concretely, if I have 2 groups with 2 people each such that: y=[1; 3; 6; 5] group_id=[1;1;2;2]
then I want a third vector: rank=[1;2;2;1]
0 Commenti
Risposta accettata
Andrei Bobrov
il 27 Giu 2017
Modificato: Andrei Bobrov
il 27 Giu 2017
[~,a] = cellfun(@sort,accumarray(group_id,y,[],@(x){x}),'un',0);
rank = cell2mat(a);
or within for..end:
y=[1; 3; 6; 5];
group_id=[1;1;2;2];
rank = group_id;
a = unique(group_id);
for ii = 1:numel(a)
t = group_id == a(ii);
[~,rank(t)] = sort(y(t));
end
2 Commenti
Simon
il 7 Ago 2023
The result created by the cellfun-based method is different from the for-loop based method when group_id is not in sorted order. The within-group ranking is the same in both methods. But rank = cell2mat(a) put all 1st group's within-group ranks together, followed by the second group's.
I have came across similar problem and in a subsequent analysis I need to put rank as a new column side-by-side next to group_id. The cellfun-based method would give me erroneous result.
Più risposte (1)
Jess Lovering
il 26 Giu 2017
Do you mean that you want to sort the data by a specific array? You can do this using the sort function. It will also give you the indices of the new order so you can use that to apply to the other arrays, example:
y=[1; 3; 6; 5]
group_id=[1;1;2;2]
rank=[1;2;2;1]
[sorted_y,I]= sort(y)
sorted_group_id = group_id(I)
sorted_rank = rank(I)
Vedere anche
Categorie
Scopri di più su Regression in Help Center e File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!