Data normalization using robust scaling

16 visualizzazioni (ultimi 30 giorni)
MByk
MByk il 4 Giu 2024
Commentato: Voss il 4 Giu 2024
Hello all, I am trying to implement "Robust Scaling" but I am confused. Should I use "all" argument for "median" and "iqr" functions?
Thanks for the help.
DataSet = readtable('Datasets/Test.csv');
DataSet = table2array(DataSet); % Row:7195 x Colums:22
RScaling = (DataSet - median(DataSet))./iqr(DataSet)

Risposta accettata

Voss
Voss il 4 Giu 2024
If you want to normalize all columns the same way (i.e., using the median and inter-quartile range of the entire data set), then use "all".
If you want to normalize each column separately (i.e., using each column's own median and inter-quartile range), then do not use "all". And in this case, it's best to use the dim argument set to 1, to explicitly say you want the median and iqr by column, in order to properly handle the situation that your data set has only one row.
  4 Commenti
Voss
Voss il 4 Giu 2024
You're welcome!

Accedi per commentare.

Più risposte (0)

Categorie

Scopri di più su Mathematics in Help Center e File Exchange

Prodotti


Release

R2024a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by