Distribution-balanced stratified cross-validation

Versione 1.1.0 (1,9 KB) da Jan Motl
An improvement to stratified cross-validation for small imbalanced data sets.
116 download
Aggiornato 12 ott 2019

Visualizza la licenza

Distribution optimally balanced stratified cross-validation (DOB-SCV) partitions a data set into n folds in such a way that a balanced distribution in feature space is maintained for each class, in addition to stratification based on the label.

The real-world effect of using DOB-SCV, instead of stratified cross-validation, is slightly higher testing accuracy. The biggest improvements can be expected on small, class imbalanced data sets.

The implementation can be used as a drop-in replacement for CVPARTITION.

Reference: Study on the Impact of Partition-Induced Dataset Shift on k-Fold Cross-Validation available from https://ieeexplore.ieee.org/document/6226477

Cita come

Jan Motl (2024). Distribution-balanced stratified cross-validation (https://www.mathworks.com/matlabcentral/fileexchange/72963-distribution-balanced-stratified-cross-validation), MATLAB Central File Exchange. Recuperato .

Compatibilità della release di MATLAB
Creato con R2018a
Compatibile con qualsiasi release
Compatibilità della piattaforma
Windows macOS Linux
Categorie
Scopri di più su Statistics and Machine Learning Toolbox in Help Center e MATLAB Answers

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!
Versione Pubblicato Note della release
1.1.0

Speed up

1.0.0