accuracy test higher than training

2 visualizzazioni (ultimi 30 giorni)
soepblik
soepblik il 9 Nov 2021
Hi,
i have the following inputs to the classification learner:
label 'A' has 191x15 data inputs
label 'B' has 339x15 data inputs
What i now did is take 75% of both label A and label B as trainings data
and the rest is test data.
the strange thing is that i see that with cross validation 5 folds the accuracy is about 71% for for example ensemble and the test accuracy is 76%.
What is the problem? because I think the test accuracy must be lower than the validation accuracy.
What is the best way to fix this?
thanks in advance!

Risposte (1)

Sulaymon Eshkabilov
Sulaymon Eshkabilov il 10 Nov 2021
You can try to employ random selection of the data for training and validation using randperm() or randsample(), for example.
Another approach to address the issue in general is to change the ratio of data for training and validation purposes, e.g. 65% vs. 35%.

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by