Why SVM-based fitcecoc function makes unexplainable misclassifications when 'fitPosterior' label is true?

Question

Omar Elnaggar il 20 Dic 2021

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/1614480-why-svm-based-fitcecoc-function-makes-unexplainable-misclassifications-when-fitposterior-label-is

Risposto: Sahil Jain il 22 Dic 2021

I am using the fitcecoc function with SVM template (and RBF kernel), and the 'onevsone' design matrix. The input dataset is purely constructed from 16-dimensional floating-point numbers (decimals) and the output should be one of 12 different class labels.

I know from my training and testing datasets, that few classes are overlapping so I expect some degree of misclassifications.

I noticed an interesting observation, that is, when the label 'fitPosterior' is false, the overall ECOC model (~70% accurate) makes misclassifications that can be explained in the light of the few overlapping classes. I verified this by removing one overlapping class and retraining the whole ECOC model, and the performance reflected an improvement.

Interestingly, when I enabled the 'fitPosterior' label to get some probabilities (not just hard output labels), the ECOC model overall performance relatively improved (~84% accurate) but with some persistent misclassifications. The difference this time is that these misclassifications are not with the overlapping classes anymore. Instead, the model misclassifies the incoming testing instances with very different classes (of little to no overlap).

To wrap up, I find it difficult trying to understand:

(1) Why the performance with 'fitPosterior' enabled showed relative improvement compared to with it disabled? Why this improved performance was associated with reduced explainability and bizzare misclassifications (without overlap between confused classes).

(2) How does 'fitPosterior' works as an algorithm? Is there any way through which we can have some control over how this "Posterior Probability Estimation" gets trained.

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Sahil Jain il 22 Dic 2021

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/1614480-why-svm-based-fitcecoc-function-makes-unexplainable-misclassifications-when-fitposterior-label-is#answer_859985

Hi Omar. By default, the software minimizes the Kullback-Leibler divergence to estimate class posterior probabilities. Other than KL divergence, Quadratic Programming can also be used (requires optimization toolbox). To know more about the algorithm, please refer to the Algorithms section of the "predict" function. To understand the behaviour of the algorithm, I'd suggest going through the references linked in the section.

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Why SVM-based fitcecoc function makes unexplainable misclassifications when 'fitPosterior' label is true?

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposte (1)

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

Why SVM-based fitcecoc function makes unexplainable misclassifications when 'fitPosterior' label is true?

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposte (1)

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti