Negative D2 score on training data after lassoglm fit

Question

T0m07 il 17 Nov 2024

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/2166873-negative-d2-score-on-training-data-after-lassoglm-fit

Risposto: Jaimin il 25 Nov 2024 alle 6:44

How can the deviance from a null model (i.e. betas all equal zero) be lower than the deviance from the full model? Surely lassoglm should choose betas all zero in this case?

From the code below, my d2Train is -0.0808.

[B, FitInfo] = lassoglm(table2array(indat.params.trainDataX), indat.params.trainDataY(:, minInd), 'poisson', 'Lambda', indat.combTable.bestLambdas(minInd), 'Alpha', indat.combTable.bestAlphas(minInd));
predCountsTrain = calculateRates(table2array(indat.params.trainDataX),B,FitInfo.Intercept)+eps;
predDevianceTrain = calculateDeviance(indat.params.trainDataY(:, minInd),predCountsTrain);
nullCountsTrain = calculateRates(table2array(indat.params.trainDataX),zeros(size(B)),FitInfo.Intercept)+eps;
nullDevianceTrain  = calculateDeviance(indat.params.trainDataY(:, minInd),nullCountsTrain);
d2Train = 1 - (predDevianceTrain ./ nullDevianceTrain);
function rates = calculateRates(x,y,int)
rates = exp((x * y) + int);
end
function dev = calculateDeviance(observed,predicted)
    scaledLogRatio = log(observed./predicted).*observed;
    rawDifference = observed-predicted;
    diffOfTerms = scaledLogRatio - rawDifference;
    dev = nansum(diffOfTerms)*2;
end

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Jaimin il 25 Nov 2024 alle 6:44

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/2166873-negative-d2-score-on-training-data-after-lassoglm-fit#answer_1549278

Apri in MATLAB Online

Hi @T0m07

The negative (d^2) value indicates that the full model's deviance is unexpectedly higher than the null model's. Kindly refer to the quick checks and fixes mentioned below:

Verify Calculations: Ensure “calculateRates” and “calculateDeviance’ are correctly implemented. Use a small constant (eps) to avoid division by zero.

function rates = calculateRates(x, y, int)
    rates = exp((x * y) + int);
end
function dev = calculateDeviance(observed, predicted)
    % Avoid division by zero or log of zero by adding a small constant
    observed = observed + eps;
    predicted = predicted + eps;
    
    % Calculate scaled log ratio and raw difference
    scaledLogRatio = log(observed ./ predicted) .* observed;
    rawDifference = observed - predicted;
    
    % Deviance calculation
    diffOfTerms = scaledLogRatio - rawDifference;
    dev = nansum(diffOfTerms) * 2;
end

Model Overfitting: Check if the model is overfitting. Adjust lambda and alpha values in “lassoglm” (https://www.mathworks.com/help/stats/lassoglm.html).

Data Issues: Inspect the data for anomalies or outliers that might affect predictions.

Poisson Assumptions: Ensure the Poisson model assumptions hold (mean ≈ variance). If not, consider alternatives like the negative binomial model.

Cross-validation: Use cross-validation to validate model performance and prevent overfitting.

By addressing these areas, you should improve model performance and resolve the deviance issue.

I hope this will be helpful.

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Negative D2 score on training data after lassoglm fit

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposte (1)

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Vedere anche

Categorie

Tag

Prodotti

Community Treasure Hunt

Negative D2 score on training data after lassoglm fit

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposte (1)

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Vedere anche

Categorie

Tag

Prodotti

Community Treasure Hunt

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti