Implement the "total variation distance" (TVD) in Matlab

Question

Sim il 3 Lug 2023

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/1991183-implement-the-total-variation-distance-tvd-in-matlab

Modificato: Bruno Luong il 4 Ago 2023

Risposta accettata: Bruno Luong

Apri in MATLAB Online

I am trying to implement the Total variation distance of probability measures (TVD) in Matlab.

Would it be correct to use the max function, in order to calculate the "supremum" of the TVD equation (here below)?

My attempt:

% Input
A =[     0.444643925792938         0.258402203856749
         0.224416517055655         0.309641873278237
        0.0730101735487732         0.148209366391185
        0.0825852782764812        0.0848484848484849
        0.0867743865948534        0.0727272727272727
        0.0550568521843208        0.0440771349862259
       0.00718132854578097        0.0121212121212121
       0.00418910831837223        0.0336088154269972
       0.00478755236385398        0.0269972451790634
       0.00359066427289048       0.00110192837465565
       0.00538599640933573       0.00220385674931129
      0.000598444045481747                         0
       0.00299222022740874       0.00165289256198347
                         0                         0
       0.00119688809096349      0.000550964187327824
                         0      0.000550964187327824
       0.00119688809096349      0.000550964187327824
                         0      0.000550964187327824
                         0      0.000550964187327824
      0.000598444045481747                         0
      0.000598444045481747                         0
                         0                         0
                         0      0.000550964187327824
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0      0.000550964187327824
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
                         0                         0
       0.00119688809096349      0.000550964187327824];
P   = A(:,1);
Q   = A(:,2);
% Total variation distance (of probability measures)
d = max(abs(P-Q))
d = 0.1862

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Bruno Luong il 4 Ago 2023

1
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/1991183-implement-the-total-variation-distance-tvd-in-matlab#answer_1283367

Modificato: Bruno Luong il 4 Ago 2023

Apri in MATLAB Online

Supremum is very often implemented by max, since one can only list or compute a finite set on computer.

However your formula d = max(abs(P-Q)) is not correct to compute TVD.

According to this wiki page; correct formula is given bellow "When Ω is countable"

d = 0.5 * norm(P-Q,1)

or

d = 0.5 * sum(abs(P-Q));

8 Commenti
Mostra 6 commenti meno recentiNascondi 6 commenti meno recenti

Sim il 4 Ago 2023

Modificato: Sim il 4 Ago 2023

Apri in MATLAB Online

OK... so, basically, what I wrote initially i.e.

d = max(abs(P-Q))

was not fully correct, right?

I tried to compare all your code to what I wrote initially, and there is a small difference between what you did and what I wrote initially:

% Generate random test discrete probability density (pdf) P and Q
n = 5;
P=rand(1,n); P=P/sum(P);
Q=rand(1,n); Q=Q/sum(Q);
% Compute TVD using definition
n = length(P);
b = logical(dec2bin(0:2^n-1)-'0');
d = zeros(1,size(b,1));
for k=1:size(b,1)
    bk = b(k,:);
    Pa = sum(P(bk));
    Qa = sum(Q(bk));
    dPaQa = abs(Pa-Qa);
    d(k)= dPaQa;
end
dPQ = max(d)                 % <-- (1) First equation for TVD (from Wikipedia's Definition)
dPQ = 0.4406
dFormula = 0.5 * norm(P-Q,1) % <-- (2) Second equation for TVD (from Wikiperida's Properties)
dFormula = 0.4406
d_Sim = max(abs(P-Q))        % <-- what I wrote initially
d_Sim = 0.2920

Final message to future readers: What I wrote initially is not correct. Please use the @Bruno Luong's code! :-)

Bruno Luong il 4 Ago 2023

Modificato: Bruno Luong il 4 Ago 2023

Apri in MATLAB Online

Don't use the brute force implementation of the initial definition for any discrete pdf with more than 20 values (n = cardinal of Omega), rather use

dFormula = 0.5 * norm(P-Q,1)

The for-loop I made is just to illustrate the correctness of the formula. Just like no-one would computes the determinant of matrix 30 x 30 using Leibniz formula.

Sim il 4 Ago 2023

Apri in MATLAB Online

Ah ok..great..!! Many many thanks!

Then, I will use:

dFormula = 0.5 * norm(P-Q,1)

Accedi per commentare.

Answer 2

Debadipto il 4 Ago 2023

1
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/1991183-implement-the-total-variation-distance-tvd-in-matlab#answer_1283242

Hi Sim,

Upon searching, I found the exact question being asked on stackoverflow (I'm assuming it was posted by you only), where somebody has already answered the question. I am attaching the link to that answer for future reference:

max - Implement the "Total variation distance of probability measures" in Matlab - Stack Overflow

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Sim il 4 Ago 2023

Modificato: Sim il 4 Ago 2023

Yes exactly! :-)

Accedi per commentare.

Implement the "total variation distance" (TVD) in Matlab

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

8 Commenti
Mostra 6 commenti meno recentiNascondi 6 commenti meno recenti

Più risposte (1)

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Vedere anche

Categorie

Tag

Community Treasure Hunt

Implement the "total variation distance" (TVD) in Matlab

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

8 Commenti Mostra 6 commenti meno recentiNascondi 6 commenti meno recenti

Più risposte (1)

1 Commento Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti

Vedere anche

Categorie

Tag

Community Treasure Hunt

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

8 Commenti
Mostra 6 commenti meno recentiNascondi 6 commenti meno recenti

1 Commento
Mostra -1 commenti meno recentiNascondi -1 commenti meno recenti