How to export INT8 quantized weight of deep neural network?

Question

Jisu Kwon il 29 Mag 2024

1
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/2123701-how-to-export-int8-quantized-weight-of-deep-neural-network

Commentato: Angelo Yeo il 30 Mag 2024

Risposta accettata: Angelo Yeo

Apri in MATLAB Online

I trained neural network using Deep Learning Toolbox, and quantized it.

Below code is what I used to INT8 quantize network model.

% Create a dlquantizer object for quantization
quantObj = dlquantizer(net);
% quantOpts = dlquantizationOptions(target='host');
calibrate(quantObj,imdsTrain);
% valResults = validate(quantObj, imdsValidation, quantOpts);
% valResults.Statistics
% Perform quantization
quantObj = quantize(quantObj);
qDetailsQuantized = quantizationDetails(quantObj)
% Save the quantized network
save('quantizedNet.mat', 'quantObj');
exportONNXNetwork(quantObj,'quantizedNet.onnx')

After quantization, I got quantized network quantObj .

However, I cannot access weight and bias which coverted to INT8 format.

When I display quantized networks' weight and bias using bwloe code,

>> disp(quantObj.Layers(2).Bias(:,:,1))
-6.9011793e-12

It still shows float type value.

Even I tried to export network as ONNX, MATLAB shows below warning,

>> exportONNXNetwork(quantObj,'quantizedNet.onnx')
Warning: Exported weights are not quantized when exporting quantized networks. 

How can I access INT8 quantized weight and bias value?

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Angelo Yeo il 30 Mag 2024

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/2123701-how-to-export-int8-quantized-weight-of-deep-neural-network#answer_1465151

Use the quantizationDetails function to extract quantization details.

You should inspect your qDetailsQuantized which was extracted with quantizationDetails. Would you look up the qDetailsQuantized.QuantizedLearnables?

The following example can be helpful for you.

Display quantization details for a neural network - MATLAB quantizationDetails (mathworks.com)

3 Commenti
Mostra 1 commento meno recenteNascondi 1 commento meno recente

Jisu Kwon il 30 Mag 2024

Apri in MATLAB Online

I found it, qDetailsQuantized.QuantizedLearnables was what I want...

It was already obviously shown in member of table.

>> qDetailsQuantized.QuantizedLearnables
ans =
8×3 table
Layer      Parameter          Value      
________    _________    _________________
"conv_1"    "Weights"    {3×3×1×60  int8 }
"conv_1"    "Bias"       {1×1×60    int32}
"conv_2"    "Weights"    {3×3×60×60 int8 }
"conv_2"    "Bias"       {1×1×60    int32}
"conv_3"    "Weights"    {3×3×60×56 int8 }
"conv_3"    "Bias"       {1×1×56    int32}
"conv_4"    "Weights"    {3×3×56×12 int8 }
"conv_4"    "Bias"       {1×1×12    int32}

I can access value like this.

>> conv_1_weight = qDetailsQuantized.QuantizedLearnables.Value(1)
conv_1_weight =
1×1 cell array
{3×3×1×60 int8}
>> conv_1_weight{:,:,:,1}
3×3×1×60 int8 array
ans(:,:,1,1) =
18   -16   -50
-6   -54   -10
-37   -49   -18

Thanks again for your response!

Angelo Yeo il 30 Mag 2024

Yes, exactly. Thanks for the feedback. It's great to know it worked for you.

Accedi per commentare.

How to export INT8 quantized weight of deep neural network?

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

3 Commenti
Mostra 1 commento meno recenteNascondi 1 commento meno recente

Più risposte (0)

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

How to export INT8 quantized weight of deep neural network?

0 Commenti Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

Risposta accettata

3 Commenti Mostra 1 commento meno recenteNascondi 1 commento meno recente

Più risposte (0)

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

0 Commenti
Mostra -2 commenti meno recentiNascondi -2 commenti meno recenti

3 Commenti
Mostra 1 commento meno recenteNascondi 1 commento meno recente