What do I see performance and numerical accuracy issues with quantized INT8 deep learning networks using GPU Coder in R2021a?

3 visualizzazioni (ultimi 30 giorni)
I’m generating code for a quantized deep learning network using GPU Coder but experiencing performance and numerical accuracy issues when using INT8 precision with cuDNN 8.
What versions of cuDNN are supported by GPU Coder in R2021a?

Risposta accettata

Bill Chou
Bill Chou il 24 Mar 2021
In R2021a, GPU Coder supports cuDNN 8.1.0. For more information, see Installing Prerequisite Products (GPU Coder). It is recommended to use this version of cuDNN as other versions have significant performance and accuracy issues with INT8 workflows.
When using GPU Coder with cuDNN 8.0.x to generate CUDA code for a quantized deep learning network in INT8 precision, you may experience different issues depending on the version of cuDNN 8.0 used. The table below summarizes the issues you may experience.

Più risposte (0)

Categorie

Scopri di più su Deep Learning Code Generation Fundamentals in Help Center e File Exchange

Prodotti


Release

R2021a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by