CUDA_ERROR_ILLEGAL_ADDRESS
The graphics card is a GIGABYTE GTX 1070 Mini ITX with the latest driver. MATLAB is R2016a, and the CUDA ToolkitVersion reported by MATLAB is 7.5.
- A gpuDevice(1) call takes 4 minutes to complete, every time. I later realized that any GPU call after MATLAB starts causes NVIDIA to rebuild its cache in ...\AppData\Roaming\NVIDIA\ComputeCache\, and this is what takes so long. Note that only the first call of train(net1,x,t,'useGPU','yes') has this overhead; subsequent calls are fast. But gpuDevice(1) always forces the NVIDIA cache rebuild.
- A call to train(net2,xg,tg) stops with the error: "Error using gpuArray/gather. An unexpected error occurred during CUDA execution. The CUDA error was: CUDA_ERROR_ILLEGAL_ADDRESS". And that is the end of the game: after a train call with arguments prepared by nndata2gpu, no other GPU calls can be made, not even gpuArray(). Every call ends with an error. MATLAB must be restarted (and the NVIDIA cache rebuilt) before train can be used again, at least with the useGPU flag.
Has anybody had a similar experience? Or does anybody have a clue what is going on and whether this is solvable? Thank you.
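For anyone trying to reproduce the two symptoms above, a minimal sketch might look like the following. It is not the poster's actual script: the toy dataset (simplefit_dataset) and the small feedforwardnet are hypothetical stand-ins for the poster's x, t and net2, and it assumes the data is converted with nndata2gpu, as in the MATLAB documentation for GPU training. Whether reset recovers the device after CUDA_ERROR_ILLEGAL_ADDRESS is not confirmed by this thread; the poster reports that only a MATLAB restart helped.

```matlab
% Symptom 1: time device selection (the poster reports about 4 minutes per call).
tic;
g = gpuDevice(1);
fprintf('gpuDevice(1) took %.1f s\n', toc);

% Hypothetical stand-ins for the poster's data and network.
[x, t] = simplefit_dataset;
net2 = feedforwardnet(10);
net2 = configure(net2, x, t);   % configure with MATLAB arrays before GPU training

% Convert the data to formatted gpuArrays.
xg = nndata2gpu(x);
tg = nndata2gpu(t);

% Symptom 2: training on gpuArray data, guarded so the error is reported.
try
    net2 = train(net2, xg, tg);
    y = gpu2nndata(net2(xg));   % bring the network outputs back to the host
    disp('GPU training completed.');
catch err
    fprintf('GPU training failed: %s\n', err.message);
    try
        reset(g);               % attempt to reinitialize the device; may not help
    catch resetErr
        fprintf('Device reset failed: %s\n', resetErr.message);
    end
end
```

If the timing line reports minutes rather than seconds on every call, that matches the cache rebuild described in the first bullet; if the catch branch fires, err.message should contain the CUDA error text quoted in the second.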
3 Comments
Dalibor Knis
on 17 Sep 2016
mizuki
on 18 Sep 2016
The GTX 1070 and 1080 have some problems with MATLAB R2016a and R2016b, which are probably caused by the cuDNN library. I recommend using another GPU if you have one.
Dalibor Knis
on 24 Sep 2016
Accepted Answer
More Answers (2)
David Jaquier
on 29 Aug 2018
Edited: David Jaquier
on 29 Aug 2018
1 vote
Hi everybody, I had the same problem as Dalibor. I have a GTX 1070 (driver version 399.07) and MATLAB R2018a. Last week the MATLAB example "DeepLearningFasterRCNNObjectDetectionExample.mlx" ran properly, and today the program raised a CUDA_ERROR_ILLEGAL_ADDRESS. I had to do a clean reinstall of the new driver, and the problem was solved.
I hope this helps somebody else.
Nicao
on 30 Aug 2018
1 vote
Hi everybody, I have the same problem as you. I have a K80 (driver version 384.130) with MATLAB R2017a. I am trying to use a VGG19 network with large images (800*800 pixels) and I get the following message: "An unexpected error occurred during CUDA execution. The CUDA error was: an illegal memory access was encountered", "CUDA_ERROR_ILLEGAL_ADDRESS", even though there is still 5 GB of free memory on the GPU.
The error occurs during backpropagation.
According to vl_simplenn_display, the data will take only 2 GB of memory.
I also tried on a P100 with the same driver version (384.130) and I still have the problem.
I don't have any problem if I use smaller images (250*250 pixels).
Do you have any idea what the problem could be?