parallel code execution on MATLAB cluster

As I run a code on a cluster using spmd, sometimes a worker gets disconnected and the execution stops. In another instance, the job became 'queued' after running for multiple hours and then eventually the execution stopped. What could be potential reasons for these?

1 Commento

Are you using Linux? Could you cofirm the maximum process is sufficient?
ulimit -a

Accedi per commentare.

Risposte (0)

Categorie

Scopri di più su MATLAB Parallel Server in Centro assistenza e File Exchange

Richiesto:

il 10 Gen 2018

Commentato:

il 11 Gen 2018

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by