How to stop all workers simultaneously when an error occurs in one of the workers?
3 visualizzazioni (ultimi 30 giorni)
Mostra commenti meno recenti
Hi guys
I am working with parpool with n number of workers. It is likely that one of the workers returns error at some points. So, I would like to catch error by means of:
parfor i = 1:length(Data)
Try
Simulation(i);
catch ME
stop all workers; % Not the parpool.I want the workers to stop doing %simulations. I do not want them to be closed
change something in Simulation(i);
start workers to do simulation(i);
continue;
end
end
and make some changes and start workers again.
Could you please let me know how to handle it?
Regards,
Vahid
0 Commenti
Risposte (2)
Edric Ellis
il 27 Lug 2015
You can do this using parfeval to send off individual tasks for execution on the workers, and then you can call cancel() on those tasks if you spot an error. Something like this:
% Initiate the work on the workers:
for i = 1:length(Data)
f(i) = parfeval(@Simulation, 1, i);
end
% Check the results, cancel all execution if an error is spotted
completedSuccessfully = true;
for i = 1:length(f)
try
[idx, result] = fetchNext(f);
catch E
% Get here if a simulation threw an error
cancel(f);
completedSuccessfully = false;
break;
end
end
if ~completedSuccessfully
% do stuff...
end
Walter Roberson
il 24 Lug 2015
You can cancel() task objects. I think at one point I saw a way to determine all of the task IDs, but that is not something I have researched.
2 Commenti
Walter Roberson
il 25 Lug 2015
I see http://www.mathworks.com/help/distcomp/findtask.html and http://www.mathworks.com/help/distcomp/createcommunicatingjob.html and http://www.mathworks.com/help/distcomp/createtask.html
For example,
CreateTask(j, @Simulation, num2cell(1:length(Data)))
At the moment I do not see a way to access the results of one task other than to know which state it is in. I have not used these facilities so I am likely overlooking something.
Vedere anche
Categorie
Scopri di più su Parallel Computing Fundamentals in Help Center e File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!