I know MPI_Barrier() can guarantee synchronization, but when one of the hosts occurring error, the other hosts all cannot continue to run. what I wish is that even if some of the processes occur error, the remaining processes will still work. Dose any one
knows whether the MPI provides such a function,?
Very grateful.