This is the dlb-adapted version of the pils program. As the baseline program,
it iterates on different phases, splitted by MPI_barrier()
’s. Each phase
execute a given number of parallel regions (determined by the grain
parameter), which main body is a loop executing iterations of task-duration
.
Changes in the source code:
DLB_Borrow()
just before each parallel region.Changes in the execution (at run.sh
):