Home Resources for Co-Design Programs Bem4i Exp_serialized_comp_and_comm

BEM4I miniApp - Serialized computation and communication

Another issue in the BEM4I kernel might be seen in a very long computation part which is followed by a quite long collective communication (MPI_Allreduce function). During this MPI communication, all threads are doing nothing. Since this presented code appears multiple-times within each iteration of the GMRES solver, we may expect that it will be repeated a thousand times.

Here we present useful computation and MPI communication during one matrix-vector multiplication.

OneIteration_longComputation_comm

OneIteration_comm

This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreements No 676553 (POP1) and 824080 (POP2).

Currently, the project receives funding from the European High-Performance Computing Joint Undertaking (JU) under grant agreement No 101143931 (POP3). The JU receives support from the European Union's Horizon Europe research and innovation programme and Spain, Germany, France, Portugal and the Czech Republic.