This example does allow all data transfers to take place simultaneously. Whether or not the underlying
MPI
implementation and hardware can do this is something that this example evaluates.