Question in OpenMC MPI-parallelization

antonis · September 24, 2015, 12:01pm

Dear all,

I have a question about MPI parallelization in OpenMC.

In Romano et al. (2014) it is written that OpenMC results are “reproducible” or independent of the number of MPI-processes that are used.

Does this mean that exactly the same results should be expected for the same problem when using different numbers of MPI processes?

Thanks,

Antonios

paulromano · September 24, 2015, 12:49pm

Hi Antonios,

The answer is “yes and no”. In a world without limited precision floating point numbers, the results truly would be exactly the same no matter how many MPI processes are used. In practice, this usually holds true in a run for a while, but eventually there will be some difference in a result due to the order of operations being different, e.g., an MPI_REDUCE of tally results across multiple processes.

Best regards,
Paul
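To illustrate the order-of-operations point above, here is a minimal sketch in plain Python (not OpenMC or MPI code). The helper names `running_sum` and `reduce_partials` are hypothetical; they only mimic each process summing its own scores before the partial sums are combined.

```python
def running_sum(values):
    """Add values left to right, as a single process would."""
    total = 0.0
    for v in values:
        total += v
    return total


def reduce_partials(per_rank_scores):
    """Toy reduction: each 'rank' sums its own scores, then the
    partial sums are added together in rank order."""
    partials = [running_sum(scores) for scores in per_rank_scores]
    return running_sum(partials)


# The same three scores, grouped for one process and for two processes.
serial = reduce_partials([[0.1, 0.2, 0.3]])      # (0.1 + 0.2) + 0.3
parallel = reduce_partials([[0.1], [0.2, 0.3]])  # 0.1 + (0.2 + 0.3)

print(serial == parallel)      # False: the last bit differs
print(abs(serial - parallel))  # a difference on the order of 1e-16
```

The two totals agree to roughly 16 significant digits, which is why the discrepancy typically only becomes visible late in a run or in the last printed digits.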

Sterling_Harper · September 24, 2015, 3:06pm

Am I correct in thinking that all of our particle tracks will be exactly reproducible in parallel?

paulromano · September 24, 2015, 11:36pm

Subject to the same caveat, yes.

maximeguo · March 24, 2022, 11:57am

Could this give rise to an issue of ‘fake statistical convergence’ with MPI parallelization? Is there any way to set a different seed for each node?

paulromano · March 24, 2022, 1:57pm

@maximeguo No, statistical convergence should look the same whether you are using MPI parallelization or not. There is no way to set a different seed for each node nor is that necessary – the parallelization works by dividing the total number of particles per batch over the MPI processes, and each particle is initialized with a different seed (that is based on the starting seed), so results from different MPI processes are independent of one another.
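As a rough illustration of the idea described above, the sketch below gives every particle its own random stream derived from a starting seed and a global particle index. This is an assumption-laden toy: it uses Python's `random` module and a made-up seed formula, not OpenMC's actual generator or seeding scheme, and `simulate_particle` and `run_batch` are hypothetical names.

```python
import random


def simulate_particle(base_seed, particle_id):
    """Stand-in for one particle history. The seed formula is a
    hypothetical choice, not the one OpenMC uses."""
    rng = random.Random(base_seed * 1_000_003 + particle_id)
    return rng.random()  # pretend this is the history's score


def run_batch(base_seed, n_particles, n_ranks):
    """Divide one batch over n_ranks toy 'processes' and collect
    the per-particle scores in global particle order."""
    results = {}
    for rank in range(n_ranks):
        # Each rank handles a contiguous slice of the batch.
        start = rank * n_particles // n_ranks
        stop = (rank + 1) * n_particles // n_ranks
        for pid in range(start, stop):
            results[pid] = simulate_particle(base_seed, pid)
    return [results[pid] for pid in range(n_particles)]


one_rank = run_batch(base_seed=42, n_particles=10, n_ranks=1)
three_ranks = run_batch(base_seed=42, n_particles=10, n_ranks=3)
print(one_rank == three_ranks)  # True: histories do not depend on the split
```

Because a particle's stream depends only on the starting seed and its global index, changing the number of processes changes who simulates a particle, not what the particle does.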

maximeguo · March 24, 2022, 2:12pm

Thanks a lot. When OpenMC accumulates tallies under MPI parallelization, it exchanges data after each batch, so a large number of particles per batch is favorable. But how many particles per batch is a good choice? For example, take a calculation with 500 batches of 50,000 particles per batch. Could we change it, in the extreme, to 5 batches of 5,000,000 particles per batch?

paulromano · March 24, 2022, 2:40pm

The best advice I can give is that your particles per batch should be large enough that the time spent on tally synchronization is a small percentage of the overall runtime. If you have a lot of tallies and not so many particles per batch, you could end up wasting a lot of time on parallel communication. However, you should be cautious not to go too far: a batch is a statistical realization, so if you only have 5 batches, you only have 5 realizations for each tallied quantity. This means, e.g., that you will have a higher Student’s t factor on confidence intervals.
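To put a number on the Student's t point, the sketch below estimates the two-sided 95% t factor for 5 and for 500 batches using only the standard library. It assumes the usual interval of the form mean ± t · s/√N with N − 1 degrees of freedom; the function names are hypothetical, and the quantile is found by simple numerical integration and bisection rather than a statistics package.

```python
import math


def t_pdf(x, dof):
    """Probability density of Student's t with `dof` degrees of freedom."""
    log_norm = math.lgamma((dof + 1) / 2) - math.lgamma(dof / 2)
    log_kernel = -(dof + 1) / 2 * math.log1p(x * x / dof)
    return math.exp(log_norm + log_kernel) / math.sqrt(dof * math.pi)


def t_cdf(t, dof, n=2000):
    """CDF for t >= 0: 0.5 by symmetry plus Simpson's rule on [0, t]."""
    h = t / n
    total = t_pdf(0.0, dof) + t_pdf(t, dof)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * t_pdf(i * h, dof)
    return 0.5 + total * h / 3


def t_factor(n_batches, confidence=0.95):
    """Two-sided t factor, found by bisection on [0, 50]."""
    dof = n_batches - 1
    target = 0.5 + confidence / 2
    lo, hi = 0.0, 50.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if t_cdf(mid, dof) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


print(f"{t_factor(5):.2f}")    # about 2.78 for 5 batches
print(f"{t_factor(500):.2f}")  # about 1.96 for 500 batches
```

With only 5 batches, the interval multiplier is roughly 40% larger than with 500, on top of the sample standard deviation itself being estimated from just 5 realizations.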
