I am currently trying to reduce the run-time of the MC portion by increasing the number of nodes and running in parallel. When running on one node with all cores available, there are no issues, but when launching across nodes with 'mpirun', there seems to be a conflict between the nodes when writing to the 'summary.h5' file.
instead of mpirun -np $SLURM_NNODES --bind-to core --map-by core python3 run_PBR_v_1_8.py
use srun
then Slurm takes care of launching the MPI ranks and handles the dispatching
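Concretely, the substitution for the launch line above might look like this (a sketch; the exact --mpi value depends on your site's Slurm build, see --mpi=list further down):

```shell
# Before: manual MPI launch, which fights Slurm over rank placement
# mpirun -np $SLURM_NNODES --bind-to core --map-by core python3 run_PBR_v_1_8.py

# After: let Slurm dispatch the MPI ranks itself
srun --mpi=pmi2 python3 run_PBR_v_1_8.py
```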
the script below runs a calculation on 2 nodes, each with 4 MPI ranks x 96 OMP threads
for a total of 768 threads
some aspects of your Slurm script will vary based on your HPC environment and OpenMC build
I always build OpenMC with Intel oneAPI, and I also build my own parallel HDF5
#SBATCH -J OMC-mpi # Job name
#SBATCH --nodes=2 # -N; number of nodes on which to run
#SBATCH --ntasks=8 # -n; number of tasks to run = Total Number of MPI ranks
#SBATCH --ntasks-per-node=4 # number of MPI tasks to invoke on each node
#SBATCH --cpus-per-task=96 # OMP threads per MPI rank
#SBATCH --threads-per-core=2 # use both hardware threads per core
#SBATCH --hint=multithread # Allow hyperthreads
#SBATCH --exclusive
#SBATCH --no-requeue
#SBATCH --output=job_%j.log # Standard output and error log
#SBATCH -vv
# MPI Settings
module load intel/2025.3.0
module load mpi/2021.17
export OMP_NUM_THREADS=$SLURM_CPUS_PER_TASK # use cpus-per-task
export OMP_PLACES=threads # one place per hardware thread
export OMP_PROC_BIND=close # close for cache locality
export I_MPI_DEBUG=9
export LD_LIBRARY_PATH=/opt/local/lib64:$LD_LIBRARY_PATH
export HDF5_USE_FILE_LOCKING=FALSE
srun -vv --mpi=pmi2 ~/home/bin/openmc -s $OMP_NUM_THREADS model*xml
Thanks for your reply. I tried to emulate your script and ran into 'invalid distribution specification' errors. The system architecture I am working with is listed below:
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
CPU(s): 96
On-line CPU(s) list: 0-95
Thread(s) per core: 1
Core(s) per socket: 48
Socket(s): 2
NUMA node(s): 16
Vendor ID: AuthenticAMD
CPU family: 25
Model: 17
Model name: AMD EPYC 9454 48-Core Processor
Stepping: 1
CPU MHz: 2750.000
CPU max MHz: 3810.7910
CPU min MHz: 1500.0000
BogoMIPS: 5499.87
Virtualization: AMD-V
L1d cache: 32K
L1i cache: 32K
L2 cache: 1024K
L3 cache: 32768K
NUMA node0 CPU(s): 0-5
NUMA node1 CPU(s): 6-11
NUMA node2 CPU(s): 12-17
NUMA node3 CPU(s): 18-23
NUMA node4 CPU(s): 24-29
NUMA node5 CPU(s): 30-35
NUMA node6 CPU(s): 36-41
NUMA node7 CPU(s): 42-47
NUMA node8 CPU(s): 48-53
NUMA node9 CPU(s): 54-59
NUMA node10 CPU(s): 60-65
NUMA node11 CPU(s): 66-71
NUMA node12 CPU(s): 72-77
NUMA node13 CPU(s): 78-83
NUMA node14 CPU(s): 84-89
NUMA node15 CPU(s): 90-95
Because of this, I defined my SLURM allocations accordingly. As far as I can see, my script is fairly similar to yours. Do you see anything that could be causing this issue? I am unfamiliar with srun, so I am unable to troubleshoot it proficiently.
for now, generate the model.xml and stick with running OpenMC directly
you are not familiar with MPI, so you are going to have difficulty executing an MPI OpenMC Python run
you can get back to Python later
#SBATCH -p od3
#SBATCH --nodes=2
#SBATCH --ntasks=32 # 2 nodes x 16 NUMA domains
#SBATCH --ntasks-per-node=16 # one MPI rank per NUMA domain
#SBATCH --cpus-per-task=6 # CPUs per NUMA domain
#SBATCH --exclusive
export OMP_NUM_THREADS=$SLURM_CPUS_PER_TASK # use cpus-per-task
srun -vv --mpi=pmi2 ~/home/bin/openmc -s $OMP_NUM_THREADS model*xml
you may need a different --mpi depending on your HPC
check srun --mpi=list
use whatever is available, preferably pmi2 or newer (pmix, pmix_v3, pmix_v5, etc.)
you may also need module loads depending on your HPC
you're on your own for that
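As a sanity check, the decomposition in the script above follows directly from the lscpu output in the earlier post (numbers taken from that listing):

```python
# Topology from lscpu: 96 CPUs per node, 16 NUMA domains, 1 thread per core.
cpus = 96
numa_domains = 16
slurm_nodes = 2

ntasks_per_node = numa_domains            # one MPI rank per NUMA domain
cpus_per_task = cpus // numa_domains      # CPUs available to each rank
ntasks = slurm_nodes * ntasks_per_node    # total MPI ranks across the job

print(ntasks, ntasks_per_node, cpus_per_task)  # 32 16 6
```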