Simulation gets "stuck" between batches

Hi all, I am having an issue where OpenMC simulations get “stuck” between batches (e.g. a run completes N of N+M batches and then never makes further progress). When it gets stuck, each MPI process sits at 100% usage on a single thread, even though I have OpenMP configured to run 24 threads per MPI process during an actual batch. A given problem file always seems to get stuck on the same batch number, and running the same problem with fewer total particles (below the point where it gets stuck) completes successfully.

It appears my problem is similar, or possibly identical, to these two threads:

in which it sounded like this was an issue with the tracklength tally estimator. Changing to a collision estimator is not a good solution for my problem, I think, as I have significant vacuum/void regions. I will try changing the tally dimensions to see if I can work around it, but I wanted to ask whether there has been any progress on this bug. And if I do need to change the tally dimensions, does anyone understand what actually causes the hang?

I am running OpenMC v0.15.0.

Thank you,
Alex

Hi Alex, welcome to the community.
I also encountered a similar issue a long time ago, and it turned out that I had a geometry error. I found it by running in geometry debug mode, even with a low number of histories: openmc.run(geometry_debug=True). A minimal sketch is below.
Have you checked your geometry?
Sorry if this doesn’t solve your problem.
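What I had in mind is something like this (the particle count here is just an illustrative reduced value for the check):

import openmc

# Re-run the existing model with overlap checking enabled; a reduced
# particle count keeps the extra per-particle cost of the check manageable.
openmc.run(geometry_debug=True, particles=10000)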

Yes, I’ve tried running in geometry debug mode (with a much smaller number of particles) and it doesn’t turn up any cell overlaps or other geometry issues. I have not tried running the full number of particles in geometry debug mode, as I assumed it would take a very long time, but if that’s recommended I will try it.

Sorry Alex, I can’t recommend that if the calculation is that time-consuming and you need the time for other work.
I was just thinking that the problem might come from the geometry, with neutrons only reaching that region late in the run, since the random number sequence in any Monte Carlo code is reproducible rather than truly random, right?
But it may instead come from the MPI/OpenMP threading and not be a geometry problem at all. Have you tried using threads only? Rather than, say, 12 MPI ranks with 12 threads each, you could use 144 OpenMP threads via OMP_NUM_THREADS=144 or openmc.run(threads=144), as in the sketch below. But that will also need a lot of time, and right now time is limited.
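For example (the thread counts here are just illustrative):

import openmc

# OpenMP-only run: no mpi_args, so a single process is launched and all
# parallelism comes from the OpenMP threads.
openmc.run(threads=144)

# For comparison, a hybrid run (12 MPI ranks x 12 threads each) could be
# launched as:
# openmc.run(threads=12, mpi_args=['mpiexec', '-n', '12'])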

I hope other members can help you further. Sorry, Alex.

Right - when doing geometry debugging I ran enough particles that each cell was checked, but not the full particle count, because the user guide notes there is a computational cost and the full run already takes a long time.

I’ve tried a few run configurations, including a threads-only (no MPI) run, and all of them hang.

After posting I found these two issues on GitHub:

Changing the mesh dimensions by a factor of 2 did not change the behavior, but my simulations did complete either by switching from a tracklength to a collision estimator or by using RectilinearMesh instead of RegularMesh (see the sketch below). So if there are bugs in those code paths, they seem to still be present?
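For anyone who hits the same thing, here is a rough sketch of the two workarounds that let my runs complete; the mesh bounds, dimensions, and flux tally are illustrative, not my actual model:

import numpy as np
import openmc

# Illustrative mesh bounds and dimensions
nx, ny, nz = 50, 50, 50
lower_left = (-30.0, -30.0, -30.0)
upper_right = (30.0, 30.0, 30.0)

# Workaround 1: keep RegularMesh but force a collision estimator
reg_mesh = openmc.RegularMesh()
reg_mesh.lower_left = lower_left
reg_mesh.upper_right = upper_right
reg_mesh.dimension = (nx, ny, nz)

flux_collision = openmc.Tally(name='flux (collision)')
flux_collision.filters = [openmc.MeshFilter(reg_mesh)]
flux_collision.scores = ['flux']
flux_collision.estimator = 'collision'  # instead of the default tracklength

# Workaround 2: an equivalent RectilinearMesh with explicit grid planes,
# keeping the tracklength estimator
rect_mesh = openmc.RectilinearMesh()
rect_mesh.x_grid = np.linspace(lower_left[0], upper_right[0], nx + 1)
rect_mesh.y_grid = np.linspace(lower_left[1], upper_right[1], ny + 1)
rect_mesh.z_grid = np.linspace(lower_left[2], upper_right[2], nz + 1)

flux_tracklength = openmc.Tally(name='flux (rectilinear)')
flux_tracklength.filters = [openmc.MeshFilter(rect_mesh)]
flux_tracklength.scores = ['flux']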

Hi Alex, is it possible for you to share a script that reproduces the problem? A minimal model (materials, geometry, tallies, and settings) that gets stuck?

Unfortunately I’m not able to share the actual script. I did confirm that the small example in this issue (Simple Simulation runs into endless loop with openmc.MeshSurfaceFilter · Issue #2855 · openmc-dev/openmc · GitHub) hangs for me with RegularMesh but completes with RectilinearMesh.

Hi Alex, I tried to replicate the scenario from that GitHub issue and also got “stuck” when using RegularMesh, and it works if I change to RectilinearMesh, as you said. Here is the notebook: regmeshstuck.ipynb

However, when I plot the source defined by marquezj on a regular mesh to look at the xy and xz views, I see that the source itself is thin along the z-axis. Here are the plots:
[xy and xz plots of the source distribution]

So I expect the flux to be low in the region marquezj is interested in, x (-30 to 30), y (-30 to 30), z (30 to 60), as defined here:

mesh_surface = openmc.RegularMesh() # <-- Important
mesh_surface.lower_left = [-30, -30, 30]  # <-- Min z is important
mesh_surface.upper_right = [30, 30, 60]  # <-- Max z is important
mesh_surface.dimension = [1, 1, 1]

When I change the z-min in lower_left to be slightly higher or lower, OpenMC does not get “stuck”:

mesh_surface.lower_left = [-30, -30, 30.001]  # <-- Min z is important

It also works if you use [-30, -30, 29.999].
And, as predicted, the current is all zero.

If I move the mesh to z between 0 and 1 cm, where I know there are a lot of particles,

mesh_surface.lower_left = [-30, -30, 0]  # <-- Min z is important
mesh_surface.upper_right = [30, 30, 1]  # <-- Max z is important
mesh_surface.dimension = [1, 1, 1]

then, as expected, we get a nonzero surface current for that position and source configuration.
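For completeness, the current tally attached to that mesh looks roughly like this (the tally name is just illustrative):

import openmc

# mesh_surface is the RegularMesh defined in the block above; MeshSurfaceFilter
# scores partial currents across each face of the mesh.
current_tally = openmc.Tally(name='surface current')
current_tally.filters = [openmc.MeshSurfaceFilter(mesh_surface)]
current_tally.scores = ['current']

tallies = openmc.Tallies([current_tally])
tallies.export_to_xml()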

In another case, I used the original mesh (mesh_surface.lower_left = [-30, -30, 30]) but changed the source definition to the default isotropic point source at the origin by simply commenting out the source line:

# settings.source = source
settings.run_mode = 'fixed source'
settings.particles = 10000
settings.batches =  10

Then I expect the source to be distributed isotropically, including along the z-axis, like this:
[plot of the isotropic default source distribution]

and we can expect the surface current to be nonzero on x (-30 to 30), y (-30 to 30), z (30 to 60), and here is the output of the surface current tally.

So if this is the same situation you are encountering, you might want to plot your source using a regular mesh tally to see the distribution you defined, and then slightly shift the mesh coordinates if the problem is in the mesh definition; a rough sketch of how to do that is below.
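The mesh bounds, dimensions, tally name, and statepoint filename here are all illustrative:

import matplotlib.pyplot as plt
import openmc

# Coarse mesh covering the whole geometry, just to visualize where particles go
mesh = openmc.RegularMesh()
mesh.lower_left = (-50.0, -50.0, -50.0)
mesh.upper_right = (50.0, 50.0, 50.0)
mesh.dimension = (50, 50, 50)

flux_map = openmc.Tally(name='flux map')
flux_map.filters = [openmc.MeshFilter(mesh)]
flux_map.scores = ['flux']

# ... add flux_map to the model's tallies, export to XML, and run OpenMC ...

# After the run, reshape the tally result and look at an xz slice
with openmc.StatePoint('statepoint.10.h5') as sp:
    tally = sp.get_tally(name='flux map')
    # Mesh filter bins vary fastest in x, so reshape the flat array as (nz, ny, nx)
    flux = tally.mean.ravel().reshape(mesh.dimension[::-1])

plt.imshow(flux[:, mesh.dimension[1] // 2, :], origin='lower')
plt.xlabel('x bin')
plt.ylabel('z bin')
plt.title('flux in the xz plane (y midplane)')
plt.show()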
Sorry if this doesn’t help solve your problem.