opengate Multiprocessing

Enable GATE 10 to split a simulation into multiple parallel processes. THIS IS WORK IN PROGRESS

First implemented items:

split run timing intervals
adapt dynamic objects (run-based)
spawn processes via Pool
write output into a separate subfolder per process

Still missing:

merge actor output from different processes

Oct 09 '24 23:10 nkrah

Yes, but letting the user define the total number of processes rather than the process per run is more intuitive and will not require an API change if we implement a more advanced splitting scheme in the future. So I think it's better this way.

On Oct 11 2024, at 11:00 am, David Sarrut @.***> wrote:

@dsarrut commented on this pull request.

In opengate/managers.py (https://github.com/OpenGATE/opengate/pull/486#discussion_r1796655841):
                       output = se.run_engine()
             return output
   def run(self, start_new_process=False):
   def generate_run_timing_interval_map(self, number_of_processes):
           if number_of_processes % len(self.run_timing_intervals) != 0:
why ? I thought we just divide ALL time_interval by the number_of_processes

—

Reply to this email directly, view it on GitHub (https://github.com/OpenGATE/opengate/pull/486#pullrequestreview-2362317849), or unsubscribe (https://github.com/notifications/unsubscribe-auth/AIFQFYM2QBJ7YWBUP4ELK2TZ26HTVAVCNFSM6AAAAABPVS5KBSVHI2DSMVQWIX3LMV43YUDVNRWFEZLROVSXG5CSMV3GSZLXHMZDGNRSGMYTOOBUHE).

You are receiving this because you authored the thread.

Oct 11 '24 09:10 nkrah

I figured out a flexible mechanism to merge data back into one single actor output (if data is mergeable: true for images, not true yet for ROOT). We will need a new type of method, common to all actors, namely FinalizeSimulation(), to be triggered from the Simulation after all processes have finished. Writing the combined output (from the processes) to disk will be done in FinalizeSimulation(). The EndOfSimulation(), where writing currently takes place, is called inside the process and therefore before combining the output. We can also add an option to not store intermediate, i.e. per process, output on disk if not needed. For example: images are accessible directly via memory and can be merged that way. No need to access data from disk.

Note: FinalizeSimulation() will not have access to engines because they do not exist any more outside of the subprocess.

Oct 11 '24 09:10 nkrah

New: The following actors now work in multiprocessing (local machine):

SimulationStatisticsActor: data is merged in memory and accessible after the simulation; written to disk if requested
Actors with ROOT output: root files (from subdirectories per process) are merged into new root file in main output folder structure. Event IDs are automatically incremented. RunIDs are recreated as per the original simulation.

Works with test019_phsp_actor -> created a new variant of the test.

Still need to create variants of other tests that use ROOT output to check.

Oct 28 '24 16:10 nkrah

@nkrah I think all actors shall have atomic variables, this way all actors will be thread safe by default, watch this library https://pypi.org/project/atomicx/ . It already implemented the atomic doubles on my suggestion

from atomicx import AtomicFloat

# Create an atomic float with an initial value of 0.0
atom = AtomicFloat()
print(f"Initial Value: {atom.load()}")

# Perform atomic operations
atom.store(3.14)
value = atom.load()
print(f"Value: {value}")

# See docs for more operations

Nov 13 '24 08:11 BishopWolf

@BishopWolf Thanks for the suggestion. I think atomic doubles will be useful for certain parts of the actors.

Bear in mind that this PR is about multiprocessing, i.e. running a (independent) simulation in a newly spawn process. There is no issue with shared memory handling in this case.

Concerning multithreading: We are actually using the multithreading architecture from Geant4 which means that not every part of a simulation runs in separate threads, only certain methods. Therefore, only certain shared data structures, e.g. images into which all threads write, need to be thread safe. Currently, there is no python-side function that accesses shared data on a per-thread basis, only C++ functions. In case this changes in the future, I think the package you suggest could be a good option.

Nov 13 '24 08:11 nkrah

I will pick this up again once PR #599 is done.

Nov 28 '24 13:11 nkrah