MEDYANSimRunner.jl

Manage long running restartable MEDYAN.jl simulations.

Simulations run using julia code in a main.jl script and write outputs to an output directory.

Inspired by how build scripts work in https://github.com/JuliaPackaging/BinaryBuilder.jl

Installation

First install and run Julia https://julialang.org/downloads/

Then in Julia install this repo as a regular Julia package.

import Pkg
Pkg.add("MEDYANSimRunner")

Warning: MEDYANSimRunner may be incompatible with a future release of Julia.

Specifically MEDYANSimRunner expects copy(Random.default_rng()) to be Xoshiro. This is a Julia internal.

Example

Run the following in the root of this repo.

julia --project=test -e 'using Pkg; pkg"dev ."; pkg"instantiate";'
JULIA_LOAD_PATH="@" julia --project=test --startup-file=no test/example/main.jl --out=test/output --batch=1 --continue

This will run the 1st batch of the example simulation in test/example/main.jl with the test/ environment and store the output in test/output/.

The output directory will be created if it doesn't already exist.

If the "--batch=<job index or range>" option is not included, all jobs specified in main.jl will be run. <job index or range> can be a : delimited range for example 1:3:end to run every third job.

`main.jl` script

This file contains the julia functions used when running the simulation. These functions can modify the input state variable, but in general should return the state. These functions can also use the default random number generator, this will automatically saved and loaded.

At the end of main.jl there should be the lines:

if abspath(PROGRAM_FILE) == @__FILE__
    MEDYANSimRunner.run(ARGS; jobs, setup, loop, load, save, done)
end

To run the simulation if main.jl is called as a julia script.

Performance Profiling with ZoneProfilers

MEDYANSimRunner integrates with ZoneProfilers.jl to provide detailed performance instrumentation of your simulations. This allows you to visualize where time is spent in your setup, loop, save, load, and done functions, as well as in file I/O operations.

Using the Profiler

To enable profiling, pass a profiler instance to the run function:

if abspath(PROGRAM_FILE) == @__FILE__
    # Open and connect to the tracy GUI
    using ZoneProfilerTracy: TracyProfiler
    import TracyProfiler_jll
    profiler = TracyProfiler(TracyProfiler_jll)
    MEDYANSimRunner.run(ARGS; jobs, setup, loop, load, save, done, profiler)
end

Adding Custom Profiling

You can add additional profiling zones within your user functions by using the profiler keyword argument:

function loop(step::Int, state; output, profiler=NullProfiler())
    state = @zone profiler compute_physics(state)
    @zone profiler collect_data!(output, state)
    return state
end

For production runs without profiling overhead, simply omit the profiler parameter (defaults to NullProfiler() which has zero runtime cost).

See the ZoneProfilers.jl documentation for more advanced profiling features.

Standard input parameters.

step::Int: starts out at 0 after setup and is incremented right before every call to loop.

`jobs::Vector{String}`

A vector of jobs to run. Each job represents one variant of the simulation that can be run. This is useful if many simulations need to be run in parallel. The "--batch=<job index>" argument can be used to pick just one job to run.

The selected job string gets passed to the setup function in main.jl. The job string is also used to seed the default RNG right before setup is called.

`setup(job::String; kwargs...) -> header_dict, state`

Return the header dictionary to be written as the header.json file in output trajectory. Also return the state that gets passed on to loop and the state that gets passed to save and load.

job::String: The job. This is used for multi job simulations.

`save(step::Int, state; kwargs...)-> group::SmallZarrGroups.ZGroup`

Return the state of the system as a SmallZarrGroups.ZGroup This function should not mutate state When saving the snapshot, this group will get saved as "snap"

`load(step::Int, group::SmallZarrGroups.ZGroup, state; kwargs...) -> state`

Load the state saved by save This function can mutate state. state may be the state returned from setup or the state returned by loop. This function should return the same output if state is the state returned by loop or the state returned by setup.

`done(step::Int, state; kwargs...) -> done::Bool, expected_final_step::Int`

Return true if the simulation is done, or false if loop should be called again.

Also return the expected value of step when done will first be true, used for displaying the simulation progress.

This function should not mutate state

`loop(step::Int, state; output::SmallZarrGroups.ZGroup, kwargs...) -> state`

Return the state that gets passed to save and load

Optionally, mutate the output keyword argument. When saving the snapshot, this group will get saved as "out"

Main loop pseudo code

activate and instantiate the environment
include("main.jl")
create output directory based on job if it doesn't exist
Random.seed!(collect(reinterpret(UInt64, sha256(job))))
job_header, state =  setup(job)
save job_header
step = 0
group = ZGroup(childern=Dict("snap" => save(step, state))
SmallZarrGroups.save_zip(snapshot_zip_file, group)
state = load(step, SmallZarrGroups.load_zip(snapshot_zip_file)["snap"], state)
while true
    step = step + 1
    output = ZGroup()
    state = loop(step, state; output)
    group = ZGroup(childern=Dict("snap"=>save(step, state), "out"=>output)
    SmallZarrGroups.save_zip(snapshot_zip_file, group)
    state = load(step, SmallZarrGroups.load_zip(snapshot_zip_file)["snap"], state)
    if done(step::Int, state)[1]
        break
    end
end

`output` directory

The output directory has a subdirectory for each job's output. The job string is the name of the subdirectory.

Each job's output subdirectory has the following files.

`logs/<timestamp_randomstring>/{info|warn|error}.log`

Any logs, warnings, and errors generated by the simulation are saved in these files.

`traj/header.json`

A description of the system.

`traj/<i÷1000>/<i%1000 zero padded to 3 digits>.zip`

Contains the snapshot at the end of the i'th step of the simulation. The state returned by setup is stored in 0/000.zip The step_path function can be used to convert for example 123 to "0/123.zip" The steps_traj_dir function can be used to get the steps of snapshots in a "traj" directory. The user data is stored in the "snap" and "out" sub groups. The root group contains some metadata used by MEDYANSimRunner.

`traj/footer.json`

This is created to show a trajectory is complete. It contains some metadata about the trajectory.

Name		Name	Last commit message	Last commit date
Latest commit History 96 Commits
.github/workflows		.github/workflows
src		src
test		test
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
Project.toml		Project.toml
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MEDYANSimRunner.jl

Installation

Warning: MEDYANSimRunner may be incompatible with a future release of Julia.

Example

`main.jl` script

Performance Profiling with ZoneProfilers

Using the Profiler

Adding Custom Profiling

Standard input parameters.

`jobs::Vector{String}`

`setup(job::String; kwargs...) -> header_dict, state`

`save(step::Int, state; kwargs...)-> group::SmallZarrGroups.ZGroup`

`load(step::Int, group::SmallZarrGroups.ZGroup, state; kwargs...) -> state`

`done(step::Int, state; kwargs...) -> done::Bool, expected_final_step::Int`

`loop(step::Int, state; output::SmallZarrGroups.ZGroup, kwargs...) -> state`

Main loop pseudo code

`output` directory

`logs/<timestamp_randomstring>/{info|warn|error}.log`

`traj/header.json`

`traj/<i÷1000>/<i%1000 zero padded to 3 digits>.zip`

`traj/footer.json`

About

Uh oh!

Releases 14

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

medyan-dev/MEDYANSimRunner.jl

Folders and files

Latest commit

History

Repository files navigation

MEDYANSimRunner.jl

Installation

Warning: MEDYANSimRunner may be incompatible with a future release of Julia.

Example

main.jl script

Performance Profiling with ZoneProfilers

Using the Profiler

Adding Custom Profiling

Standard input parameters.

jobs::Vector{String}

setup(job::String; kwargs...) -> header_dict, state

save(step::Int, state; kwargs...)-> group::SmallZarrGroups.ZGroup

load(step::Int, group::SmallZarrGroups.ZGroup, state; kwargs...) -> state

done(step::Int, state; kwargs...) -> done::Bool, expected_final_step::Int

loop(step::Int, state; output::SmallZarrGroups.ZGroup, kwargs...) -> state

Main loop pseudo code

output directory

logs/<timestamp_randomstring>/{info|warn|error}.log

traj/header.json

traj/<i÷1000>/<i%1000 zero padded to 3 digits>.zip

traj/footer.json

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 14

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

`main.jl` script

`jobs::Vector{String}`

`setup(job::String; kwargs...) -> header_dict, state`

`save(step::Int, state; kwargs...)-> group::SmallZarrGroups.ZGroup`

`load(step::Int, group::SmallZarrGroups.ZGroup, state; kwargs...) -> state`

`done(step::Int, state; kwargs...) -> done::Bool, expected_final_step::Int`

`loop(step::Int, state; output::SmallZarrGroups.ZGroup, kwargs...) -> state`

`output` directory

`logs/<timestamp_randomstring>/{info|warn|error}.log`

`traj/header.json`

`traj/<i÷1000>/<i%1000 zero padded to 3 digits>.zip`

`traj/footer.json`

Packages