Runs N' Poses dataset pipeline #68

Open
Nonso-Duaka wants to merge 1 commit into gnina:main from Nonso-Duaka:runs-n-poses-evaluation

@Nonso-Duaka

Summary

Adds the Runs N' Poses (RnP) dataset processing pipeline and training-launch scripts.

What's new

  • omtra_pipelines/runsNposes_dataset/ — full RnP zarr build pipeline (parquet builder, processor, writer, SLURM driver)
  • omtra_pipelines/plinder_ligand_properties/run_pipeline*.slurm — populates ligand/extra_feats on RnP and PLINDER time-split zarrs
  • omtra_pipelines/plinder_clustering/ — fingerprint blocks for clustering at 8/12 Å pocket cutoffs
  • omtra_pipelines/plinder_dataset/generate_filtered_parquet.py — builds the filtered PLINDER training parquet
  • train_omtra_gnn_cmds.txt + train_omtra_multigpu.slurm — training array launchers
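
For readers unfamiliar with pocket cutoffs, here is a minimal sketch of how an 8 Å and a 12 Å cutoff could select different pockets. It assumes a pocket is the set of protein atoms within the cutoff of any ligand atom; the scripts in this PR may define it differently. The function name `pocket_atom_indices` and the toy coordinates are hypothetical and do not come from the PR.

```python
import math


def pocket_atom_indices(protein_xyz, ligand_xyz, cutoff):
    """Return indices of protein atoms within `cutoff` Å of any ligand atom.

    Assumption: this distance-based definition is illustrative only and is
    not taken from the pipeline code in this PR.
    """
    return [
        i
        for i, p in enumerate(protein_xyz)
        if any(math.dist(p, l) <= cutoff for l in ligand_xyz)
    ]


# Toy coordinates in Å, made up for illustration.
protein = [(0.0, 0.0, 0.0), (10.0, 0.0, 0.0), (20.0, 0.0, 0.0)]
ligand = [(1.0, 0.0, 0.0)]

pocket_8 = pocket_atom_indices(protein, ligand, 8.0)    # tighter pocket
pocket_12 = pocket_atom_indices(protein, ligand, 12.0)  # looser pocket
```

With these toy inputs the 12 Å pocket is a superset of the 8 Å pocket, which is why fingerprints built at the two cutoffs can cluster differently.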

Result

RnP zarrs match the paper's evaluable set.
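
One way a reviewer could check a claim like this is to compare the system identifiers in the built dataset against a reference list. The sketch below is not how this PR verified the result; `compare_id_sets` and the example IDs are hypothetical, and loading the IDs from the zarr stores or from the paper's list is left out.

```python
def compare_id_sets(built_ids, reference_ids):
    """Report IDs missing from, or unexpected in, the built dataset.

    Assumption: both inputs are plain iterables of string identifiers;
    how they are read from disk is outside this sketch.
    """
    built, reference = set(built_ids), set(reference_ids)
    return {
        "missing": sorted(reference - built),      # in reference, not built
        "unexpected": sorted(built - reference),   # built, not in reference
    }


# Hypothetical identifiers, for illustration only.
diff = compare_id_sets(["sys_a", "sys_b"], ["sys_a", "sys_b", "sys_c"])
matches = not diff["missing"] and not diff["unexpected"]
```

Reporting both directions of the difference makes it clear whether a mismatch comes from dropped systems or from extra ones.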
