lcr-scripts

Collection of curated scripts from the Morin Lab, used primarily within lcr-modules Snakemake pipelines.

Repository structure

Each script lives under <script_name>/<version>/ and typically contains:

The script itself (.py, .sh, etc.)
A conda environment YAML (<script_name>-<major>.yaml)
Optionally, a Dockerfile for containerized use
Optionally, a run_tests.sh and tests/ directory for regression tests

Docker containers

A GitHub Actions workflow (.github/workflows/build-containers.yml) automatically builds and pushes a container to the GitHub Container Registry (GHCR) whenever a Dockerfile is merged into master. Containers are tagged using the script name and version derived from the directory path:

ghcr.io/lcr-bccrc/lcr-scripts/<script_name>:<version>

For example, augment_manta_vcf/1.1/Dockerfile produces:

ghcr.io/lcr-bccrc/lcr-scripts/augment_manta_vcf:1.1
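
The path-to-tag mapping above can be sketched as a small bash function. This is only an illustration of the naming rule, not the code used in build-containers.yml; the function name image_ref is hypothetical.

```bash
# Derive the GHCR image reference from a Dockerfile path of the form
# <script_name>/<version>/Dockerfile. (image_ref is a hypothetical helper.)
image_ref() {
    local dockerfile="$1"
    local version_dir script_dir
    version_dir="$(dirname "$dockerfile")"     # e.g. augment_manta_vcf/1.1
    script_dir="$(dirname "$version_dir")"     # e.g. augment_manta_vcf
    printf 'ghcr.io/lcr-bccrc/lcr-scripts/%s:%s\n' \
        "$(basename "$script_dir")" "$(basename "$version_dir")"
}

image_ref augment_manta_vcf/1.1/Dockerfile
# prints ghcr.io/lcr-bccrc/lcr-scripts/augment_manta_vcf:1.1
```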

These images are referenced in lcr-modules via container_envs in each module's config/default.yaml and used with Snakemake's --use-apptainer flag.

Adding a container for a new script

Create a Dockerfile in the appropriate <script_name>/<version>/ directory.
The Dockerfile should install the packages the script needs into the base environment, e.g.:

FROM condaforge/mambaforge:latest
LABEL org.opencontainers.image.source="https://github.com/LCR-BCCRC/lcr-scripts"

RUN mamba install --yes --name base \
        --channel bioconda \
        --channel conda-forge \
        <package1> \
        <package2> \
    && mamba clean --all --yes

CMD ["/bin/bash"]

Install only the packages the script actually imports — avoid reusing the conda YAML directly, as it is a fully-pinned export that may be incompatible with the Python version in the base image.

Open a PR and merge to master — the CI workflow will build and push the image automatically.

Note: The conda YAML files in each script directory are symlinks into envs/. Docker cannot follow symlinks that escape the build context, so a Dockerfile that copies an environment YAML must use the real path relative to the repo root (e.g. COPY envs/<script_name>/<script_name>-<major>.yaml /tmp/env.yaml), not the local symlink path. The CI workflow sets the build context to . (the repo root), so that path resolves during the build.
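
To find the real path behind one of these symlinks, something like the following may help. It is a sketch that assumes GNU coreutils realpath (the --relative-to option is not available in every realpath implementation); real_env_path and the my_script names are hypothetical.

```bash
# Print the real location of a symlinked env YAML, relative to the repo root.
# Assumes GNU coreutils realpath; real_env_path is a hypothetical helper.
real_env_path() {
    local repo_root="$1" link="$2"
    realpath --relative-to="$repo_root" "$link"
}

# Self-contained demo in a throwaway directory that mimics the repo layout.
demo="$(mktemp -d)"
mkdir -p "$demo/envs/my_script" "$demo/my_script/1.0"
touch "$demo/envs/my_script/my_script-1.yaml"
ln -s ../../envs/my_script/my_script-1.yaml "$demo/my_script/1.0/my_script-1.yaml"

real_env_path "$demo" "$demo/my_script/1.0/my_script-1.yaml"
# prints envs/my_script/my_script-1.yaml
```

The printed path is the one to use as the COPY source when the build context is the repo root.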

Testing

Scripts can include a run_tests.sh that runs the script on small curated inputs and writes outputs to tests/output/. This follows a golden-file pattern:

Establish a baseline (once): Activate the conda environment for the script, run ./run_tests.sh from the script's version directory, then commit the files written to tests/output/. These are the golden files — generated from the conda environment, which is the source of truth.
Detect regressions (every subsequent run): Run ./run_tests.sh again in the conda environment — it overwrites tests/output/. Then check for unexpected changes with:
```
git diff tests/output/
```
No diff means the outputs are identical to the baseline. Any diff other than the environment-specific header lines described below indicates a regression.
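Environment-specific header lines: Some output files (e.g. VCFs) include header lines with absolute paths (##cmdline, ##regions_bed) that differ between environments and therefore show up in git diff even when the results are unchanged. To compare only the meaningful content, filter these lines out of both the committed file and the regenerated file before diffing (the <(...) syntax requires bash):
```
filter() { grep -v -e '^##cmdline' -e '^##regions_bed'; }
diff <(git show HEAD:./tests/output/file.vcf | filter) <(filter < tests/output/file.vcf)
```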

CI integration

When a Dockerfile is present alongside a run_tests.sh, the CI workflow automatically:

Builds the Docker image from the Dockerfile.
Runs run_tests.sh inside the container, writing outputs to tests/docker_output/.
Diffs the Docker outputs against the committed golden files in tests/output/, ignoring environment-specific header lines (##cmdline, ##regions_bed).
Fails the build if any output differs from the golden file.

This means Docker is tested against a conda-generated baseline — not against itself. If no golden files are committed yet, the comparison step is skipped with a warning.
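
The comparison step described above could look roughly like this in bash. It is a sketch, not the actual workflow code: the function names are hypothetical, and treating a golden file with no Docker counterpart as a failure is an assumption.

```bash
# Drop header lines that embed absolute paths before comparing.
strip_env_headers() {
    grep -v -e '^##cmdline' -e '^##regions_bed' "$1" || true
}

# Compare every golden file with its Docker-generated counterpart.
# Returns 0 if all match (or no golden files exist), 1 otherwise.
compare_to_golden() {
    local golden_dir="${1%/}" docker_dir="${2%/}" status=0 golden name
    # Skip with a warning when no golden files have been committed yet.
    if [ -z "$(find "$golden_dir" -type f ! -name .gitkeep 2>/dev/null)" ]; then
        echo "WARNING: no golden files in $golden_dir; skipping comparison" >&2
        return 0
    fi
    while IFS= read -r golden; do
        name="${golden#"$golden_dir"/}"
        if [ ! -f "$docker_dir/$name" ]; then
            # Assumption: a missing Docker output counts as a failure.
            echo "MISSING: $name" >&2
            status=1
        elif ! diff <(strip_env_headers "$golden") \
                    <(strip_env_headers "$docker_dir/$name") >/dev/null; then
            echo "DIFFERS: $name" >&2
            status=1
        fi
    done < <(find "$golden_dir" -type f ! -name .gitkeep)
    return "$status"
}

# Example call from a script's version directory:
# compare_to_golden tests/output tests/docker_output
```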

Adding tests for a new script

Create tests/input/ with small, self-contained input files.
Create an empty tests/output/ directory (add a .gitkeep so git tracks it).
Write run_tests.sh — start with set -euo pipefail so any failure exits non-zero. Put positional arguments before any option that uses nargs="+" (e.g. --bed_regions) to avoid argparse consuming them greedily.
Run ./run_tests.sh once, inspect the outputs, and commit tests/output/.
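
As a starting point, a run_tests.sh might be organized along these lines. Everything script-specific here is hypothetical (my_script.py, the file names, and the run_case helper); only set -euo pipefail and the argument ordering come from the steps above.

```bash
#!/usr/bin/env bash
# Exit non-zero on any error, unset variable, or failed pipeline stage.
set -euo pipefail

# Run one test case. Positional arguments (input, output) come first and the
# nargs="+" option (--bed_regions) comes last, so argparse cannot consume the
# positionals as extra option values.
#   $1 = command to test, $2 = input file, $3 = output file,
#   remaining arguments = values for --bed_regions
run_case() {
    local -a cmd
    read -r -a cmd <<< "$1"
    local input="$2" output="$3"
    shift 3
    mkdir -p "$(dirname "$output")"
    "${cmd[@]}" "$input" "$output" --bed_regions "$@"
}

# Example call (hypothetical script and file names):
# run_case "python my_script.py" tests/input/sample.vcf \
#     tests/output/sample.vcf tests/input/regions.bed
```

Keeping the argument order inside one helper means each new test case inherits it automatically.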
