As of June 2023, the nci-lp-dragen partition includes one DRAGEN server configured to run the TSO500 ctDNA pipeline. It is funded by NCI/CCR/Lab of Pathology until 30-Sep-2026.
Notes:
/staging disk partition.
Changelog:
Create a batch script similar to the following:
#! /bin/bash
# this is dragen.sh
# set up paths etc
source /etc/profile.d/edico.sh
RUNPATH=/path/to/runs
RUNFOLDER=NNNNNN_XXXXXX_NNNN_XXXXXXXXXX
ANALYSIS="/staging/${RUNFOLDER}-ctDNA-$(date +%s)"
METRICS=${ANALYSIS}/Results/MetricsOutput.tsv
RESULTPATH=/path/to/results
# clean up after run
trap 'rm -rf "/staging/${RUNFOLDER}" "${ANALYSIS}"' EXIT
cp -r "${RUNPATH}/${RUNFOLDER}" /staging || exit 100
mkdir -p "${ANALYSIS}" || exit 101
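# may need to specify --sampleSheet if not in the default location
# pipefail: without it the pipeline returns the status of tee,
# so a failed DRAGEN run would never trigger exit 102
set -o pipefail
DRAGEN_TSO500_CTDNA-2.6.0.sh --analysisFolder "${ANALYSIS}" \
    --runFolder "/staging/${RUNFOLDER}" \
    2>&1 | tee "${ANALYSIS}/test.out" || exit 102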
[[ -f "${METRICS}" ]] || printf "Missing metrics file\n"
grep COMPLETED_ALL_STEPS "${METRICS}"
# copy results back to working directory
cp -r "${ANALYSIS}" "${RESULTPATH}" || exit 103
Submit the script with
[user@biowulf]$ sbatch --mem=0 --cpus-per-task=64 --partition nci-lp-dragen --qos=compass dragen.sh
12345678
The --qos=compass option applies only if you have access to the compass QOS, which increases priority and the maximum walltime. Users without access to this QOS are limited to a maximum walltime of 8h.
Note that the $ANALYSIS folder is larger than the input,
with Logs_Intermediates taking up most of the space. The script
above could be modified to transfer only a subset of files back to shared
storage.
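As a minimal sketch of such a modification, the function below copies every top-level entry of an analysis folder except Logs_Intermediates. The function name copy_without_intermediates is hypothetical, and the sketch assumes that excluding Logs_Intermediates is the subset you want; adjust it if you need some of the logs. Hidden files (names starting with a dot) are not matched by the glob and are therefore skipped.

```shell
#!/bin/bash
# Hypothetical helper (not part of the DRAGEN software): copy an analysis
# folder to a destination while leaving out the Logs_Intermediates directory.
copy_without_intermediates() {
    local src=$1 dest=$2 entry
    mkdir -p "${dest}" || return 1
    for entry in "${src}"/*; do
        # an empty source directory leaves the glob unexpanded; skip it
        [[ -e "${entry}" ]] || continue
        # skip the large intermediates directory (assumed top-level name)
        [[ "$(basename "${entry}")" == "Logs_Intermediates" ]] && continue
        cp -r "${entry}" "${dest}/" || return 1
    done
}

# Possible use in place of the final cp in dragen.sh (exit code as in the script):
# copy_without_intermediates "${ANALYSIS}" "${RESULTPATH}/$(basename "${ANALYSIS}")" || exit 103
```

Because the function returns non-zero as soon as a copy fails, it can keep the same "|| exit 103" pattern as the original cp line.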
Create a batch script similar to the following:
#! /bin/bash
# this is dragen.sh
# set up paths etc
source /etc/profile.d/edico.sh
RUNPATH=/path/to/runs
RUNFOLDER=NNNNNN_XXXXXX_NNNN_XXXXXXXXXX
ANALYSIS="/staging/${RUNFOLDER}-ctDNA-$(date +%s)"
METRICS=${ANALYSIS}/Results/MetricsOutput.tsv
RESULTPATH=/path/to/results
# clean up after run
trap 'rm -rf "/staging/${RUNFOLDER}" "${ANALYSIS}"' EXIT
cp -r "${RUNPATH}/${RUNFOLDER}" /staging || exit 100
mkdir -p "${ANALYSIS}" || exit 101
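# may need to specify --sampleSheet if not in the default location
# pipefail: without it the pipeline returns the status of tee,
# so a failed DRAGEN run would never trigger exit 102
set -o pipefail
DRAGEN_TruSight_Oncology_500_ctDNA.sh --analysisFolder "${ANALYSIS}" \
    --runFolder "/staging/${RUNFOLDER}" \
    2>&1 | tee "${ANALYSIS}/test.out" || exit 102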
[[ -f "${METRICS}" ]] || printf "Missing metrics file\n"
grep COMPLETED_ALL_STEPS "${METRICS}"
# copy results back to working directory
cp -r "${ANALYSIS}" "${RESULTPATH}" || exit 103
Submit the script with
[user@biowulf]$ sbatch --mem=0 --cpus-per-task=64 --partition nci-lp-dragen --qos=compass dragen.sh
12345678
The --qos=compass option applies only if you have access to the compass QOS, which increases priority and the maximum walltime. Users without access to this QOS are limited to a maximum walltime of 8h.
Note that the $ANALYSIS folder is larger than the input,
with Logs_Intermediates taking up most of the space. The script
above could be modified to transfer only a subset of files back to shared
storage.
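The metrics check in the script above only prints a message and the job still copies results and exits normally. If you would rather have the job fail when the run did not finish, a stricter check could look like the sketch below. The function name check_metrics and its return codes are hypothetical; the sketch assumes, as the grep in the script suggests, that a finished run writes the string COMPLETED_ALL_STEPS into MetricsOutput.tsv.

```shell
#!/bin/bash
# Hypothetical helper: succeed only if the metrics file exists and
# contains COMPLETED_ALL_STEPS (assumed marker of a finished run).
check_metrics() {
    local metrics=$1
    if [[ ! -f "${metrics}" ]]; then
        printf 'Missing metrics file: %s\n' "${metrics}" >&2
        return 1
    fi
    if ! grep -q COMPLETED_ALL_STEPS "${metrics}"; then
        printf 'Run did not report COMPLETED_ALL_STEPS\n' >&2
        return 2
    fi
}

# Possible use in dragen.sh (104 is an arbitrary, unused exit code):
# check_metrics "${METRICS}" || exit 104
```

Whether to place the check before or after copying results back is a design choice: checking afterwards keeps the logs of a failed run available on shared storage for troubleshooting.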
Please send questions and comments to staff@hpc.nih.gov