Tractor is a statistical framework and software package to facilitate the inclusion of admixed individuals in association studies by leveraging local ancestry. It generates accurate ancestry-specific effect-size estimates and P values, can boost genome-wide association study (GWAS) power and improves the resolution of association signals
Allocate an interactive session and run the program. Sample session:
[user@biowulf]$ sinteractive [user@cn0799 ~]$ module load Tractor [+] Loading java 17.0.3.1 ... [+] Loading singularity 4.2.2 on cn0799 [+] Loading nextflow 24.10.3 [+] Loading Tractor 1.4.0 ... [user@cn0799 ~]$ ls $TRACTOR_BIN flare phase_common_static rfmix simulate_static xcftools_static gnomix.py phase_rare_static simulate switch_static [user@cn0799 ~]$ cp -r $TRACTOR_DATA . [user@cn0799 ~]$ tree test_data test_data ├── admixed_cohort │ ├── ASW.deconvoluted.fb.tsv │ ├── ASW.deconvoluted.msp.tsv │ ├── ASW.deconvoluted.rfmix.Q │ ├── ASW.deconvoluted.sis.tsv │ ├── ASW.phased.vcf.gz │ └── ASW.phased.vcf.gz.csi ├── phenotype │ ├── Phe_linear_covars.txt │ ├── Phe_linear.txt │ └── Phe_logistic.txt └── references ├── chr22.b37.gmap.gz ├── chr22.genetic_map.modified.txt ├── TGP_HGDP_QC_hg19_chr22.vcf.gz ├── TGP_HGDP_QC_hg19_chr22.vcf.gz.csi └── YRI_GBR_samplemap.txt 3 directories, 14 files [user@cn0799 ~]$ phase_common_static \ --input test_data/admixed_cohort/ASW.phased.vcf.gz \ --reference test_data/references/TGP_HGDP_QC_hg19_chr22.vcf.gz \ --region 22 \ --map test_data/references/chr22.b37.gmap.gz \ --output test_data/admixed_cohort/ASW.phased.bcf [SHAPEIT5] phase_common (jointly phase multiple common markers) * Author : Olivier DELANEAU, University of Lausanne * Contact : olivier.delaneau@gmail.com * Version : 5.1.1 / commit = 990ed0d / release = 2023-05-08 * Run date : 29/05/2025 - 11:19:37 Files: * Input : [admixed_cohort/ASW.phased.vcf.gz] * Reference : [test_data/references/TGP_HGDP_QC_hg19_chr22.vcf.gz] * Genetic Map : [test_data/references/chr22.b37.gmap.gz] * Output : [test_data/admixed_cohort/ASW.phased.bcf] * Output format : [bcf] Parameters: * Seed : 15052011 * Threads : 8 threads * MCMC : 15 iterations [5b + 1p + 1b + 1p + 1b + 1p + 5m] * PBWT : [window = 4cM / depth = auto / modulo = auto / mac = 5 / missing = 0.1] * HMM : [window = 4cM / Ne = 15000 / Recombination rates given by genetic map] Reading genotype data: [W::hts_idx_load3] The index file is older than the data file: test_data/references/TGP_HGDP_QC_hg19_chr22.vcf.gz.csi * VCF/BCF scanning done (9.04s) + Variants [#sites=182525 / region=22] - 11249 sites removed in reference panel [not in main panel] + Samples [#target=61 / #reference=1572] [W::hts_idx_load3] The index file is older than the data file: test_data/references/TGP_HGDP_QC_hg19_chr22.vcf.gz.csi * VCF/BCF parsing done (10.94s) + Genotypes [0/0=76.122%, 0/1=16.366%, 1/1=7.512%, ./.=0.000%, 0|1=0.000%] + Reference haplotypes [0=85.090%, 1=14.910%] Setting up genetic map: * GMAP parsing [n=45329] (0.02s) * cM interpolation [s=37946 / i=144579] (0.02s) * Region length [34861020 bp / 72.8 cM] * HMM parameters [Ne=15000 / Error=0.0001 / #rare=8659] Initializing data structures: * Impute monomorphic [n=330498] (0.01s) * HAP update (0.04s) * H2V transpose (0.80s) * PBWT parameters auto setting : [modulo = 0.045 / depth = 6] * PBWT initialization [#eval=173447 / #select=1550 / #chunk=11] (0.01s) * PBWT phasing sweep (1.72s) * Build genotype graphs [seg=607426] (0.11s) Burn-in iteration [1/5] * PBWT selection (0.90s) * HMM computations [K=716.5+/-235.1 / W=3.54Mb / US=0 / UP=0] (36.17s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.06s) Burn-in iteration [2/5] * PBWT selection (0.90s) * HMM computations [K=635.5+/-222.1 / W=3.52Mb / US=0 / UP=0] (32.56s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.04s) Burn-in iteration [3/5] * PBWT selection (0.90s) * HMM computations [K=629.1+/-207.3 / W=3.47Mb / US=0 / UP=0] (31.67s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.05s) Burn-in iteration [4/5] * PBWT selection (0.90s) * HMM computations [K=645.1+/-227.3 / W=3.57Mb / US=0 / UP=0] (33.06s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.06s) Burn-in iteration [5/5] * PBWT selection (0.90s) * HMM computations [K=631.5+/-212.7 / W=3.50Mb / US=0 / UP=0] (31.98s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.06s) Pruning iteration [1/1] * PBWT selection (0.91s) * HMM computations [K=648.8+/-219.8 / W=3.61Mb / US=0 / UP=0] (34.58s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.04s) * Trimming [pc=43.57%] Burn-in iteration [1/1] * PBWT selection (0.90s) * HMM computations [K=632.6+/-210.8 / W=3.50Mb / US=0 / UP=0] (29.10s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.05s) Pruning iteration [1/1] * PBWT selection (0.90s) * HMM computations [K=642.6+/-213.5 / W=3.57Mb / US=0 / UP=0] (30.67s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.05s) * Trimming [pc=68.12%] Burn-in iteration [1/1] * PBWT selection (0.87s) * HMM computations [K=635.6+/-225.6 / W=3.55Mb / US=0 / UP=0] (28.82s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.04s) Pruning iteration [1/1] * PBWT selection (0.90s) * HMM computations [K=650.4+/-232.0 / W=3.66Mb / US=0 / UP=0] (29.86s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.05s) * Trimming [pc=80.64%] Main iteration [1/5] * PBWT selection (0.90s) * HMM computations [K=619.3+/-210.4 / W=3.44Mb / US=0 / UP=0] (27.01s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.04s) Main iteration [2/5] * PBWT selection (0.90s) * HMM computations [K=623.0+/-213.4 / W=3.48Mb / US=0 / UP=0] (27.55s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.05s) Main iteration [3/5] * PBWT selection (0.90s) * HMM computations [K=633.2+/-209.4 / W=3.51Mb / US=0 / UP=0] (27.49s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.05s) Main iteration [4/5] * PBWT selection (0.90s) * HMM computations [K=638.1+/-219.3 / W=3.59Mb / US=0 / UP=0] (28.06s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.06s) Main iteration [5/5] * PBWT selection (0.90s) * HMM computations [K=627.7+/-213.9 / W=3.49Mb / US=0 / UP=0] (27.20s) * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0] * HAP update (0.03s) * H2V transpose (0.03s) Finalization: * HAP solving (0.06s) * HAP update (0.03s) * H2V transpose (0.03s) * VCF/BCF writing [N=61 / L=182525] (0.52s) * Indexing files * Total running time = 493 seconds [user@cn0799 ~]$ exit salloc.exe: Relinquishing job allocation 46116226 [user@biowulf ~]$