nested (now also called TE-greedy) is software to analyze nested LTR transposable elements in DNA sequences, such as reference genomes. It is made of two components: nested-generator for generating simulated sequences of nested retrotransposons, and nested-nester (now called TE-greedy-nester) that looks for nested, as well as non-nested and solo-LTR repeat sequences in the input. Unlike other similar software, TE-greedy-nester is structure-based by using de-novo retrotransposon identification software LTR Finder, relying on sequence information only secondarily.
Allocate an interactive session and run the program.
Sample session (user input in bold):
[user@biowulf]$ sinteractive
salloc.exe: Pending job allocation 46116226
salloc.exe: job 46116226 queued and waiting for resources
salloc.exe: job 46116226 has been allocated resources
salloc.exe: Granted job allocation 46116226
salloc.exe: Waiting for resource configuration
salloc.exe: Nodes cn3144 are ready for job
[user@cn3144 ~]$ module load nested
[+] Loading nested 2.0.0 on cn3144
[+] Loading singularity 3.10.5 on cn3144
[user@cn3144 ~]$nested-nester --help
Usage: nested-nester [OPTIONS] INPUT_FASTA
Options:
-s, --sketch Sketch output.
-f, --format TEXT Format for GFF.
-o, --output_fasta_offset INTEGER
Number of bases around the element included
in output fasta files.
-d, --output_folder PATH Output data folder.
-t, --initial_threshold INTEGER
Initial threshold value.
-m, --threshold_multiplier FLOAT
Threshold multiplier.
-n, --threads INTEGER Number of threads
-dt, --discovery_tool [LTR_finder|LTRharvest|finder|harvest]
Determines which tool is used for
retrotransoson discovery. Default:
LTR_finder
-solo, --solo_ltrs Run solo LTR module
--help Show this message and exit.
[user@cn3144 ~]$ nested-generator --help
Usage: nested-generator [OPTIONS] INPUT_DB OUTPUT_DB
Options:
-l, --baselength INTEGER Baselength for generated elements.
-i, --number_of_iterations INTEGER
Number of inserted elements.
-n, --number_of_elements INTEGER
Number of generated sequences.
-f, --filter Filter database and create new one with
given output db path.
-s, --filter_string TEXT Filter entries by given string [ONLY
RELEVANT WITH -filter OPTION].
-o, --filter_offset INTEGER LTR offset allowed [ONLY RELEVANT WITH
-filter OPTION].
-p, --percentage INTEGER Percentage of elements in generated
sequence.
-a, --average_element INTEGER Average element length in database.
-e, --expected_length INTEGER Expected output sequence length [ONLY WORKS
WITH -percentage and -average_element].
-d, --output_directory TEXT Output directory.
--help Show this message and exit.
Example running nested
[user@cn3144 ~]$cp -a /usr/local/apps/nested/2.0.0/test_data/ . [user@cn3144 ~]$ nested-nester test_data/151kb_adh1_bothPrimes.fasta /usr/local/lib/python3.9/dist-packages/nested-1.0.0-py3.9.egg/nested/config/config.py:14: YAMLLoadWarning: calling yaml.load() without Loader=... is deprecated, as the default Loader is unsafe. Please read https://msg.pyyaml.org/load for full details. Processing adh1_vicinity_150bpPlMin Processing adh1_vicinity_150bpPlMin: DONE [0:00:10.348563] Total time: 0:00:10.350736 Number of errors: 0
For more information please see the GitLab Page