Skip to content

Latest commit

 

History

History
 
 

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 

README.md

VADR container

Main tool : VADR

Additional tools:

  • perl v5.34.0
  • infernal v1.1.5
  • ncbi-blast+ v2.15.0
  • fasta v36.3.8h (the tool, not the file format)
  • minimap2 2.26-r1175

Basic information on how to use this tool:

  • executable: v-annotate.pl
  • additional executable: fasta-trim-terminal-ambigs.pl
  • help: -h
  • version: -v

Description

VADR is a suite of tools for classifying and analyzing sequences homologous to a set of reference models of viral genomes or gene families. It has been mainly tested for analysis of Norovirus, Dengue, and SARS-CoV-2 virus sequences in preparation for submission to the GenBank database.

Additional information

Included models:

  • Caliciviridae version 1.2.1 (including Norovirus virus; default values)
  • Flaviviridae version 1.2.1 (including Dengue virus; --mdir /opt/vadr/vadr-models-flavi/ --mkey flavi)
  • Flu version 1.6.3-2 (-mkey flu)
  • HAV version 1.0.0 (-mkey hav.vadr)
  • MPOX/mpxv version 1.4.2-1 (-mkey mpxv)
  • RSV version 1.5-2 (-mkey rsv)
  • SARS-CoV-2 version 1.3-2 (--mkey sarscov2)

Most of the VADR model files are located at /opt/vadr/vadr-models in the container filesystem and this path is stored in the globally accessible bash variable $VADRMODELDIR. For most applications, there is no need to specify v-annotate.pl --mdir /path/to/model/files since $VADRMODELDIR is set in the environment. Flaviviridae model files are located at /opt/vadr/vadr-models-flavi/ within the container filesystem. To use these models, please specify the 2 options: v-annotate.pl --mdir /opt/vadr/vadr-models-flavi/ --mkey flavi.

Example Usage

# trim fasta file
fasta-trim-terminal-ambigs.pl input.consensus.fa > trimmed.fasta

# trimming for mpxv of long viruses
fasta-trim-terminal-ambigs.pl \
    input.consensus.fa \
    --minlen 50 \
    --maxlen 210000 \
    > trimmed.fasta

# run v-annotate.pl using Flu models
v-annotate.pl --split --cpu 8 -r --atgonly --xnocomp --nomisc \
    --alt_fail extrant5,extrant3 --mkey flu \
    flu.fa \
    flu-test-output

# run v-annotate.pl using HAV models to annotate an HAV FASTA file
# the "-mkey hav.vadr" option is required to specify the HAV model
v-annotate.pl --split \
     -r \
     -xnocomp \
     -mkey hav.vadr \
     trimmed.fasta \
     hav-test-output

# run v-annotate.pl using mpxv model
v-annotate.pl --split --cpu 8 --glsearch -s -r --nomisc --mkey mpxv \
    --r_lowsimok --r_lowsimxd 100 --r_lowsimxl 2000 --alt_pass discontn,dupregin \
    --minimap2 --s_overhang 150 \
    trimmed.fasta \
    mpxv-test-output

# run v-annotate.pl using RSV models
v-annotate.pl --split 
  -r \
  -xnocomp \
  -mkey rsv \
  trimmed.fasta \
  rsv-test-output

# run v-annotate.pl using the sarscov2 model to annotate a trimmed input genome
v-annotate.pl --noseqnamemax --glsearch -s -r --nomisc \
    --mkey sarscov2 --lowsim5seq 6 \
    --lowsim3seq 6 --alt_fail lowscore,insertnn,deletinn \
    trimmed.fasta \
    sarscov2-test-outdir

# run v-annotate.pl using Flaviviridae model to annotate a Dengue viral genome
v-annotate.pl --split --cpu 1 --group Dengue --nomisc --noprotid \
    --mdir /opt/vadr/vadr-models-flavi/ --mkey flavi \
    trimmed.fasta \
    dengue-test-outdir

# run v-annotate.pl using Caliciviridae model to annotate a Norovirus viral genome
v-annotate.pl \
    trimmed.fasta \
    noro-test-outdir