raccoon

raccoon CLI

Use the raccoon top-level command with subcommands for different QC tasks.

Top-level usage:

raccoon <subcommand> [options]

Run raccoon <subcommand> --help for full command-specific help.

seq-qc subcommand

Purpose: combine one or more FASTA files into a single upper-case, unwrapped FASTA, with optional metadata-driven header harmonisation.

Basic usage:

raccoon seq-qc -f a.fasta b.fasta -o combined.fasta

With metadata-driven headers:

raccoon seq-qc -f a.fasta b.fasta -o combined.fasta \
  -m metadata.csv other_metadata.csv \
  --metadata-id-field sample \
  --metadata-location-field location \
  --metadata-date-field date \
  --header-separator '|'

With custom template fields:

raccoon seq-qc -f a.fasta b.fasta -o combined.fasta \
  -m metadata.csv \
  --header-fields "{id}|{country}|{date}"

Key options:

Notes:

aln-qc subcommand

Purpose: run alignment quality-control checks and produce a mask file and summary.

Basic usage:

raccoon aln-qc <alignment.fasta> -d outdir

With GenBank reference for frame-break checks:

raccoon aln-qc <alignment.fasta> -d outdir \
  --genbank <reference.gb> --reference-id <ref_id>

Disable selected flag classes:

raccoon aln-qc <alignment.fasta> -d outdir \
  --no-flag-n-adjacent --no-flag-gap-adjacent

Key options:

mask subcommand

Purpose: apply an aln-qc mask CSV to an alignment and write a masked FASTA.

Basic usage:

raccoon mask data/alignment.fasta \
  --mask-file results/alignment_qc/mask_sites.csv \
  -d results/alignment_qc

Key options:

tree-qc subcommand

Purpose: run phylogenetic QC and generate an interactive tree report.

Basic usage:

raccoon tree-qc --tree <treefile> -d outdir \
  --alignment <alignment.fasta> --asr-state <treefile>.state \
  --run-adar --adar-window 300 --adar-min-count 3

Key options:

Global options