MacVector icon

Contig Editor for Reference Assemblies

The Contig Editor is used for viewing and editing de novo and reference assembly contigs in Assembler and as the main editor in Align to Reference. However, the Editor's interface does change depending on which function it is being used for. This page deals with the Contig Editor as used in Assembler for reference assemblies as produced by using Bowtie. Please see Contig Editor for de novo Assemblies and Contig Editor - Align to Reference to see how this is used for the other alignments.

Map tab

Coverage plot

The Map view of a reference or child contig will show details of the depth of reads in a coverage map. The coverage map shows four statistics.

The four statistics are used to provide a better visualization of coverage when viewed over an entire assembly. As you zoom into an assembly then these three values will become increasingly similar to the extent that when viewed at residue level these three plots will be close to being a single plot. Note that areas with excessively high coverage can be indicative of a repeat.

However, areas of zero coverage are always displayed even when they are disproportionate to the level of magnification. For example a region of zero coverage over a single base will always be displayed even when you are viewing a five megabase contig in its entirety. Also note that there are no areas of zero coverage in child contigs as by definition they are bounded by either end of the reference contig and/or an area of zero coverage.

For RNAseq analysis look at the Coverage tab of a contig. This contains very detailed information needed for such analysis such as RKPM and TPM for every feature.

Variant reporting in the Map View

Variants are graphically represented in two ways. There are the VCF report and SNP Report tabs and they are also graphically represented in the Map view with two plots just above the coverage map. A SNP or INDEL is represented by a vertical line. Additionally if you mouseover the SNP or INDEL plot you will see a tooltip showing the closest SNP or INDEL. The information shows

SNPs

If any SNPs are detected in the reference contig these will also be shown in a horizontal band just above the coverage map.

INDELs

Bowtie2 is a gapped aligner and will insert a gap into the reference or the reads where doing so results in a higher alignment score. There are two types of reported INDEL.

When there is only a single allele (blue) the alignment shown in the editor tab will be adjusted. However, for multiple alleles in the reads the alignment is NOT adjusted and so may not be entirely correct at that point.

Note that if you have only a single dataset, with only one sample, in the reads then a blue INDEL would indicate it is a homozygous and a grey INDEL indicates it is heterozygous.

This is determined using the FQ column (consensus quality) in the VCF file (see VCF tab). If positive, FQ equals the phred-scaled probability of there being two or more different alleles. If negative, FQ equals the minus phred-scaled probability of all chromosomes being identical.

Misc.

Related Topics.

Assembler

Viewing coverage data across a reference contig for example for RNASeq analysis

Align to Reference

Editing Assembler contigs

Saving assemblies

Map View

Bowtie

Bowtie: Technical details