MacVector icon

Sequence Files

MacVector sequence files are binary files, rather than ASCII text files. This means that you cannot read them using a word processor or text editor. MacVector uses a sequence file's annotation and features table data as well as the sequence data itself. By using binary files that can be edited only with MacVector, we can ensure that the files will be properly formatted.

MacVector provides full feature table support only for files in GenBank format (or GenBank variants such as BIONET format). When non-GenBank files are imported, MacVector creates a GenBank-style annotation section in memory for these files. Any nonsequence information in the original files is placed in the annotation section under Comment, so no information is lost.

When directed to open a file of type TEXT, MacVector parses it to see if it matches one of the supported file formats. If there is no match, it assumes that the file is a line file (sequence only).

MacVector reads the molecule type (nucleic acid or protein) directly from the file, if possible. When it encounters a file format that does not contain this information, such as line files, Staden files, and GCG files, MacVector attempts to determine the likely sequence type by looking for invalid DNA/RNA or Protein residues and ensuring there are relatively few ambiguities. If MacVector still cannot determine the likely sequence type, it displays an alert box for you to indicate whether the file contains a nucleic acid or a protein sequence.

TEXT FILE formats read by MacVector

- GenBank flat file format (including the DNASTAR, DuPont sequencer, and IBI Pustell variants of the GenBank format)

- GenPept files as distributed by NCBI

- IG_SUITE (formerly known as BIONET)

- EMBL flat file format (including PC Gene)

- UNIPROT/SWISSPROT format

- PHYLIP format

- NEXUS format

- Pearson FASTA format

- Staden format

- GCG formats (RSF and MSF)

- ASCII 1-letter (TEXT files containing only sequence data)

- CODATA format (both NBRF PIR and DNASTAR variants).

Related Topics.

Sequence editor

File extensions

Features table

Annotations

Setting origins