1. Choose Analyze | Pustell DNA Matrix.
The Protein and DNA Matrix Analysis dialog box is displayed.
2. In the left-hand scrolling list, select a sequence to display along the X-axis.
3. In the right-hand scrolling list, select a sequence to display along the Y-axis.
The name that is highlighted in each list is the sequence assigned to that axis.
4. You can limit the analysis to a region of each of the sequences by typing in the numbers that bracket the region in the X-Region and Y-Region text boxes, or by selecting a region from the features table drop-down menu that appears to the right of the text boxes.
If you are using a very long nucleic acid sequence for the comparison, the procedure will need less memory if you display the shorter sequence on the x-axis.
5. Select the Scoring Matrix button to choose a scoring matrix file.
A standard dialog box appears, enabling you to search for and select the file to use. Select the DNA Identity Matrix.
6. Enter a window size in the window size text box.
Whenever MacVector finds an exact match, it examines the segment (or window) of the aligned sequences that surround the matching region. The length of the segment is the value typed in for window size.
7. Enter a minimum score in the min. % score text box.
Whenever MacVector finds an exact match, it computes a total score for the window using the match / mismatch scores in the scoring matrix. It then determines a percent score by dividing the window's score by the score that would occur if all of the bases in the window matched. If this percent score equals or exceeds the value of min. % score, the window is saved.
8. Choose a value from the hash value drop-down menu.
The hash value is a measure of how long an exact match between two sequences must be before MacVector will attempt to score and align that matching region. A hash value of 1 is the most sensitive, 6 is the least sensitive.
TIP: For most comparisons, start with a hash value of 6, because it is unusual for two sequences to possess significant similarity without having regions of that size that match exactly.
9. Choose a value from the jump drop-down menu.
When the jump setting is 1, the hash value represents the number of bases in a row that must match perfectly. When the jump setting is 3, the hash value is the number of triplets in a row whose first base must match perfectly. When the jump setting is both, matching regions found for both jump settings are scored.
TIP: For the initial analysis stage, we recommend that the jump setting be set to both.
10. Use the strand drop-down menu to choose whether to compare the plus strands of the two sequences (++), the plus strand of the X-axis sequence with the reverse complement strand of the Y-axis sequence (+-), or both.
TIP: For the initial analysis, we recommend that you select both.
11. Select OK to perform the analysis.
Alternatively, select Defaults to restore the default settings, or Cancel to close the dialog box without performing the analysis.
When the analysis is complete, the DNA Matrix Analysis display dialog box is displayed. This dialog box is described in the Displaying DNA matrix results help topic.