4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
1 1 1 1 1 2 1 4 2 2 1 1 3 3 2 3 2 3
1 2 1 1 1 2 2 4 2 4 1 1 3 3 2 3 2 1
1 1 1 1 3 2 2 1 2 4 1 1 3 3 1 3 2 3
1 3 1 1 1 2 1 4 2 3 1 1 3 3 3 3 2 3
Note that the blanks between the numbers are necessary to separate the nucleotides. The first line indicates the number of species, 4 in this example. The program then reads in four lines, which represent the DNA sequences of the species of interest. If these sequences are very long, it might be more convenient to use line-wrapping. Here is an alternative format of the input data:
4 0 0 0 0 0 0 0 0 0
1 1 1 1 1 2 1 4 2 2
1 2 1 1 1 2 2 4 2 4
1 1 1 1 3 2 2 1 2 4
1 3 1 1 1 2 1 4 2 3
1 1 3 3 2 3 2 3 0 0
1 1 3 3 2 3 2 1 0 0
1 1 3 3 1 3 2 3 0 0
1 1 3 3 3 3 2 3 0 0
Lines 2-5 belong to species 1-4, respectively. Then comes the line wrap. Line 5 belongs to species 1, line 6 to species 2 and so on. Note that the matrix must not contain any missing values, therefore the last columns must be padded with zeros.