I am grateful to Wolfgang Lehrach for discovering the bugs reported below.
The transition-transversion ratio is defined as
alpha/beta.
This is not the standard definition, though.
In the Kimura model, the standard definition is
alpha/(2*beta).
The factor 1/2 results from the fact that there
are twice as many transversions than transitions.
In the HKY and F84 models, the transition/transversion
ratio depends on functions of the nucleotide frequencies;
see, for instance, equation 24 in
Felsenstein and Churchill (1996), Mol. Biol. Evol. 13:93-104.
You need to keep that difference in mind when comparing
your results with a program that uses the standard
definition, like
Seq-Gen.
Here are further details.
In phylogenetic analysis, there is an identifiability problem
between the branch lengths and the parameters of the rate matrix
unless some further conditions are specified.
The standard condition that is required to hold is:
sum_i Q_ii pi_i = -1
where Q_ii are the elements of the rate
matrix, and pi_i is the equilibrium
distribution over nucleotides.
This condition allows the branch lengths to be
interpreted as the expected number of nucleotide
substitutions per site.
Note that my code uses a different equation of constraint
that does not lead to this convenient interpretation of the
branch lengths. This deviation is irrelevant for the
detection of mosaic structures in DNA sequence alignments itself.
However, it can cause confusions when comparing
your results with those of a standard phylogeny package,
like
SeqGen.
To avoid this confusion, replace function
Barge_NucSubstMatrix.m
by the
following revised function, written
by Wolfgang Lehrach.
When correcting this bug, one has to transpose the transition probability matrix used in my code. For the Kimura model, this is obsolete, as the transition probability matrix is symmetric. But that does not hold for the HKY85 model.
I have not yet made these corrections for two reasons. Firstly, I no longer have the time to support this software. Secondly, I find that my software has been superseded by a more recent program package developed by Wolfgang Lehrach, whose software will be made available here shortly.
Back to the previous page.