Jukes-Cantor distance

In the Jukes and Cantor (1969) model, the rate of nucleotide substitution is the same for all pairs of the four nucleotides A, T, C, and G. As is shown below, the multiple hit correction equation for this model produces a maximum likelihood estimate of the number of nucleotide substitutions between two sequences. It assumes an equality of substitution rates among sites (see the related gamma distance), equal nucleotide frequencies, and it does not correct for higher rate of transitional substitutions as compared to transversional substitutions.

 

The Jukes-Cantor model

images\ebx_363058755.gif

 

MEGA provides facilities for computing the following quantities:

d: Transitions + Transversions : Number of nucleotide substitutions per site.

L: No of valid common sites: Number of sites compared.

Formulas for computing these quantities are as follows:

Distance

images\ebx_-1649912316.gif

where p is the proportion of sites with different nucleotides.

 

Variance

images\ebx_-1396568479.gif

See also Nei and Kumar (2000), page 36.