29 Evidences for Macroevolution: Part 1

Introduction to Phylogenetics

escent from a common ancestor entails a process of branching and divergence of species, in common with any genealogical process. Genealogies can be graphically illustrated by tree-like diagrams, and this is why you will hear evolutionists refer to the genealogy of species as the "tree of life." Diagrams such as these are known as phylogenetic trees or phylogenies. Figure 1 shows an example I will use here. The macroevolutionary prediction of one true historical phylogenetic tree is the most important, powerful, and basic conclusion from the hypothesis of common descent. A thorough grasp of this concept is necessary for understanding macroevolutionary deductions. In the following section, I give a brief overview of phylogenetic trees and how biologists determine them.

Figure 1. The standard phylogenetic tree.

Phylogenetic Reconstructions: Reliability

In order to establish their validity in reliably determining phylogenies, cladistic methods have been empirically tested in cases where the true phylogeny is known with certainty, since the true phylogeny was directly observed.

Bacteriophage T7 was propagated and split sequentially in the presence of a mutagen, where each lineage was tracked. Out of 135,135 possible phylogenetic trees, the true tree was correctly determined by cladistic methods in a blind analysis. Five different statistical cladistic methods were used independently, and each one chose the correct tree (Hillis, Bull et al. 1992).
In another study, 24 strains of mice were used in which the genealogical relationships were known. Cladistic analysis reproduced almost perfectly the known phylogeny of the 24 strains (Atchely and Fitch 1991).
Bush et. al. used phylogenetic analysis to retrospectively predict the correct evolutionary tree of human Influenza A virus 83% of the time for the flu seasons spanning 1983 to 1994.
In 1998, researchers used 111 modern HIV-1 (AIDS virus) sequences in a phylogenetic analysis to predict the nucleotide sequence of the viral ancestor of which they were all descendants. The predicted ancestor sequence closely matched, with high statistical probability, the actual HIV sequence found in an HIV-1 seropositive African plasma sample collected and archived in the Belgian Congo in 1959 (Zhu, Korber et al. 1998).

Phylogenetic trees are normally portrayed with two dimensions: a time dimension and a morphological dimension. In most examples, such as this one, the time dimension is plotted on the vertical axis, and the morphological dimension is plotted on the horizontal axis. It is important to be aware that morphology is extremely multidimensional in reality, but it is usually displayed as one-dimensional for graphical simplicity and clarity. When phylogenies are constructed from molecular sequence data, sequence similarity replaces morphology on the horizontal axis. Note that in most phylogenies, as with this illustration, both time and morphology are relative, not absolute. Exceptions include phylogenies that are correlated with fossil data and phylogenies that incorporate a "molecular clock."

A common misconception is that some modern species are ancestral to another modern species. However, all modern species are found at the tips of the tree's branches, and one modern species is as "evolved" as any other. That is, although mammals are thought to have evolved from something that resembled modern reptiles, modern reptiles are just as "old" evolutionarily as modern mammals (Brooks 1991, p.68; Futuyma 1998, p.113).

A method for determining the true phylogenetic tree: Cladistics

Of all clean birds ye shall eat.
But these are they of which ye shall not eat:

The eagle, and the ossifrage, and the ospray,
And the glede, and the kite, and the vulture after his kind,
And every raven after his kind,
And the owl, and the night hawk, and the cuckow, and the hawk after his kind,
The little owl, and the great owl, and the swan,
And the pelican, and the gier eagle, and the cormorant,
And the stork, and the heron after her kind, and the lapwing,
and the bat.

Deuteronomy 14:11-18, KJV

If modern species have descended from ancestral ones in this tree-like, branching manner, a rigorous classification of species should reflect their divergence and it should be possible to infer the true historical tree that traces their paths of descent. The consensus model which evolutionary biologists use to represent the one true tree I will refer to as the "standard phylogenetic tree" (see Figure 1). In 1950, taxonomist Willi Hennig proposed a method for determining the standard phylogenetic tree based on morphology by classifying organisms according to their shared derived characters (Hennig 1966). This method, now called cladistics, does not assume genealogical relatedness a priori, since it can be used to classify anything in principle, even things like books, cars, or chairs that are obviously not genealogically related (although I'm not sure why one would want to). Neither does it use circular logic; the conclusions and predictions from a cladistic analysis are not part of the input data. Using firm evolutionary arguments, Hennig reasoned that his method was the most appropriate classification technique for determining evolutionary relationships generated by lineal descent. In fact, Hennig's cladistics is really just a rigorous formalization of the classification methods biologists had been using intuitively ever since Linnaeus. Evolutionists today construct their phylogenetic trees based on Hennig's method, and because of cladistics these phylogenetic trees are reproducible and independently testable. (Brooks 1991, Ch. 2).

Phylogenetic Reconstructions: Caveats

As with any investigational scientific method, certain conditions must hold in order for the results to be reliable. The list below gives some of the more important caveats that scientists must keep in mind when interpreting the results of a phylogenetic analysis (Swofford 1996, pp. 493-509). In general, the contribution of each of these concerns will be "averaged out" by including more independent characters in the cladistic analysis.

Cladistics

Correlated characters: each character used in the analysis optimally should be genetically independent. Characters that are hereditarily correlated are better thought of as a single character.
True structural convergence: structures that have undergone convergent evolution can artificially result in incorrect tree topologies. Fortunately, compared to other phylogenetic methods, cladistics is especially useful for resolving apparent structural convergence. Including more characters in the analysis also aids in overcoming convergent effects.
Character reversals: characters that revert to an ancestral state pose a challenge similar to convergence.
Lost characters: lineages that have lost characters (such as whales and their hindlimbs) can also pose cladistic problems. Often, if a cladistic analysis indicates strongly that a certain character has been lost during evolution, it is best to omit this character in the analysis for that lineage.
Missing characters: incomplete fossils are problematic, since they may lack important characters. Better fossils are the answer.
Parsimony: requiring that the best tree is the one with the least amount of change is only justified if each character under analysis has the same probability of evolving (in the long term). If there is independent evidence that this assumption is not justified, it is best to weight characters according to their evolutionary probability. For instance, it is known that certain base mutations are more likely to happen than others.
Intractable number of possible phylogenetic trees: for computational reasons, this is one of the most important phylogenetic challenges to overcome. The goal of a phylogenetic reconstruction is to determine the best tree that the data supports. For an analysis of only five species, there are 15 possible trees. For an analysis of 50 species, there are over 10⁷⁴ possible trees that must be searched - which is computationally impossible. This problem is not as bad as it first sounds, since narrowing down the number of reasonable trees can be trivial in many cases. Several methods have been developed to work around this issue successfully, and ultimately more powerful computers are better.

It is important to keep in mind that Hennig's phylogenies are determined using only shared derived characters of organisms, not primitive, paralogous, or analogous characters (Brooks 1991, pp. 35-36). "Primitive" and "derived" are relative terms. As a generality, primitive characters are characters shared by the most organisms; derived characters are shared by fewer organisms. For example, backbones are primitive characters of vertebrates; hair is a derived character particular to mammalian vertebrates. However, when considering mammals only, hair is primitive, whereas an opposable thumb is derived. Paralogous characters have the same structure but different functions; analogous characters have different structures but the same function. For example, bird wings and ape arms are paralogous; bird wings and insect wings are analogous.

In contrast, shared derived characters must have the same function and structure (Brooks 1991, Chs. 3 and 5). Consequently, paralogous and analogous characters cannot be used to determine a cladistic phylogeny; in fact, the phylogeny ultimately determines which characters are analogous and/or paralogous (Brooks 1991, pp. 25-26). Hennig's method of maximum statistical parsimony resolves which characters are the most probable primitive and derived characters for a group of species. Thus, some parts of a phylogeny can be ambiguous, and some parts can be well determined.

Every branch of a phylogeny can be assigned an objective statistical confidence based on the data (Maddison 1992, pp. 112-123; Li 1997, pp. 36-146; Felsenstein 1985; Futuyma 1998, p. 99, Hillis and Bull 1993; Huelsenbeck, et al. 2001; Swofford et al. 1996, pp. 504-509). The preceding point is significant - any cladogram, including what I call the standard phylogenetic tree, is an approximation of the true tree. Some cladistic analyses are strongly supported by the data, some are weakly supported. When comparing two independent phylogenies of the same organisms, one must take into account the statistical significance assigned to each branch of the phylogenies. As with all scientific analyses, the details of a cladogram may change as new information and data are incorporated.

For more information, you can consult one of several excellent online cladistic resources, such as the World of Cladistics, the SASB Introduction to Phylogenetics, UC Berkeley's Integrative Biology Phylogenetics Lab, or Diana Lipscomb's stellar Basics of Cladistic Analysis, downloadable in Adobe Acrobat PDF format. A good, concise description for the layperson can be found at the Journal of Avocational Paleontology. Finally, you can read Charles Darwin's explanation in The Origin of Species of the "Tree of Life," where the concept of a phylogenetic tree was first introduced.

Prediction 1: The fundamental unity of life

"Oh Jehova, Quam ampla sunt Tua Opera."

- Carolus Linnaeus
at the beginning of Systema Naturae, 1757

According to the theory of common descent, modern living organisms, with all their incredible differences, are the progeny of one single species in the distant past. In spite of the extensive variation of form and function among organisms, several fundamental criteria characterize all life. Some of the macroscopic properties that characterize all of life are (1) replication, (2) information flow in continuity of kind, (3) catalysis, and (4) energy utilization (metabolism). At a very minimum, these four functions are required to generate a physical historical process that can be described by a phylogenetic tree.

If every living species descended from an original species that had these four obligate functions, then all living species today should necessarily have these functions (a somewhat trivial conclusion). Most importantly, however, all modern species should have inherited the structures that perform these functions. Thus, a basic prediction of the genealogical relatedness of all life, combined with the constraint of gradualism, is that organisms should be very similar in the particular mechanisms and structures that execute these basic life processes.

Confirmation:

All known living things use polymers to perform these four basic functions. Organic chemists have synthesized hundreds of different polymers, yet the only ones used by life, irrespective of species, are polynucleotides, polypeptides, and polysaccharides. Regardless of the species, the DNA, RNA and proteins used in known living systems all have the same chirality, even though there are at least two chemically equivalent choices of chirality for each of these molecules. For example, RNA has four chiral centers in its ribose ring, which means that it has 16 possible stereoisomers - but only one of these stereoisomers is found in the RNA of known living organisms.

Ten years after the publication of The Origin of Species, nucleic acids were first isolated by Friedrich Miescher in 1869. It took another 75 years after this discovery before DNA was identified as the genetic material of life (Avery, McCleod, et. al 1944). It is quite conceivable that we could have found a different genetic material for each species. In fact, it is still possible that newly identified species might have unknown genetic materials. However, all known life uses the same polymer, polynucleotide (DNA or RNA), for storing species specific information. All known organisms base replication on the duplication of this molecule. The DNA used by living organisms is synthesized using only four nucleosides (deoxyadenosine, deoxythymidine, deoxycytidine, and deoxyguanosine) out of the dozens known (at least 99 occur naturally and many more have been artificially synthesized) (Rozenski, Crain et al. 1999; Voet and Voet 1995, p. 969).

In order to perform the functions necessary for life, organisms must catalyze chemical reactions. In all known organisms, enzymatic catalysis is based on the abilities provided by protein molecules (and in relatively rare, yet important, cases by RNA molecules). There are 293 naturally occurring amino acids known (Voet and Voet 1995, p. 69; Garavelli, Hou, et al. 2001); however, the protein molecules used by all known living organisms are constructed with the same subset of 22 amino acids.

There must be a mechanism for transmitting information from the genetic material to the catalytic material; all known organisms, with extremely rare exceptions, use the same genetic code for this. The few known exceptions are, nevertheless, simple and minor variations from the "universal" genetic code (see Figure 1.1.1) (Lehman 2001; Voet and Voet 1995, p. 967).

Figure 1.1.1. The standard genetic code and known variant nuclear codes. (1) Candida, a unicellular yeast. (2) Micrococcus. (3) ciliated protozoans and green algae. (4) Mycoplasma. (5) suppressor codon in bacteria. (6) Euplotes. (7) the selenocysteine codon (8) Spiroplasma. (9) Micrococcus. (10) resume codon in ssrA RNA (Lehman 2001).

All known organisms use extremely similar, if not the same, metabolic pathways and metabolic enzymes in processing energy-containing molecules. For example, the fundamental metabolic systems in living organisms are glycolysis, the citric acid cycle, and oxidative phosphorylation. In all eukaryotes and in the majority of prokaryotes, glycolysis is performed in the same ten steps, in the same order, using the same ten enzymes (Voet and Voet 1995, p. 445). In addition, the most basic unit of energy storage, the adenosine triphosphate molecule (ATP), is the same in all species that have been studied.

Potential Falsification:

Thousands of new species are discovered yearly, and new DNA and protein sequences are determined daily from previously unexamined species (Wilson 1992, Ch. 8); each and every one is a test of the theory of common descent. Based solely on the theory of common descent and the genetics of known organisms, we strongly predict that we will never find any modern species from known phyla on this Earth with a foreign, non-nucleic acid genetic material. We also make the strong prediction that all newly discovered species that belong to the known phyla will use the "standard genetic code" or a close derivative thereof. For example, according to the theory, none of the thousands of new and previously unknown insects that are constantly being discovered in the Brazilian rainforest will have non-nucleic acid genomes. Nor will these yet undiscovered species of insects have genetic codes which are not close derivatives of the standard genetic code. In the absence of the theory of common descent, it is quite possible that every species could have a very different genetic code, specific to it only, since there are 1.4 x 10⁷⁰ informationally equivalent genetic codes, all of which use the same codons and amino acids as the standard genetic code (Yockey 1992). This possibility could be extremely useful for organisms, as it would preclude interspecific viral infections; however, it has not been observed, and the theory of common descent effectively prohibits such an observation.

As another example - nine new lemur and two marmoset species (all primates) were discovered in the forests of Madagascar and Brazil in 2000 (Groves 2000; Rasoloarison, Goodman, et al. 2000; Thalmann and Geissmann 2000). Ten new monkey species have been discovered in Brazil alone since 1990 (Van Roosmalen, Van Roosmalen, et al. 2000). Nothing in biology prevents these various species from having a hitherto unknown genetic material or a previously unused genetic code - nothing, that is, except for the theory of common descent. However, we now know definitively that the new lemurs use DNA with the standard genetic code (Yoder, Rasoloarison, et al. 2000); the marmosets have yet to be tested.

Furthermore, each species could use a different polymer for catalysis. The polymers that are used could still be chemically identical yet have different chiralities in different species. There are thousands of thermodynamically equivalent glycolysis pathways (even using the same ten reaction steps but in different orders), so it is possible that every species could have its own specific glycolysis pathway, tailored to its own unique needs. The same reasoning applies to other core metabolic pathways, such as the citric acid cycle and oxidative phosphorylation.

Finally, many molecules besides ATP could serve equally well as the common currency for energy in various species (CTP, TTP, UTP, ITP, or any ATP-like molecule with one of the 293 known amino acids or one of the dozens of other bases replacing the adenosine moiety immediately come to mind). Discovering any new animals or plants that contained any of the anomalous examples proffered above would be strong falsifications of common ancestry, but they have not been found.

Prediction 2: A nested hierarchy of species

As you can see from the phylogeny in Figure 1, the predicted pattern of organisms at any given point in time can be described as "groups within groups." This nested hierarchical organization of species contrasts sharply with the continuum of "the great chain of being" and the continuum predicted by Lamarck's theory of organic progression (Darwin 1872, pp. 552-553; Futuyma 1998, pp. 88-92). Mere similarity between organisms is not enough to support macroevolution; the nested classification pattern that satisfies the macroevolutionary process is more specific than simple similarity. Few other natural processes would predict a nested hierarchical classification. Real world examples that cannot be objectively classified in nested hierarchies are the elementary particles (which are described by quantum chromodynamics), the elements (whose organization is described by quantum mechanics and illustrated by the periodic table), the planets in our Solar System, books in a library, or specially designed objects like buildings, furniture, cars, etc.

Phylogenetic Reconstructions: Caveats

Molecular methods

While, theoretically, molecular systematic methods have all the same caveats as any cladistic method, in practice molecular phylogenetics has several specific concerns. In fact, some molecular methods (such as UPGMA) are not strictly cladistic, and they are better described as phenetic methods.

Maximum Likelihood assumptions: the Maximum Likelihood method makes explicit assumptions about the pattern of nucleotide substitutions. These assumptions are based upon a solid statistical foundation; however, their validity must be considered when evaluating the results.
Reversals: because DNA and RNA only have four different character states, they are especially prone to reversals during evolution.
Long branch attraction: a theoretical phenomenon in which lineages that diverged relatively long ago will tend to "cluster" together in a phylogenetic reconstruction under the appropriate conditions. The mathematical reasons are somewhat complicated, but using more slowly evolving genes helps overcome the problem.
Rate variation between lineages: rates of nucleotide substitution may differ between lineages; this can contribute to long branch attraction and result in incorrect tree topologies.
Rate variation within a single gene: rates of nucleotide substitution can vary along the length of a single gene - this also exacerbates long branch attraction.
Gene trees are not equivalent to species trees: from simple Mendelian genetics we know that genes segregate individually, and that throughout time individual genes do not necessarily follow organismic genealogy (Avise and Wollenberg 1997). An obvious example is the fact that while you may have brown eyes, your child may have the genes for blue eyes - but that does not mean your child is not your descendent, or that your brown-eyed children are more closely related to you than your blue-eyed children. Including multiple genes in the analysis is a solution to this conundrum.
However, it should be noted that a basic assumption of all molecular phylogenetic methods is that genes are transmitted via vertical inheritance, i.e. from ancestor to descendant. If this assumption is violated, then gene trees will never recapitulate an organismic phylogeny. This assumption obviously is violated in instances of horizontal transfer, e.g. in transformation of a bacterium by a DNA plasmid, or in retroviral insertion into a host's genome. During the early evolution of life, before the advent of multicellular organisms, horizontal transfer was likely very frequent (as it is today in the observed evolution of bacteria and other unicellular organisms). Thus, it is doubtful whether molecular methods are applicable, even in principle, to resolving the phylogeny of the early evolution of life (near the most recent common ancestor of all living organisms) (Doolittle 1999, 2000; Woese 1998).

Although it is trivial to classify anything subjectively in a hierarchical manner, only certain things can be classified objectively in a consistent nested hierarchy. The difference drawn here between "subjective" and "objective" is crucial and requires some elaboration, and it is best illustrated by example. Different models of cars certainly could be classified hierarchically - perhaps one could classify cars first by color, then within each color by number of wheels, then within each wheel number by manufacturer, etc. However, another individual may classify the same cars first by manufacturer, then by size, then by year, then by color, etc. The particular classification scheme chosen for the cars is subjective. In contrast, human languages, which have common ancestors and are derived by descent with modification, generally can be classified in objective nested hierarchies (Pei 1949; Ringe 1999). Nobody would reasonably argue that Spanish should be categorized with German instead of with Portugese. The difference between classifying cars and classifying languages lies in the fact that, with cars, certain characters (for example, color or manufacturer) must be considered more important than other characters in order for the classification to work. Which types of car characters are more important depends upon the personal preference of the individual who is performing the classification. In other words, certain types of characters must be weighted subjectively in order to classify cars in nested hierarchies; cars do not fall into natural, unique, objective nested hierarchies.

Because of these facts, a cladistic analysis of cars will not produce a unique, consistent, well-supported tree that displays nested hierarchies. A cladistic analysis of cars (or, alternatively, a cladistic analysis of imaginary organisms with randomly assigned characters) will of course result in a phylogeny, but there will be a very large number of other phylogenies, many of them with very different topologies, that are as well-supported by the same data. In contrast, a cladistic analysis of organisms or languages will generally result in a well-supported nested hierarchy, without arbitrarily weighting certain characters (Ringe 1999). Cladistic analysis of a true genealogical process produces one or relatively few phylogenetic trees that are much more well-supported by the data than the other possible trees.

The degree to which a given phylogeny displays a unique, well-supported, objective nested hierarchy can be rigorously quantified. Several different statistical tests have been developed for determining whether a phylogeny has a subjective or objective nested hierarchy, or whether a given nested hierarchy could have been generated by a chance process instead of a genealogical process (Swofford 1996, p. 504). These tests measure the degree of "cladistic hierarchical structure" (also known as the "phylogenetic signal") in a phylogeny, and phylogenies based upon true genealogical processes give high values of hierarchical structure, whereas subjective phylogenies that have only apparent hierarchical structure (like a phylogeny of cars, for example) give low values (Archie 1989; Faith and Cranston 1991; Farris 1989; Felsenstein 1985; Hillis 1991; Hillis and Huelsenbeck 1992; Huelsenbeck, et al. 2001; Klassen et al. 1991).

There is one caveat to consider with this prediction: if rates of evolution are fast, then cladistic information can be lost over time since it would be essentially randomized. The faster the rate, the less time needed to obliterate information about the historical branching pattern of evolution. Slowly evolving characters let us see farther back into time; faster evolving characters restrict that view to more recent events. If the rate of evolution for a certain character is extremely slow, a nested hierarchy will be observed for that character only for very distantly related taxa. However, "rate of evolution" vs. "time since divergence" is relative; if common descent is true, then in some time frame we will always be able to observe a nested hierarchy for any given character. Furthermore, we know empirically that different characters evolve at different rates (e.g. some genes have higher background mutation rates than others). Thus, if common descent is true, we should observe nested hierarchies over a broad range of time at various biological levels.

Therefore, since common descent is a genealogical process, common descent should produce organisms that can be organized into objective nested hierarchies. Equivalently, we predict that, in general, cladistic analyses of organisms should produce phylogenies that have large, statistically significant values of hierarchical structure (in standard scientific practice, a result with "high statistical significance" is a result that has a 1% probability or less of occurring by chance [P < 0.01]). As a representation of universal common descent, the universal tree of life should have very high, very significant hierarchical structure and phylogenetic signal.

Confirmation:

Most existing species can be organized rather easily in a nested hierarchical classification. This is evident in the use of the Linnaean classification scheme. Based on shared derived characters, closely related organisms can be placed in one group (such as a genus), several genera can be grouped together into one family, several families can be grouped together into an order, etc.

As a specific example (see Figure 1), plants can be classified as vascular and nonvascular (i.e. they have or lack xylem and phloem). Nested within the vascular group, there are two divisions, seed and non-seed plants. Further nested within the seed plants are two more groups, the angiosperms (which have enclosed, protected seeds) and the gymnosperms (having non-enclosed seeds). Within the angiosperm group are the monocotyledons and the dicotyledons.

Most importantly, the standard phylogenetic tree and nearly all less inclusive evolutionary phylogenies have statistically significant, high values of hierarchical structure (Hillis 1991; Hillis and Huelsenbeck 1992; Klassen et al. 1991).

Figure 1.2.1. A plot of the CI values of cladograms versus the number of taxa in the cladograms. CI values are on the y-axis; taxa number are on the x-axis. The 95% confidence limits are shown in light turquoise. All points above and to the right of the turquoise region are statistically significant high CI values. Similarly, all points below and to the left of the turquoise region are statistically significant low values of CI. (reproduced from Klassen et al. 1991).

Potential Falsification:

It would be very problematic if many species were found that combined characteristics of different nested groupings. Proceeding with the previous example, some nonvascular plants could have seeds or flowers, like vascular plants, but they do not. Gymnosperms (e.g. conifers or pines) occasionally could be found with flowers, but they never are. Non-seed plants, like ferns, could be found with woody stems; however, only some angiosperms have woody stems. Conceivably, some birds could have mammary glands or hair; some mammals could have feathers (they are an excellent means of insulation). Certain fish or amphibians could have differentiated or cusped teeth, but these are only characteristics of mammals. A mix and match of characters like this would make it extremely difficult to objectively organize species into nested hierarchies. Unlike organisms, cars do have a mix and match of characters, and this is precisely why a nested hierarchy does not flow naturally from classification of cars.

If it were impossible, or very problematic, to place species in an objective nested classification scheme (as it is for the car, chair, book, atomic element, and elementary particle examples mentioned above), macroevolution would be effectively disproven. More precisely, if the phylogenetic tree of all life gave statistically significant low values of phylogenetic signal (hierarchical structure), common descent would be resolutely falsified. In fact, it is possible to have a "reciprocal" pattern from nested hierarchies. Mathematically, a nested hierarchy is the result of specific correlations between certain characters of organisms. When evolutionary rates are fast, characters become randomly distributed with respect to one another, and the correlations are weakened. However, the characters can also be anti-correlated - it is possible for them to be correlated in the opposite direction from what produces nested hierarchies (Archie 1989; Faith and Cranston 1991; Hillis 1991; Hillis and Huelsenbeck 1992; Klassen et al. 1991). The observation of such an anti-correlated pattern would be a strong falsification of common descent, regardless of evolutionary rates.

As a specific example of the above - one widely used measure of cladistic hierarchical structure is the consistency index (CI). The statistical properties of the CI measure were investigated in a frequently cited paper by Klassen et al. (Klassen et al. 1991; see Figure 1.2.1). The exact CI value is dependent upon the number of taxa in the phylogenetic tree under consideration. In this paper, the authors calculated what values of CI were statistically significant for various numbers of taxa. Higher values of CI indicate a greater degree of hierarchical structure. As an example, a CI of 0.2 is expected from random data for 20 taxa. A value of 0.3 is, however, highly statistically significant. Most interesting for the present point is the fact that a CI of 0.1 for 20 taxa is also highly statistically significant, but it is too low - it is indicative of anti-cladistic structure. Klassen et al. took 75 CI values from published cladograms in 1989 (combined from three papers) and noted how they fared in terms of statistical significance. The cladograms used from 5 to 49 different taxa (i.e. different species). Three of the 75 cladograms fell within the 95% confidence limits for random data, which means that they were indistinguishable from random data. All the rest exhibited highly statistically significant values of CI. None exhibited significant low values; none displayed an anti-correlated, anti-hierarchical pattern. And note, this study was performed before there were measures of statistical significance which would allow researchers to "weed out" the bad cladograms. Predictably, the three "bad" data sets considered under ten taxa - it is of course more difficult to determine statistical significance with very little data. Seventy-two independent studies from different researchers on different organisms and genes with high values of CI (P > 0.01) is an incredible confirmation with an astronomical degree of combined statistical significance. If the reverse were true, that all of these studies gave statistically significant low values of CI (i.e. cladistic hierarchical structure), common descent would have been firmly falsified.

Keep in mind that ~1.5 million species are known, and that the majority of these species has been discovered since Darwin first stated his hypothesis of common ancestry. Even so, they all have fit the correct hierarchical pattern within the error of our methods. Furthermore, it is estimated that only 1 to 10% of all living species has even been catalogued, let alone studied in detail. New species discoveries pour in daily, and each one is a test of the theory of common descent (Wilson 1992, Ch. 8).

Prediction 3: convergence of independent phylogenies

If there is one true historical phylogenetic tree which unites all species in an objective genealogy, all separate lines of evidence should converge on the same tree (Penny et al. 1982; Penny et al. 1991). Independently derived phylogenetic trees of all organisms should match each other with a high degree of statistical significance.

Confirmation:

Many genes with very basic cellular functions are ubiquitous – they occur in the genomes of most or all organisms. An oft-cited example, which I will use frequently, is the cytochrome c gene. Since all eukaryotes contain the gene for this essential protein, neither its presence nor its function correlates with organismal morphology. Additionally, because of the fact of DNA coding redundancy, parts of certain DNA sequences have absolutely no correlation with phenotype (e.g. certain introns or the four-fold degenerate third-base position of most DNA codons). Due to these two aspects of certain DNA sequences, ubiquity and redundancy, DNA sequences can be carefully chosen that constitute completely independent data from morphology. (See point 17 and 18 for more background about the molecular sequence evidence and for more detail about how it is independent of morphology.) It turns out that the phylogenetic tree determined by morphological analysis indeed matches closely the phylogenetic tree constructed from morphologically independent protein, DNA, and ribosomal RNA sequences. The degree of congruence between these independently derived phylogenies is extremely high, in terms of statistical significance.

In science, independent measurements of some value (such as a physical constant like the charge of the electron, the mass of the proton, or the speed of light) are never exact. There always exists some error in the measurement, and all independent measurements are incongruent to some extent. Of course, the true value of something is never known for certain - all we have are measurements that we hope approximate the true value. Scientifically, then, the important relevant questions are "When comparing two measurements, how much of a discrepancy does it take to be a problem?" and "How close must the measurements be in order to give a strong confirmation?" Scientists answer these questions with probability and statistics. Some measurements match with statistical significance, some do not.

So, how well do phylogenetic trees from morphological studies match the trees made from independent molecular studies? There are over 10⁴¹ different possible ways to arrange the 31 major taxa represented in Figure 1 into a phylogenetic tree (Felsenstein 1982; Li 1997, p. 102). In spite of the odds, the exact relationships given in Figure 1 were independently determined from morphological characters and from cytochrome c molecular studies (for consensus phylogenies from pre-molecular studies see Carter 1954, Figure 1, p. 13; Dodson 1960, Figures 43, p. 125, and Figure 50, p. 150; Osborn 1918, Figure 42, p. 161; Haeckel 1898, p. 55; Gregory 1951, Fig. opposite title page; for phylogenies from the early cytochrome c studies see McLaughlin and Dayhoff 1973; Dickerson and Timkovich 1975, pp. 438-439). Speaking quantitatively, these independent measurements have determined the standard phylogenetic tree, as shown in Figure 1, to better than 41 decimal places. This phenomenal corroboration of macroevolutionary theory is referred to as the "twin nested hierarchy."

In general, phylogenetic trees may be incongruent (i.e., some of the branches may not match), while still retaining an extremely high degree of statistical significance (Hendy et al. 1984; Penny et al. 1982; Penny and Hendy 1986; Steel and Penny 1993). Even for a phylogeny with a small number of organisms, the total number of possible trees is extremely large. For example, there are about a thousand different possible phylogenies for only six organisms; for nine organisms, there are millions of possible phylogenies; for 12 organisms, there are nearly 14 trillion different possible phylogenies (Felsenstein 1982; Li 1997, p. 102). Thus, the probability of finding two similar trees by chance via two independent methods is extremely small in most cases. In fact, two different trees of 16 organisms that mismatch by as many as 10 branches still match with high statistical significance (Hendy et al. 1984, Table 4; Steel and Penny 1993).

The stunning degree of match between even the most incongruent phylogenetic trees found in the biological literature is widely unappreciated, mainly because most people (including many scientists) are unaware of the mathematics involved (Penny et al. 1982; Penny and Hendy 1986). To put the significance of this incredible confirmation in perspective, consider the modern theory of gravity. Both Newton's Theory of Universal Gravitation and Einstein's General Theory of Relativity rely upon a fundamental physical constant, G, the gravitational constant. If these theories of gravity are correct, independent methods should determine similar values for G. However, to date, very precise independent measurements of the gravitational constant G disagree by nearly 1% (Kestenbaum 1998; Quinn 2000). Here is how David Kestenbaum describes the current scientific status of the theory of gravity, as reported in the prestigious journal Science:

"While the charge of the electron is known to seven decimal places, physicists lose track of G after only the third. For some, that's an embarrassment. 'It grates on me like a burr in the saddle,' says Alvin Sanders, a physicist at the University of Virginia in Charlottesville. Over the past few decades, he and a handful of other physicists have dedicated themselves to measuring G more accurately. To their dismay, they've come up with wildly different values. 'You might say we've had negative progress,' says Barry Taylor, a physicist at the National Institute of Standards and Technology (NIST) in Gaithersburg, Maryland. ... 'Nobody understands it [the far-out results of the PTB, the German standards lab in Braunschweig],' says Meyer. 'They must have made an unbelievable mistake, but we cannot find it.' ... says Terry Quinn, 'we may just have to throw the PTB result out.'" (Kestenbaum 1998)

Over two years later, the same Terry Quinn (of the International Bureau of Weights and Measures [BIPM] in Sèvres, France) summarized the situation in a review for the journal Nature:

"The current interest in measuring G was stimulated by the publication in 1996 of a value for G that differed by 0.6% from the accepted value given in the previous 1986 CODATA report. To take account of this, the 1998 CODATA report recommends a value for G ... with an uncertainty of 0.15%, some ten times worse than in 1986. Whereas the other fundamental constants were more accurately known in 1998 than in 1986, the uncertainty in G increased dramatically. The G community appeared to be going backwards rather than forwards." (Quinn 2000)

Nevertheless, a disagreement of just under 1% is still pretty good; it is not enough, at this point, to cause us to cast much doubt upon the validity and usefulness of modern theories of gravity. However, if tests of the theory of common descent performed that poorly, the standard phylogenetic trees independently determined by morphological and molecular methods would have to differ by more than 25 branches (out of 31)! In their quest for scientific perfection, some biologists are rightly rankled at the very minor discrepancies between some phylogenetic trees (Gura 2000; Patterson et al. 1993; Maley and Marshall 1998); however, the standard phylogenetic tree is known with much greater precision and accuracy than even the most well-determined physical constants (e.g., the charge of the electron is known to only seven decimal places, the Planck constant is known to eight decimal places, and the mass of the neutron, proton, and electron are all known to nine decimal places).

Furthermore, if common descent is true, we expect that including more data in phylogenetic analyses will increase the correspondence between phylogenetic trees. As explained in the phylogenetic caveats sidebar, gene trees are not equivalent to species trees (Avise and Wollenberg 1997). Genetics and heredity are stochastic (i.e. probabilistic) processes, and consequently we expect that phylogenies constructed with single genes will be partially incongruent. However, including multiple independent genes in a phylogenetic analysis should circumvent this difficulty. Phylogenetic trees constructed with multiple genes should thus be more accurate than those constructed with single genes, and indeed combined gene trees are more congruent (Baldauf et al. 2000; Hedges 1994; Hedges and Poling 1999; Penny et al. 1982).

Potential Falsification:

When it became possible to sequence biological molecules, the realization of a markedly different tree based on the independent molecular evidence would have been a fatal blow to the theory of evolution, even though that is by far the most likely result. More precisely, the common descent hypothesis would have been falsified if the universal phylogenetic trees determined from the independent molecular and morphological evidence did not match with statistical significance. Furthermore, we are now in a position to begin construction of phylogenetic trees based on other independent lines of data, such as chromosomal organization. In a very general sense, chromosome number and length and the chromosomal position of genes are all causally independent of both morphology and of sequence identity. Phylogenies constructed from these data should recapitulate the standard phylogenetic tree as well (Hillis, Moritz et al. 1996; Li 1997).

One common objection is the assertion that anatomy is not independent of biochemistry, and thus anatomically similar organisms are likely to be similar biochemically (e.g. in their molecular sequences) simply for functional reasons. According to this argument, then, we should expect phylogenies based on molecular sequences to be similar to phylogenies based on morphology even if organisms are not related by common descent. This argument is very wrong. There is no known biological reason, besides common descent, to suppose that similar morphologies must have similar biochemistry. Though this logic may seem quite reasonable initially, all of molecular biology refutes this "common sense" correlation. In general, similar DNA and biochemistry give similar morphology and function, but the converse is not true - similar morphology and function is not necessarily the result of similar DNA or biochemistry. The reason is easily understood once explained; many very different DNA sequences or biochemical structures can result in the same functions and the same morphologies (see point 17 and 18 for a detailed explanation).

As a close analogy, consider computer programs. Netscape works essentially the same on a Macintosh, an IBM, or a Unix machine, but the binary code for each program is quite different. Computer programs that perform the same functions can be written in most any computer language - Basic, Fortran, C, C++, Java, Pascal, etc. and identical programs can be compiled into binary code many different ways. Furthermore, even using the same computer language, there are many different ways to write any specific computer program, even using the same algorithms and subroutines. In the end, there is no reason to suspect that similar computer programs are written with similar code, based solely on the function of the program. This is the reason why software companies keep their source code secret, but they don't care that competitors can use their programs - it is essentially impossible to deduce the program code from the function and operation of the software. The same conclusion applies to biological organisms, for very similar reasons.

To reiterate, although similar genotypes (e.g. molecular sequences) often give similar phenotypes (e.g. morphological characters), similar phenotypes are not necessarily the result of similar genotypes. Thus, it is entirely possible that phylogenetic trees constructed from genotypic data could be radically different from phylogenetic trees constructed from phenotypic data. In fact, in the absence of common descent or any other reason to suppose that these two types of trees should be similar, the most likely result by far is that they will be radically different. This is precisely why it is possible to falsify the macroevolutionary prediction that independently derived phylogenies should be similar.

Prediction 4: Intermediate and transitional forms: the possible morphologies of predicted common ancestors

[Figure1.4.1
(cartoon of vertebrate jaws)]

Figure 1.4.1. The jaws of three vertebrates - mammal, therapsid, and pelycosaur. A side view of three idealized skulls of mammals, therapsids (mammal-like reptiles), and pelycosaurs (early reptiles). The figure shows the differences between mammal and reptilian jaws and ear-bone structures. The jaw joint is shown as a large black dot, the quadrate (mammalian anvil or incus) is in turquoise, the articular (mammalian hammer or malleus) is in yellow, and the angular (mammalian tympanic annulus) is in pink. (Reproduced with modification from Kardong 2002, pp. 275)

Any fossilized animals found should conform to the standard phylogenetic tree. If all organisms are united by descent from a common ancestor, then there is one single true historical phylogeny for all organisms, just like there is one single true historical genealogy for any individual human. It directly follows that if there is one true phylogeny, then all organisms fit in that phylogeny uniquely. In other words, all organisms, both past (e.g. fossils) and present, must conform to the true phylogeny. Since the standard phylogenetic tree is the best approximation of the true historical phylogeny, we expect that all fossilized animals should conform to the standard phylogenetic tree within the error of our scientific methods.

Every node shared between two branches in a phylogeny or cladogram represents a predicted common ancestor; thus there are ~30 common ancestors predicted from the tree shown in Figure 1. Our standard tree shows that the bird grouping is most closely related to the reptilian grouping, with a node linking the two (A in Figure 1); thus we predict the possibility of finding fossil intermediates between birds and reptiles. The same reasoning applies to mammals and reptiles (B in Figure 1). However, we predict that we should never find fossil intermediates between birds and mammals.

It should be pointed out that there is no requirement for intermediate organisms to go extinct. In fact, all living organisms can be thought of as intermediate between adjacent taxa in a phylogenetic tree. For instance, modern reptiles are intermediate between amphibians and mammals, and reptiles are also intermediate between amphibians and birds. As far as macroevolutionary predictions of morphology are concerned, this point is trivial, as it is essentially just a restatement of the concept of a nested hierarchy.

However, a phylogenetic tree does make significant predictions about the morphology of intermediates which no longer exist or which have yet to be discovered. Each predicted common ancestor has a set of explicitly specified morphological characteristics, based on each of the most common derived characters of its descendants and based upon the transitions that must have occurred to transform one taxa into another (Futuyma 1998, pp. 107-108). From the knowledge of avian and reptilian morphology, it is possible to predict some of the characteristics that a reptile-bird intermediate should have, if found. Therefore, while pterodactyl fossils are not considered possible candidates for reptile-bird intermediates (Futuyma 1998, pp. 154-155), we do expect the possibility of finding reptile-like fossils with feathers, bird-like fossils with teeth, or bird-like fossils with long reptilian tails.

Confirmation:

Example 1

In the case just mentioned, we have found a quite complete set of dinosaur-to-bird transitional fossils with no morphological "gaps" (Sereno 1999), represented by Ceratosaurus, Allosaurus, Sinosauropteryx, Protarchaeopteryx, Caudipteryx, Velociraptor, Archaeopteryx, Confuciusornis, Sinornis, and Columba, among many others (Carroll 1997, pp. 306-323; Sereno 1999). All have the expected possible morphologies (see Figure 3.11.1 from Prediction 11 for a few examples), including organisms such as Sinosauropteryx, Protarchaeopteryx, and Caudipteryx which are flightless bipedal dinosaurs with feathers (Chen, Dong et al. 1998; Qiang, Currie et al. 1998). The All About Archaeopteryx FAQ gives a detailed listing of the various characters of Archaeopteryx which are intermediate between reptiles and modern birds.

Example 2

We also have an exquisitely complete series of fossils for the reptile-mammal intermediates, ranging from the pelycosauria, therapsida, cynodonta, up to primitive mammalia (Carroll 1988, pp. 392-396; Futuyma 1998, pp. 146-151; Gould 1990; Kardong 2002, pp. 255-275). As mentioned above, the standard phylogenetic tree indicates that mammals gradually evolved from a reptile-like ancestor, and that there must have existed transitional species that were morphologically intermediate between reptiles and mammals - even though none are found living today. However, there are significant morphological differences between modern reptiles and modern mammals. Bones, of course, are what fossilize most readily, and that is where we look for transitional species from the past. Osteologically, two major striking differences exist between reptiles and mammals: (1) reptiles have at least four bones in the lower jaw (e.g. the dentary, articular, angular, surangular, and coronoid), while mammals have only one (the dentary), and (2) reptiles have only one middle ear bone (the stapes), while mammals have three (the hammer, anvil, and stapes). Early in the 20^th century, developmental biologists discovered something that further complicates the picture. In the reptilian fetus, two developing bones from the head eventually form two bones in the reptilian lower jaw, the quadrate and the articular (see the Pelycosaur in Figure 1.4.1). Surprisingly, the corresponding developing bones in the mammalian fetus eventually form the anvil and hammer of the unique mammalian middle ear (also known more formally as the incus and malleus, respectively) (Gilbert 1997, pp. 894-896). These facts strongly indicated that the hammer and anvil had evolved from these reptilian jawbones - that is, if common descent was in fact true. This result was so striking, and the required intermediates so outlandish, that many anatomists had extreme trouble imagining how transitional forms bridging these morphologies could have existed while retaining function. Young-earth creationist Duane Gish stated the problem this way:

"All mammals, living or fossil, have a single bone, the dentary, on each side of the lower jaw, and all mammals, living or fossil, have three auditory ossicles or ear bones, the malleus, incus and stapes. ... Every reptile, living or fossil, however, has at least four bones in the lower jaw and only one auditory ossicle, the stapes. ... There are no transitional fossil forms showing, for instance, three or two jawbones, or two ear bones. No one has explained yet, for that matter, how the transitional form would have managed to chew while his jaw was being unhinged and rearticulated, or how he would hear while dragging two of his jaw bones up into his ear." (Gish, 1978, p. 80)

Gish was incorrect in stating that there were no transitional fossil forms, and he has been corrected on this gaff numerous times since he wrote these words. However, Gish's statements nicely delineate the morphological conundrum at hand. Let's review the required evolutionary conclusion. During their evolution, two mammalian middle ear bones (the hammer and anvil, aka malleus and incus) were derived from two reptilian jawbones. Thus there was a major evolutionary transition in which several reptilian jawbones (the quadrate, articular, and angular) were extensively reduced and modified gradually to form the modern mammalian middle ear. At the same time, the dentary bone, a part of the reptilian jaw, was expanded to form the major mammalian lower jawbone. During the course of this change, the bones that form the hinge joint of the jaw changed identity. Importantly, the reptilian jaw joint is formed at the intersection of the quadrate and articular whereas the mammalian jaw joint is formed at the intersection of the squamosal and dentary (see Figure 1.4.1).

How could hearing and jaw articulation be preserved during this transition? As clearly shown from the many transitional fossils that have been found (see Figure 1.4.2), the bones that transfer sound in the reptilian and mammalian ear were in contact with each other throughout the evolution of this transition. In reptiles, the stapes contacts the quadrate, which in turn contacts the articular. In mammals, the stapes contacts the incus, which in turn contacts the malleus. Since the quadrate evolved into the incus, and the articular evolved into the malleus, these three bones were in constant contact during this impressive evolutionary change. Furthermore, a functional jaw joint was maintained by redundancy - several of the intermediate fossils have both a reptilian jaw joint (from the quadrate and articular) and a mammalian jaw joint (from the dentary and squamosal). Several late cynodonts and Morganucodon clearly have a double-jointed jaw. In this way, the reptilian-style jaw joint was freed to evolve a new specialized function in the middle ear. It is worthy of note that some modern species of snakes have a double-jointed jaw involving different bones, so such a mechanical arrangement is certainly possible and functional.

Since Figure 1.4.2 was made, sevveral important intermediate fossils have been discovered that fit between Morganucodon and the earliest mammals. These new discoveries include a complete skull of Hadrocodium wui (Luo et al. 2001) and cranial and jaw material from Repenomamus and Gobiconodon (Wang et al. 2001). These new fossil finds clarify exactly when and how the malleus, incus, and angular completely detached from the lower jaw and became solely auditory ear ossicles.

[Figure1.4.2 (cartoon of vertebrate
jaws)]

Figure 1.4.2. A comparison of the jawbones and ear-bones of several transitional forms in the evolution of mammals. Approximate stratigraphic ranges of the various taxa are indicated at the far left (more recent on top). The left column of jawbones shows the view of the left jawbone from the inside of the mouth. The right column is the view of the right jawbone from the right side (outside of the skull). As in Figure 1.4.1, the quadrate (mammalian anvil or incus) is in turquoise, the articular (mammalian hammer or malleus) is in yellow, and the angular (mammalian tympanic annulus) is in pink. For clarity, the teeth are not shown, and the squamosal upper jawbone is omitted (it replaces the quadrate in the mammalian jaw joint, and forms part of the jaw joint in advanced cynodonts and Morganucodon). Q = quadrate, Ar = articular, An = angular, I = incus (anvil), Ma = malleus (hammer), Ty = tympanic annulus, D = dentary. (Reproduced with modification from Kardong 2002, pp. 274)

Example 3

One of the most celebrated examples of transitional fossils is our collection of fossil hominids (see Figure 1.4.3 below). Based upon the consensus of numerous phylogenetic analyses, Pan troglodytes (the chimpanzee) is the closest living relative of humans. Thus, we expect that organisms lived in the past which were intermediate in morphology between humans and chimpanzees. Over the past century, many spectacular paleontological finds have identified such transitional hominid fossils.

(A) Pan troglodytes, chimpanzee, modern
(B) Australopithecus africanus, STS 5, 2.6 My
(C) Australopithecus africanus, STS 71, 2.5 My
(D) Homo habilis, KNM -ER 1813, 1.9 My
(E) Homo habilis, OH24 , 1.8 My
(F) Homo ergaster (late H. erectus), KNM -ER 3733, 1.75 My
(G) Homo heidelbergensis, "Rhodesia man," 300,000 - 125,000 y
(H) Homo sapiens neanderthalensis, La Ferrassie 1, 70,000 y
(I) Homo sapiens neanderthalensis, La Chappelle-aux-Saints, 60,000 y
(J) Homo sapiens neanderthalensis, Le Moustier, 45,000 y
(K) Homo sapiens sapiens, Cro-Magnon I, 30,000 y
(L) Homo sapiens sapiens, modern

Example 4

Another impressive example of incontrovertible transitional forms predicted to exist by evolutionary biologists is the collection of land mammal-to-whale fossil intermediates. Whales, of course, are sea animals with flippers. Since they are also mammals, the consensus phylogeny indicates that whales and dolphins evolved from land mammals with legs. In recent years, we have found several transitional forms of whales with legs, both capable and incapable of terrestrial locomotion.

There are many other examples such as these - most can be found in the excellent Transitional Vertebrate Fossils FAQ.

Potential Falsification:

Any finding of a striking mammal-bird intermediate would be highly inconsistent with common descent. Many other examples of prohibited intermediates can be thought of, based on the standard tree (Kemp 1982; Stanley 1993; Carroll 1997; Chaterjee 1997).

A subtle, yet important point is that a strict cladistic evolutionary interpretation precludes the possibility of identifying true ancestors; only intermediates or transitionals can be positively identified. (For the purposes of this article, transitionals and intermediates are considered synonymous.) The only incontrovertible evidence for an ancestor-descendant relationship is the observation of a birth; obviously this is normally rather improbable in the fossil record. Intermediates are not necessarily the same as the exact predicted ancestors; in fact, it is rather unlikely that they would be the same. Simply due to probability considerations, the intermediates that we find will most likely not be the true ancestors of any modern species, but will be closely related to the predicted common ancestor. The minor implication concerning fossil intermediates is that the intermediates we do find will likely have additional derived characters besides the characters that identified them as intermediates. Because of these considerations, when a new and important intermediate fossil species is discovered, careful paleontologists will often note that the transitional species under study is probably not an ancestor, but rather is an evolutionary "side-branch." For further clarification see prediction 25.

Prediction 5: Chronological order of intermediates

Fossilized intermediates should appear in the correct general chronological order based on the standard tree. Any phylogenetic tree predicts a relative chronological order of the evolution of hypothetical common ancestors and intermediates between these ancestors. For instance, in our current example, the reptile-mammal common ancestor (B) and intermediates should be older than the reptile-bird common ancestor (A) and intermediates.

Note, however, that there is some "play" within the temporal constraints demanded by any phylogeny, for two primary reasons: (1) the statistical confidence (or conversely, the error) associated with a phylogeny and its specific internal branches, and (2) the inherent resolution of the fossil record (ultimately stemming from the vagaries of the fossilization process). As mentioned earlier, most phylogenetic trees have some branches with high confidence, because they are well-supported by the data, and other branches in which we have less confidence, because they are statistically less significant and poorly supported by the data. See also the caveats associated with phylogenetic analysis.

When evaluating the geological order of fossils, remember that once a transitional species appears there is no reason why it must become extinct and be replaced. For instance, some organisms have undergone little change in as much as 100 to 200 million years in rare cases. Some familiar examples are the "living fossils," such as the coelacanth, which has persisted for approximately 80 million years; the bat, which has not changed much in the past 50 million years; and even the modern tree squirrel, which has not changed in 35 million years. In fact, paleontological studies indicate the average longevity of 21 living families of vertebrates is approximately 70 million years (Carroll 1997, p. 167).

Furthermore, the fossil record is demonstrably incomplete; species appear in the fossil record, then disappear, then reappear later. An exceptional instance is the coelacanth, which last appeared in the fossil record 80 million years ago, yet it is alive today. During the Cretaceous (a critical time in bird evolution), there is a 50 million-year gap in the diplodocoidean record, greater than a 40 million-year gap in the pachycephalosaurian record, greater than a 20 million-year gap in the trodontidiae, and about a 15 million-year gap in the oviraptosaurian fossil record (both of these last two orders of dinosaurs are maniraptoran coelurosaurian theropods, which figure significantly in the evolution of birds). During the Jurassic, there is a 40 million-year gap in the fossil record of the heterodontosauridae (Sereno 1999). Most organisms do not fossilize, and there is no reason why a representative of some species must be found in the fossil record. As every graduate student in scientific research knows (or eventually learns, perhaps the hard way), arguments based upon negative evidence are very weak scientific arguments, especially in the absence of proper positive controls. Thus, based on the fossil remains of modern species and the known gaps in the current paleontological records of extinct species, the observation of transitional species "out of order" by 40 million years should be fairly common. This degree of "play" in the fossil record is actually rather minor, considering that the fossil record of life spans between 2 to 3.8 billion years and that of multicellular organisms encompasses a total of ~660 million years. An uncertainty of 40 million years is equivalent to about a 1% or 6% relative error, respectively - rather small overall.

Confirmation:

The reptile-bird intermediates mentioned above date from the Upper Jurassic and Lower Cretaceous (about 150 million years ago), whereas pelycosauria and therapsida (reptile-mammal intermediates) are older and date from the Carboniferous and the Permian (about 250 to 350 million years ago, see the Geological Time Scale). This is precisely what should be observed if the fossil record matches the standard phylogenetic tree.

The most scientifically rigorous method of confirming this prediction is to demonstrate a positive corellation between phylogeny and stratigraphy, i.e. a positive corellation between the order of taxa in a phylogenetic tree and the geological order in which those taxa first appear and last appear (whether for living or extinct intermediates). For instance, within the error inherent in the fossil record, prokaryotes should appear first, followed by simple multicellular animals like sponges and starfish, then lampreys, fish, amphibians, reptiles, mammals, etc., as shown in Figure 1. Contrary to the erroneous (and unreferenced) opinions of some anti-evolutionists (e.g. Wise 1994, p. 225-226), studies from the past ten years addressing this very issue have confirmed that there is indeed a positive corellation between phylogeny and stratigraphy, with statistical significance (Benton 1998; Benton and Hitchin 1996; Benton and Hitchin 1997; Benton et al. 1999; Benton et al. 2000; Benton and Storrs 1994; Clyde and Fisher 1997; Hitchin and Benton 1993; Huelsenbeck 1994; Norell and Novacek 1992a; Norell and Novacek 1992b; Wills 1999). Using three different measures of phylogeny-stratigraphy correlation [the RCI, GER, and SCI (Ghosts 2.4 software, Wills 1999)], a high positive correlation was found between the standard phylogenetic tree portrayed in Figure 1 and the stratigraphic range of the same taxa, with very high statistical significance (P < 0.0001) (this work, Ghosts input file available upon request).

As another specific example, an early analysis published in Science by Mark Norell and Michael Novacek (Norell and Novacek 1992b) examined 24 different taxa of vertebrates (teleosts, amniotes, reptiles, synapsids, diapsids, lepidosaurs, squamates, two orders of dinosaurs, two orders of hadrosaurs, pachycephalosaurs, higher mammals, primates, rodents, ungulates, artiodactyls, ruminants, elephantiformes, brontotheres, tapiroids, chalicotheres, Chalicotheriinae, and equids). For each taxa, the phylogenetic position of known fossils was compared with the stratigraphic position of the same fossils. A positive correlation was found for all of the 24 taxa, 18 of which were statistically significant. Note that the correlation theoretically could have been negative. A statistically significant negative correlation would indicate that, in general, organisms rooted deeply in the phylogeny are found in more recent strata - a strong macroevolutionary inconsistency. However, no negative correlations were observed.

As a third example, Michael Benton and Rebecca Hitchin published a more recent, greatly expanded, and detailed stratigraphic analysis of 384 published cladograms of various multicellular organisms (Benton and Hitchin 1997). Using the three measures of congruence between the fossil record and phylogeny mentioned above (the RCI, GER, and SCI), these researchers observed values "skewed so far from a normal distribution [i.e. randomness] that they provide evidence for strong congruence of the two datasets [fossils and cladograms]." The results were overall extremely statistically significant (P < 0.0005). As the authors comment in their discussion:

"... the RCI and SCI metrics showed impressive left-skewing; the majority of cladograms tested show good congruence between cladistic and stratigraphic information. Cladists and stratigraphers may breathe easy: the cladistic method appears, on the whole, to be finding phylogenies that may be close to the true phylogeny of life, and the sequence of fossils in the rocks is not misleading. ... it would be hard to explain why the independent evidence of the stratigraphic occurrence of fossils and the patterns of cladograms should show such striking levels of congruence if the fossil record and the cladistic method were hopelessly misleading." (Benton and Hitchin 1997, p. 889)

Additionally, if the correlation between phylogeny and stratigraphy is due to common descent, we would expect the correlation to improve over longer geological time frames (since the relative error associated with the fossil record decreases). This is in fact observed (Benton et al. 1999). We also would expect the correlation to improve, not to get worse, as more fossils are discovered, and this has also been observed (Benton and Storrs 1994).

Potential Falsification:

It would be highly inconsistent if the chronological order were reversed in the reptile-bird and reptile-mammal example. More generally, the strongest falsification of this prediction would be the finding that there was a negative correlation between stratigraphy and the phylogenetic tree that describes the genealogical relatedness of all living organisms. Even the finding that there was no overall correlation, neither positive nor negative, between stratigraphy and the consensus phylogeny of the major taxa would be very problematic for the theory of common descent. In addition, the observed correlation could decrease over longer time frames or as we acquire more paleontological data - but neither is the case (Benton et al. 1999; Benton and Storrs 1994).

Based on the high confidence in certain branches of phylogenetic trees, some temporal constraints are extremely rigid. For example, we should never find mammalian or avian fossils in or before Devonian deposits, before reptiles had diverged from the amphibian tetrapod line. This excludes Precambrian, Cambrian, Ordovician, and Silurian deposits, encompassing 92% of the earth's geological history and 65% of the biological history of multicellular organisms. Even one incontrovertible find of any pre-Devonian mammal, bird, or flower would shatter the theory of common descent (Kemp 1982; Carroll 1988; Stanley 1993; Chaterjee 1997).

References

Archie, J. W. (1989). "A randomization test for phylogenetic information in systematic data." Systematic Zoology 38: 219-252.

Atchely, W. R. and W. M. Fitch (1991). "Gene trees and the origins of inbred strains of mice." Science 254: 554-558.

Avery, O. T., MacLeod, C. M. and M. McCarty. (1944). "Studies on the chemical nature of the substance inducing transformation of pneumococcal types." J. Exp. Med. 79:137-158.

Avise, J.C., and Wollenberg, K. (1997) "Phylogenetics and the origin of species." PNAS 94: 7748-7755.

Baldauf, S.L., Roger, A.J., Wenk-Siefert, I., and Doolittle, W.F. (2000) "A kingdom-level phylogeny of eukaryotes based on combined protein data." Science 290: 972-7.

Benton, M. J. (1998) "Molecular and morphological phylogenies of mammals: Congruence with stratigraphic data." Molecular Phylogenetics and Evolution 9: 398-407.

Benton, M. J. and Hitchin, R. (1996). "Testing the quality of the fossil record by groups and by major habitats." Historical Biol. 12: 111-157.

Benton, M. J. and Hitchin, R. (1997). "Congruence between phylogenetic and stratigraphic data on the history of life." Proc. R. Soc. Lond. B. 264: 885-890.

Benton, M. J., Hitchin, R., and Wills, M. A. (1999). "Assessing congruence between cladistic and stratigraphic data." Syst. Biol. 48: 581-596.

Benton, M. J. and Storrs, G. W. (1994). "Testing the quality of the fossil record: paleontological knowledge is improving." Geology 22: 111-114.

Benton, M. J., Wills, M. A., and Hitchin R. (2000). "Quality of the fossil record through time." Nature 403: 534-537.

Bush, R. M., C. A. Bender, et al. (1999). "Predicting the evolution of human influenza A." Science 286: 1921-1925.

Brooks, D. R. and. D. A. McLennan. (1991). Phylogeny, ecology, and behavior. Chicago, University of Chicago Press.

Carroll, R. L. (1988). Vertebrate Paleontology and Evolution. New York, W.H. Freeman and Co.

Carroll, R. L. (1997). Patterns and Processes of Vertebrate Evolution. Cambridge, Cambridge University Press.

Carter, G. S. (1954). Animal Evolution. London, Sidgwick and Jackson.

Chaterjee, S. (1997). The Rise of Birds: 225 million years of evolution. Baltimore, MD, Johns Hopkins University Press.

Chen, P.-j., Z.-m. Dong, et al. (1998). "An exceptionally well-preserved theropod dinosaur from the Yixian formation of China." Nature 391: 147-152.

Clyde, W.C. and Fisher, D.C. (1997) "Compaing the fit of stratigraphic and morphologic data in phylogenetic analysis." Paleobiology 23: 1-19.

Darwin, C. (1872). The Origin of Species. Sixth Edition. The Modern Library, New York.

Dickerson, R. E. and R. Timkovich (1975). cytochrome c. The Enzymes. P. D. Boyer. New York, Academic Press. 11: 397-547.

Dodson, E. O. (1960). Evolution: Process and Product. New York, Reinhold Publishers.

Doolittle, W. F. (1999). "Phylogenetic Classification and the Universal Tree." Science 284: 2124.

Doolittle, W. F. (2000). "The nature of the universal ancestor and the evolution of the proteome." Current Opinion in Structural Biology 10: 355-358.

Faith, D. P., and Cranston, P. S. (1991). "Could a cladogram this short have arisen by chance alone?: on permutation tests for cladistic structure." Cladistics 7: 1-28.

Farris, J.S. (1989). "The retention index and the rescaled consistency index." Cladistics 5:417-419.

Felsenstein, J. (1982). "Numerical methods for inferring evolutionary trees." Quart. Rev. Biol. 57: 379-404.

Felsenstein, J. (1985). "Confidence limits on phylogenies: an approach using the bootstrap." Evolution 39: 783-791.

Futuyma, D. (1998). Evolutionary Biology. Third edition. Sunderland, MA, Sinauer Associates.

Garavelli, J. S., Hou, Z., et al. (2001). "The RESID Database of protein structure modifications and the NRL-3D Sequence-Structure Database." Nucleic Acids Research 29: 199-201. http://www.ncifcrf.gov/RESID/

Gish, D. T. (1978) Evolution? The Fossils Say No! Public School Edition, San Diego: Creation-Life Publishers.

Gregory, W.K. (1951). Evolution Emerging: A survey of changing patterns from primeval life to man. Volume I. The Macmillan Co., New York.

Groves, C.P. (2000). "The genus Cheirogaleus: Unrecognized biodiversity in dwarf lemurs." International Journal of Primatology. 21(6): 943-962.

Gura, T. (2000) "Bones, molecules ... or both?" Nature 406: 230-233.

Haeckel, E. (1898). The Last Link. Adam and Charles Black, London.

Harris, T. E. (1989). The Theory of Branching Processes. New York, Dover.

Hedges, S.B. (1994) "Molecular evidence for the origin of birds." PNAS 91: 2621-2624.

Hedges, S.B. and Poling, L.L. (1999) "A molecular phylogeny of reptiles." Science 283: 998-1001.

Hendy, M. D., Little, C. H. C., and Penny, D. (1984). "Comparing trees with pendant vertices labelled." SIAM J. Appl. Math. 44: 1054-1065.

Hennig, W. (1966). Phylogenetic Systematics. Urbana, University of Illinios Press.

Hillis, D. M. (1991). "Discriminating between phylogenetic signal and random noise in DNA sequences." In Phylogenetic analysis of DNA sequences. pp. 278-294 M.M. Miyamoto and J. Cracraft, eds. Oxford University Press, New York.

Hillis, D. M. and J. J. Bull (1993). "An empirical test of bootstrapping as a method for assessing confidence on phylogenetic analysis." Syst. Biol. 42: 182-192.

Hillis, D. M., J. J. Bull, et al. (1992). "Experimental phylogenetics: Generation of a known phylogeny." Science 255: 589-592.

Hillis, D. M., and J. P. Huelsenbeck. (1992). "Signal, noise, and reliability in molecular phylogenetic analyses." Journal of Heredity 83: 189-195.

Hillis, D. M., C. Moritz, and B. K. Mable, Eds. (1996). Molecular systematics. Sunderland, MA, Sinauer Associates.

Hitchin, R. and Benton, M.J. (1997) "Congruence between parsimony and stratigraphy: comparisons of three indices." Paleobiology 23: 20-32.

Huelsenbeck, J. P. (1994). "Comparing the stratigraphic record to estimates of phylogeny." Palaeobiology 20: 470-483.

Huelsenbeck, J. P., Ronquist, F., Nielsen, R., and J. P. Bollback. (2001) "Bayesian inference of phylogeny and its impact on evolutionary biology." Science 294: 2310-2314.

Klassen, G.J., Mooi, R.D., and A. Locke. (1991). "Consistency indices and random data." Syst. Zool. 40:446-457.

Kemp, J. S. (1982). Mammal-like reptiles and the origin of mammals. New York, Academic Press.

Kestenbaum, D. (1998). "Gravity Measurements Close in on Big G." Science 282: 2180-2181.

Lehman, N. (2001). "Please release me, genetic code." Current Biology 11: R63-R66.

Li, W.-H. (1997). Molecular Evolution. Sunderland, MA, Sinauer Associates.

Luo, Z-X., Crompton, A. W. and A-L Sun. (2001) "A new mammaliaform from the early Jurassic and evolution of mammalian characteristics." Science 292: 1535-1539.

Lyons-Weiler, J., Hoeker, G. A., and Tausch, R. J. (1996). "Relative Apparent Synapomorphy Analysis (RASA) I: The Statistical Measurement of Phylogenetic Signal." Mol. Biol. Evol. 13: 749-757.

Maddison, W. P. and D. R. Maddison. (1992). MacClade. Sunderland, MA, Sinauer Associates.

Maley, L.E. and Marshall, C. R. (1998). "The coming age of molecular systematics." Science 279: 505-506.

McLaughlin, P. J. and M. O. Dayhoff (1973). "Eukaryote evolution: a view based on cytochrome c sequence data." Journal of Molecular Evolution 2: 99-116.

Norell, M. A. and Novacek, M. J. (1992a) "Congruence between superpositional and phylogenetic patterns: Comparing cladistic patterns with fossil records." Cladistics 8: 319-337.

Norell, M. A. and Novacek, M. J. (1992b) "The fossil record and evolution: Comparing cladistic and paleontologic evidence for vertebrate history." Science 255: 1690-93.

Norris, J. R. (1997). Markov Chains. Cambridge, Cambridge University Press.

Osborn, H.F. (1918). The Origin and Evolution of Life. Charles Scribner's Sons, New York.

Patterson, C., Williams, D., and Humphries, C. (1993). "Congruence between molecular and morphological phylogenies." Annual Review of Ecology and Systematics 24: 153-188.

Pei, M. (1949). The Story of Language. Philadelphia, Lippincott.

Penny, D. and Hendy M. D. (1986). "Estimating the reliability of phylogenetic trees." Mol. Biol. Evol. 3: 403-417.

Penny, D., Foulds, L. R., and Hendy, M. D. (1982). "Testing the theory of evolution by comparing phylogenetic trees constructed from five different protein sequences." Nature 297: 197-200.

Penny, D., Hendy, M. D., and Steel, M. A. (1991). "Testing the theory of descent." In Phylogenetic Analysis of DNA Sequences, eds. Miyamoto, M. and Cracraft, J., Oxford University Press, New York. pp. 155-183.

Qiang, J., Currie, P. J. et al. (1998). "Two feathered dinosaurs from northeastern China." Nature 393: 753-761.

Quinn, T. (2000). "Measuring big G." Nature 408: 919-921.

Rasoloarison, R.M., Goodman, S.M., and Ganzhorn, J.U. (2000). "Taxonomic revision of mouse lemurs (Microcebus) in the western portions of Madagascar." International Journal of Primatology. 21(6): 963-1019.

Ringe, D., (1999). "Language classification: scientific and unscientific methods." in The Human Inheritance, ed. B. Sykes. Oxford, Oxford University Press, pp. 45-74.

Rozenski, J., P. F. Crain, et al. (1999). "The RNA Modification Database: 1999 update." Nucleic Acids Research 27: 196-197. http://medlib.med.utah.edu/RNAmods/

Sereno, P. C. (1999). "The Evolution of Dinosaurs." Science 284: 2137-2147.

Stanley, S. (1993). Earth and Life Through Time. New York, W.H. Freeman.

Steel, M.A. and D. Penny. (1993). "Distributions of tree comparison metrics - some new results." Systematic Biology 42: 126-141.

Swofford, D. L., Olsen, G. J., Waddell, P. J., and Hillis, D. M. (1996). "Phylogenetic inference." In Molecular Systematics, pp 407-514. Hillis, D. M., Moritiz, C. and Mable, B. K. eds., Sinauer, Sunderland, Massachusetts.

Thalmann, U. and Geissmann, T. (2000). "Distribution and geographic variation in the western woolly lemur (Avahi occidentalis) with description of a new species (A. unicolor)." International Journal of Primatology. 21(6): 915-941.

van Roosmalen, M.G.M., van Roosmalen, T., Mittermeier, R. A. and Rylands, A. B. (2000). "Two new species of marmoset, genus Callithrix Erxleben, 1777 (Callitrichidae, Primates), from the Tapajos/Madeira interfluvium, south central Amazonia." Neotropical Primates 8(1): 2-18.

Voet, D. and J. Voet. (1995). Biochemistry. New York, John Wiley and Sons.

Wang, Y. Hu, Y., Meng, J. and C. Li. (2001) "An ossified meckel's cartilage in two Cretaceous mammals and origin of the mammalian middle ear." Science 294: 357-361.

Wills, M. A. (1999). "Congruence between phylogeny and stratigraphy: Randomization tests and the gap excess ratio." Syst. Biol. 48: 559-580.

Wilson, E. O. (1992). The Diversity of Life. Cambridge, MA, Harvard University Press.

Wise, K. P. (1994). "The Origin of Life's Major Groups." In The Creation Hypothesis, pp. 211-234. Moreland, J.P. ed., InterVarsity Press, Downers Grove, IL.

Woese, C. (1998). "The universal ancestor." PNAS 95: 6854-6859.

Yockey, H. P. (1992). Information Theory and Molecular Biology. New York, Cambridge University Press.

Yoder, A.D., Rasoloarison, R.M., Goodman, S.M., Irwin, J.A., Atsalis, S., Ravosa, M.J., and Ganzhorn, J.U. (2000). "Remarkable species diversity in Malagasy mouse lemurs (primates, Microcebus)." Proc Natl Acad Sci 97(21): 11325-30.

Zhu, T., B. Korber, et al. (1998). "An African HIV-1 sequence from 1959 and implications for the origin of the epidemic." Nature 391: 594-597.

29 Evidences for Macroevolution

Part 1:
The One True Phylogenetic Tree

Part 1 Outline

Phylogenetics introduction

Evidences

Introduction to Phylogenetics

Phylogenetic Reconstructions: Reliability

A method for determining the true phylogenetic tree: Cladistics

Phylogenetic Reconstructions: Caveats

Cladistics

Prediction 1: The fundamental unity of life

Confirmation:

Potential Falsification:

Prediction 2: A nested hierarchy of species

Phylogenetic Reconstructions: Caveats

Molecular methods

Confirmation:

Potential Falsification:

Prediction 3: convergence of independent phylogenies

Confirmation:

Potential Falsification:

Prediction 4: Intermediate and transitional forms: the possible morphologies of predicted common ancestors

Confirmation:

Example 1

Example 2

Example 3

Example 4

Potential Falsification:

Prediction 5: Chronological order of intermediates

Confirmation:

Potential Falsification:

References

29 Evidences for Macroevolution

Part 1:The One True Phylogenetic Tree

Part 1 Outline

Phylogenetics introduction

Evidences

Introduction to Phylogenetics

Phylogenetic Reconstructions: Reliability

A method for determining the true phylogenetic tree: Cladistics

Phylogenetic Reconstructions: Caveats

Cladistics

Prediction 1: The fundamental unity of life

Confirmation:

Potential Falsification:

Prediction 2: A nested hierarchy of species

Phylogenetic Reconstructions: Caveats

Molecular methods

Confirmation:

Potential Falsification:

Prediction 3: convergence of independent phylogenies

Confirmation:

Potential Falsification:

Prediction 4: Intermediate and transitional forms: the possible morphologies of predicted common ancestors

Confirmation:

Example 1

Example 2

Example 3

Example 4

Potential Falsification:

Prediction 5: Chronological order of intermediates

Confirmation:

Potential Falsification:

References

Part 1:
The One True Phylogenetic Tree