Cladogram
A cladogram is a diagram that shows the evolutionary relationships between species. It is constructed based on shared characteristics, such as similarities and differences in DNA base sequences or amino acid sequences of proteins.
Cladograms Represent Evolutionary Relationships
- Cladograms are branching diagrams showing evolutionary relationships between species.
- They're built using DNA base sequences or protein amino acid sequences, which provide objective evidence of shared ancestry.
- The more similar the sequences, the more recently two species are likely to have diverged.
- The greater the sequence differences, the longer ago they are likely to have diverged.
Why Use Base and Amino Acid Sequences?
- DNA and proteins accumulate changes over time as a result of mutations.
- These changes accumulate at a roughly constant rate, so the number of differences between two species can be used to estimate how long ago they diverged.
Base Sequence Comparisons
- DNA base sequences provide the most direct genetic evidence.
- Mutations alter the order of bases (A, T, C, G) over evolutionary time.
- Sequence alignment software compares stretches of DNA from two or more species, identifying similarities and differences.
- Closely related species show long stretches of identical sequences.
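The comparison step described above can be sketched in a few lines of Python. This is a minimal illustration, not how real alignment software works: it assumes the sequences are already aligned and of equal length, and the function names (`count_differences`, `pairwise_differences`) and the three sequences are hypothetical, invented only for this sketch.

```python
from itertools import combinations


def count_differences(seq_a, seq_b):
    """Count the positions at which two aligned, equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for base_a, base_b in zip(seq_a, seq_b) if base_a != base_b)


def pairwise_differences(sequences):
    """Return the difference count for every pair of named sequences."""
    return {
        (name_a, name_b): count_differences(sequences[name_a], sequences[name_b])
        for name_a, name_b in combinations(sequences, 2)
    }


# Hypothetical aligned sequences, invented for illustration only.
example = {
    "species_1": "GATTACAGGC",
    "species_2": "GATTACAGGT",
    "species_3": "GACTACTGGA",
}

distances = pairwise_differences(example)
closest_pair = min(distances, key=distances.get)

for pair, differences in distances.items():
    print(pair, differences)
print("closest pair:", closest_pair)
```

With these made-up sequences, species_1 and species_2 differ at one position while each differs from species_3 at three, so the first two would be grouped together.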
Amino Acid Sequence Comparisons
- Mutations in DNA can lead to changes in amino acids in proteins.
- Comparing amino acid sequences allows us to estimate evolutionary distance between species.
- Conserved proteins like cytochrome c (respiration) or haemoglobin (oxygen transport) are commonly used because they evolve slowly.
- Slow evolution makes these proteins ideal for studying distant evolutionary relationships across large groups (e.g., mammals, birds, reptiles).
Parsimony and Cladogram Construction
Parsimony
Parsimony favours the simplest explanation, that is, the one requiring the fewest evolutionary changes.
- Computers analyse sequence data using the principle of parsimony, which selects the cladogram requiring the fewest evolutionary changes.
- This assumes the simplest explanation is most likely.
- Multiple cladograms can be generated from the same data, but the most parsimonious one is chosen as most probable.
It's more likely that fur evolved once in mammals than multiple times independently, so all mammals are grouped in a single clade.
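To make "fewest evolutionary changes" concrete, the sketch below scores candidate cladograms with Fitch's algorithm, one standard way of counting the minimum number of changes a given tree requires. It is an illustration under stated assumptions, not a description of any particular software: the four species names, their sequences, and the function names (`fitch_changes`, `parsimony_score`) are hypothetical. Four species are used because three species without an outgroup cannot be distinguished by parsimony score alone.

```python
def fitch_changes(tree, states):
    """Return (candidate ancestral states, minimum changes) for one site.

    `tree` is either a species name (a leaf) or a pair of subtrees.
    `states` maps each species name to its base at the site being scored.
    """
    if isinstance(tree, str):
        return {states[tree]}, 0
    left, right = tree
    left_states, left_changes = fitch_changes(left, states)
    right_states, right_changes = fitch_changes(right, states)
    shared = left_states & right_states
    if shared:
        # The two subtrees can share a state: no extra change is counted.
        return shared, left_changes + right_changes
    # The two subtrees cannot share a state: count one additional change.
    return left_states | right_states, left_changes + right_changes + 1


def parsimony_score(tree, sequences):
    """Sum the minimum number of changes over every aligned site."""
    length = len(next(iter(sequences.values())))
    return sum(
        fitch_changes(tree, {name: seq[site] for name, seq in sequences.items()})[1]
        for site in range(length)
    )


# Hypothetical aligned sequences for four species, invented for illustration.
sequences = {
    "sp1": "ACATG",
    "sp2": "ACGTC",
    "sp3": "GTATC",
    "sp4": "GTGTC",
}

# The three possible ways of grouping four species into two pairs.
candidate_trees = {
    "((sp1,sp2),(sp3,sp4))": (("sp1", "sp2"), ("sp3", "sp4")),
    "((sp1,sp3),(sp2,sp4))": (("sp1", "sp3"), ("sp2", "sp4")),
    "((sp1,sp4),(sp2,sp3))": (("sp1", "sp4"), ("sp2", "sp3")),
}

scores = {
    label: parsimony_score(tree, sequences)
    for label, tree in candidate_trees.items()
}
most_parsimonious = min(scores, key=scores.get)

for label, score in scores.items():
    print(label, score)
print("most parsimonious:", most_parsimonious)
```

For this made-up data the three trees need 5, 6, and 7 changes respectively, so the first grouping would be chosen. Real analyses search far more trees than can be listed by hand, which is why the work is done by computer.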
Simple Example 1: DNA Base Sequences
- Consider three species, A, B, and C, with the following DNA sequences for a specific gene:
- Species A: ATGCTACG
- Species B: ATGCTGCG
- Species C: ATGATCCG
- Step 1: Compare Sequences
- A vs. B: 1 difference (A vs. G at position 6).
- A vs. C: 2 differences (C vs. A at position 4, A vs. C at position 6).
- B vs. C: 2 differences (C vs. A at position 4, G vs. C at position 6).
- Step 2: Group Based on Similarity
- Species A and B have the fewest differences (1), so they are more closely related.
- Species C differs from both A and B at 2 positions, so it diverged earlier.
      ┌── A
   ┌──┤
   │  └── B
───┤
   └── C
- Don't assume that the most detailed or complex cladogram is the most accurate.
- Remember, parsimony prioritizes simplicity, not complexity.
Simple Example 2: Amino Acid Sequences
- Here are amino acid sequences for the same protein in three species:
- Species X: MET-ALA-GLY-VAL
- Species Y: MET-ALA-GLY-ILE
- Species Z: MET-SER-GLY-LEU
- Step 1: Compare Sequences
- X vs. Y: 1 difference (VAL vs. ILE at position 4).
- X vs. Z: 2 differences (ALA vs. SER at position 2, VAL vs. LEU at position 4).
- Y vs. Z: 2 differences (ALA vs. SER at position 2, ILE vs. LEU at position 4).
- Step 2: Group Based on Similarity
- Species X and Y are more closely related (1 difference), while Z differs from each of them at 2 positions and diverged earlier.
      ┌── X
   ┌──┤
   │  └── Y
───┤
   └── Z
- DNA sequences act as a "molecular clock," with mutations accumulating over time.
- This allows scientists to estimate when species diverged from a common ancestor.
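The molecular clock idea can be turned into a rough calculation. The sketch below assumes a constant substitution rate and ignores repeated mutations at the same site, both of which are simplifications. The function name `estimate_divergence_time`, the difference count, and the rate are hypothetical values chosen only for illustration. The factor of 2 reflects that both lineages accumulate changes independently after they split.

```python
def estimate_divergence_time(differences, sequence_length, rate_per_site_per_year):
    """Estimate years since two species diverged from a common ancestor.

    Assumes a constant rate of change (a strict molecular clock) and that
    each differing site reflects a single substitution.
    """
    if sequence_length <= 0 or rate_per_site_per_year <= 0:
        raise ValueError("sequence length and rate must be positive")
    proportion_different = differences / sequence_length
    # Divide by 2 because changes accumulate along both lineages.
    return proportion_different / (2 * rate_per_site_per_year)


# Hypothetical values: 20 differences in 1000 aligned bases,
# with an assumed rate of 1e-9 substitutions per site per year.
years = estimate_divergence_time(20, 1000, 1e-9)
print(f"Estimated divergence: about {years:,.0f} years ago")
```

With these assumed numbers the estimate comes out at roughly 10 million years. In practice the rate has to be calibrated, for example against fossils of known age, and it varies between genes and lineages.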
How does the principle of parsimony reflect broader scientific methods? Think about how simplicity is valued in theories across disciplines like physics or economics.
Self review
- Define a clade and explain how base sequence data is used to identify clades.
- Why are conserved proteins such as cytochrome c often used in molecular comparisons?
- Describe the principle of parsimony and explain its role in cladogram construction.
- What are the main strengths and limitations of using sequence data for cladistics?



