How Chargaff’s Data Falsified the Tetranucleotide Hypothesis
- Erwin Chargaff’s chemical analyses of DNA showed that base composition varies among species and follows consistent pairing patterns (A=T, G=C).
- These results falsified the earlier tetranucleotide hypothesis, illustrating how science advances through falsification rather than proof.
The Problem of Induction
Induction
A way of reasoning where general principles are drawn from specific observations.
Example
Observing many white swans and concluding “all swans are white.”
- The problem is that no number of consistent observations can guarantee truth. one black swan disproves the claim.
- This is known as the problem of induction.
Scientific theories can never be proven absolutely true, only supported until contradictory evidence arises.
The Certainty of Falsification
Falsification
The idea that a hypothesis must make testable predictions that can be proven false by evidence.
- If a prediction fails, the hypothesis must be rejected or revised.
- So, if induction collects evidence; falsification tests it.
The Tetranucleotide Hypothesis
- In the early 1900s, scientists proposed that DNA was made of a repeating sequence of four nucleotides (A, T, G, C) in equal proportions.
- If this were true, DNA would be too simple to encode genetic information, so proteins were believed to be the genetic material instead.
- Assuming early scientists already knew DNA had complex coding capacity, they didn’t.
- Many things that seem obvious to us today were not obvious at the time.
Chargaff’s Findings
- By chemically analyzing DNA from many species, Chargaff discovered:
- The total number of purines = total number of pyrimidines.
- Purines: Adenine (A), Guanine (G).
- Pyrimidines: Cytosine (C), Thymine (T).
- Within each DNA sample:
- %A ≈ %T.
- %G ≈ %C.
- The relative proportion of A+T vs. G+C varies between species.
- Example: Humans ~31% A, 31% T, 19% G, 19% C.
- Mycobacterium tuberculosis: ~15% A, 15% T, 35% G, 35% C.
- The total number of purines = total number of pyrimidines.
The difference in base composition between humans and bacteria showed that DNA varies across species, contradicting the idea of identical repeating units.
How Chargaff Falsified the Hypothesis
- Chargaff’s results disproved the tetranucleotide hypothesis because:
- The bases were not present in equal amounts (A, T, G, C ≠ 25% each).
- The proportions differed among species, showing DNA had structural complexity.
- These findings directly falsified the prediction made by the hypothesis.
The strength of this discovery wasn’t that Chargaff “proved” anything, it’s that his data contradicted a specific, testable claim.
Self review- What does “induction” mean in scientific reasoning?
- What is the main limitation of induction?
- Who proposed the concept of falsification?
- What prediction did the tetranucleotide hypothesis make about DNA composition?
- What two base-pairing patterns did Chargaff discover?
- How did Chargaff’s findings differ between species?
- Why did Chargaff’s data falsify the tetranucleotide hypothesis?
- How did his work influence later discoveries about DNA structure?


