← All Guides

DNA Replication Explained

Every time a cell divides, it must first duplicate its entire genome. DNA replication is the process by which a cell makes an identical copy of its DNA — a feat of molecular precision carried out by a team of specialised enzymes.

Why Cells Need to Replicate DNA

Every human cell contains roughly 3.2 billion base pairs of DNA packed into 23 pairs of chromosomes. Before a cell can divide — whether by mitosis to produce two identical daughter cells or by meiosis to produce gametes — it must copy all of that genetic information so each daughter cell receives a complete set. Errors in replication can cause mutations, cancer, or cell death, so the process is both remarkably fast and remarkably accurate: human DNA polymerase makes about one error per billion base pairs copied, and most of those errors are immediately corrected.

Semi-Conservative Replication

DNA is a double helix: two complementary strands wound around each other, held together by hydrogen bonds between paired bases (adenine with thymine; guanine with cytosine). During replication, the two strands are separated and each serves as a template for a new complementary strand. The result is two double-stranded DNA molecules, each containing one original strand and one newly synthesised strand. This is called semi-conservative replication, because each daughter molecule conserves half of the parent molecule.

The semi-conservative model was confirmed experimentally in 1958 by Matthew Meselson and Franklin Stahl, who grew bacteria in a medium containing a heavy isotope of nitrogen and then switched them to normal nitrogen. After one round of replication, all DNA was hybrid (half heavy, half light), exactly as the semi-conservative model predicted.

Key Enzymes and Their Roles

DNA replication requires a coordinated team of proteins working at the replication fork — the Y-shaped site where the helix is being unwound:

  • Helicase unwinds and separates the two strands by breaking the hydrogen bonds between base pairs, creating the replication fork. It moves along the DNA at roughly 1,000 base pairs per second.
  • Single-strand binding proteins (SSBPs) bind to the separated strands and hold them apart, preventing them from re-annealing before copying can occur.
  • Topoisomerase relieves the torsional strain ahead of the fork by cutting, swivelling, and rejoining the DNA strands, preventing the helix from over-coiling and snapping.
  • Primase synthesises short RNA primers (typically 10–12 nucleotides long) on each template strand. DNA polymerase cannot start a new chain from scratch — it can only add to an existing 3′ end, so the primer provides that starting point.
  • DNA polymerase III (in bacteria; DNA polymerase δ and ε in eukaryotes) reads the template strand in the 3′ to 5′ direction and synthesises a new complementary strand in the 5′ to 3′ direction, adding matching nucleotides one at a time. It also proofreads as it goes, excising and replacing incorrect bases.
  • DNA polymerase I removes the RNA primers and replaces them with DNA.
  • DNA ligase seals the nicks between DNA fragments by forming the final phosphodiester bonds, joining the pieces into a continuous strand.

Leading and Lagging Strands

Because DNA polymerase can only synthesise in the 5′ to 3′ direction, the two template strands are copied differently:

The leading strand runs antiparallel to the direction of fork movement (i.e., its template is read 3′ to 5′ as the fork moves). DNA polymerase can synthesise continuously along this strand in one smooth run — it needs only a single primer at the origin of replication.

The lagging strand runs in the same direction the fork is moving, so its template is oriented 5′ to 3′. DNA polymerase cannot work backwards, so this strand is copied in short discontinuous segments called Okazaki fragments (100–200 nucleotides long in eukaryotes, 1,000–2,000 in bacteria). Each fragment requires its own RNA primer. After synthesis, the primers are removed, gaps are filled with DNA, and ligase joins the fragments into a continuous strand.

Okazaki Fragments

Reiji Okazaki and his colleagues discovered discontinuous lagging strand synthesis in 1968 while studying bacteriophage T4. The short fragments temporarily produced on the lagging strand are named after him. Their discovery explained how DNA polymerase's directional constraint is overcome during replication.

Origins of Replication

Replication does not start at a random position in the chromosome. It begins at specific DNA sequences called origins of replication. In bacteria, there is typically a single origin (oriC in Escherichia coli). Eukaryotic chromosomes are so large that replication initiates simultaneously at thousands of origins — a human chromosome may have one origin every 100,000 base pairs — and the resulting bubbles grow and merge until the entire chromosome is copied. This parallelism is why human cells can replicate their genome in 6–8 hours rather than the weeks it would take if replication proceeded from a single point.

Proofreading and Error Correction

DNA polymerase III has a 3′ to 5′ exonuclease activity — it can recognise and remove a mismatched base it has just added and insert the correct one. This proofreading reduces the error rate from about one in 105 base pairs to about one in 107. A second layer of correction, mismatch repair, scans newly synthesised DNA and fixes errors that slipped through, bringing the final error rate to roughly one in 109 base pairs.

Summary

DNA replication is a semi-conservative process in which each strand of the double helix serves as a template for a new complementary strand. Helicase unwinds the helix, primase lays down RNA primers, and DNA polymerase extends new strands 5′ to 3′. The leading strand is synthesised continuously; the lagging strand is synthesised as Okazaki fragments that are later joined by ligase. Multiple origins of replication allow eukaryotes to copy enormous genomes in hours. Proofreading and mismatch repair enzymes keep the error rate extraordinarily low, protecting the fidelity of inherited information from one generation to the next.