Protein Synthesis Explained: Transcription and Translation
Your body makes hundreds of thousands of different proteins — structural components like collagen, enzymes that drive every chemical reaction, antibodies that fight disease, hormones that carry signals. All of them are built by the same two-stage process: transcription (copying a gene from DNA into RNA) and translation (reading the RNA to assemble a protein). Understanding this process is essential to understanding genetics, medicine, and biotechnology.
The Central Dogma
The flow of genetic information in cells follows a principle called the central dogma of molecular biology: DNA → RNA → protein. DNA stores the instructions; RNA carries them from the nucleus to the cytoplasm; ribosomes use them to build proteins. This directional flow is almost universal in all living cells. (Retroviruses such as HIV are exceptions: they use reverse transcriptase to copy RNA back into DNA.)
The genetic code is read in triplets of bases called codons. Each codon (three consecutive bases on the mRNA) specifies one amino acid. There are 4³ = 64 possible codons but only 20 amino acids, so most amino acids are coded for by more than one codon — the code is degenerate. Three codons (UAA, UAG, UGA) are stop codons that signal the end of a protein chain. The codon AUG codes for methionine and also serves as the universal start codon.
Stage 1: Transcription (in the Nucleus)
- Initiation. The enzyme RNA polymerase binds to a specific sequence on the DNA called the promoter, located just before the gene to be copied. The two strands of the double helix unwind and separate at this point.
- Elongation. RNA polymerase moves along the template strand of the DNA in the 3′ to 5′ direction, reading the bases and assembling a complementary strand of messenger RNA (mRNA). The base-pairing rules apply — A pairs with U (uracil replaces thymine in RNA), T pairs with A, C pairs with G, G pairs with C. The mRNA strand grows in the 5′ to 3′ direction.
- Termination. RNA polymerase reaches a terminator sequence on the DNA. The newly made pre-mRNA molecule detaches, and the DNA strands re-join.
- mRNA processing (in eukaryotes). The initial transcript (pre-mRNA) is processed before leaving the nucleus. A protective 5′ cap and a poly-A tail are added to protect the mRNA from degradation and assist in ribosome binding. Introns (non-coding sequences) are spliced out; the remaining exons (coding sequences) are joined together. The mature mRNA exits through nuclear pores into the cytoplasm.
Stage 2: Translation (at the Ribosome)
- Initiation. The mRNA binds to the small subunit of a ribosome. The ribosome scans the mRNA until it finds the start codon, AUG. A transfer RNA (tRNA) carrying the amino acid methionine (with the anticodon UAC) hydrogen-bonds to this start codon. The large ribosomal subunit joins, completing the ribosome assembly.
- Elongation. The ribosome has three binding sites for tRNA molecules: the A (aminoacyl) site, P (peptidyl) site, and E (exit) site. A new tRNA carrying the next amino acid enters the A site, its anticodon complementary to the mRNA codon at that position. A peptide bond forms between the amino acid in the P site and the new one in the A site. The ribosome then moves three bases along the mRNA (translocation): the tRNA in the P site moves to the E site and is released, the tRNA in the A site shifts to the P site, and the A site is free for the next tRNA.
- Termination. When a stop codon (UAA, UAG, or UGA) enters the A site, no tRNA recognises it. Instead, a release factor protein binds, causing the ribosome to disassemble and the completed polypeptide chain to be released.
Post-Translational Modification
The polypeptide released from the ribosome is not always immediately functional. It must fold into its correct three-dimensional shape, often assisted by helper proteins called chaperones. Many proteins are further modified: signal peptides may direct them to the rough endoplasmic reticulum for secretion; sugar chains may be added in the Golgi apparatus (glycosylation); disulfide bridges may form between cysteine residues; or the protein may be cleaved into a shorter, active form (as happens when the hormone insulin is processed from its precursor).
Frequently Asked Questions
- Why does the cell use RNA rather than copying DNA directly to the ribosome?
- DNA must stay safe in the nucleus as the permanent master copy. Making an RNA copy allows many ribosomes to simultaneously read the same gene without risking damage to the DNA, and the mRNA can be degraded once it is no longer needed, allowing fine control of protein levels.
- What is the difference between mRNA, tRNA, and rRNA?
- mRNA (messenger RNA) carries the coded sequence from the gene. tRNA (transfer RNA) brings amino acids to the ribosome; each has an anticodon that matches a specific codon. rRNA (ribosomal RNA) is a structural and catalytic component of the ribosome itself.
- How does a mutation in DNA affect the protein?
- A single base change (point mutation) may change one codon, inserting a different amino acid (missense mutation), creating a stop codon (nonsense mutation), or leaving the same amino acid due to degeneracy (silent mutation). Insertions or deletions shift the reading frame of all subsequent codons (frameshift mutation), usually producing a non-functional protein.
Summary
Protein synthesis proceeds in two stages. Transcription in the nucleus uses RNA polymerase to copy a gene from the template DNA strand into an mRNA molecule; in eukaryotes the pre-mRNA is then processed by splicing out introns and adding a cap and tail. Translation at the ribosome reads the mRNA codon by codon: tRNAs bring the matching amino acids, peptide bonds link them into a growing chain, and a stop codon triggers release of the finished polypeptide. This process connects the information stored in DNA to the functional molecules that drive every aspect of cell life.