Module 2: The RNA World

Modern biology hands genetic memory to DNA and catalysis to proteins, with RNA as a messenger. But this neat division of labour cannot have been the starting point: information and function must have been unified in a single molecule. The RNA World hypothesis(Woese 1967, Crick 1968, Orgel 1968) proposes that early life used RNA for both. The discovery of catalytic RNA (ribozymes) by Cech and Altman, and the fact that peptide bond formation in the ribosome is ribozyme-catalysed, provide strong empirical support. This module examines the evidence, the in-vitro evolution of ribozymes, Eigen's quasispecies theory, and the pre-RNA-world alternatives.

Hammerhead ribozymeStem IStem IIStem III (substrate)CUGANcleavage siteMgSelf-replication cycleRibozyme ERibozyme FE + A + B -> FF + A' + B' -> ESubstrate pool A, B, A', B'N(t) grows exponentially

2.1 Evidence for the RNA World

Five major lines of evidence point toward a historical RNA-first stage:

  1. Catalytic RNA (ribozymes). Tom Cech (1981) showed that the Group I intron of Tetrahymena self-splices, and Sidney Altman (1983) proved that the catalytic component of bacterial RNase P is its RNA (M1), not protein. Both shared the 1989 Nobel in Chemistry.
  2. The ribosome is a ribozyme. X-ray crystallography (Yonath, Ramakrishnan, Steitz; 2009 Nobel) revealed that the peptidyl transferase centre (PTC) contains no protein — peptide bond formation is catalysed entirely by 23S rRNA.
  3. RNA cofactors. Central metabolites (ATP, NAD+, FAD, coenzyme A, SAM) are all RNA-derived, fossilised relics of an age when RNA ran metabolism.
  4. Deoxyribose from ribose. In modern cells, ribonucleotide reductase makes DNA from RNA — the synthetic precedence is unambiguous.
  5. Riboswitches. Natural RNAs in bacteria still sense small molecules and regulate gene expression — direct descendants of an RNA-run regulatory logic.

2.2 Ribozymes & Catalytic Chemistry

Natural ribozymes span five structural classes: hammerhead, hairpin, HDV, Varkud satellite, and glmS. All catalyse RNA cleavage or ligation with \(k_\text{cat}/k_\text{uncat}\) rate enhancements from \(10^5\) to \(10^9\) — good but modest compared to protein enzymes ( \(10^{14}\)+). The PTC rate enhancement is only \(\sim 10^7\) but involves positioning rather than chemistry; the RNA backbone templates the attacking α-amino group.

Metal ion catalysis

Most natural ribozymes depend on divalent cations, typically Mg2+, which:

  • Neutralise phosphate backbone charge, allowing compact folding
  • Activate the 2′-OH nucleophile by deprotonation
  • Stabilise the pentacoordinate phosphorane transition state
  • Shield negative charge on the leaving group

The two-metal-ion mechanism proposed by Steitz & Steitz (1993) operates in HDV, group I/II introns, and the RNase P active site. In Hadean oceans (warm, Mg-rich from serpentinite weathering), Mg2+ was likely abundant.

In-vitro evolution & SELEX

Systematic Evolution of Ligands by EXponential enrichment (SELEX; Tuerk & Gold 1990; Ellington & Szostak 1990) takes a random pool of \(\sim 10^{15}\) RNAs (40-mer: \(4^{40} \approx 10^{24}\)possibilities, so sampled randomly) and iterates selection + amplification. After 10-15 rounds the best binders or catalysts are enriched by many orders of magnitude. SELEX has produced:

  • Aptamers binding any small molecule with nM affinity
  • Ribozymes catalysing Diels-Alder cycloaddition, aldol condensation, amino-acid charging
  • An RNA polymerase ribozyme (Bartel & Szostak 1993; Wochner et al. 2011) capable of copying short templates

2.3 Self-Replication: Lincoln & Joyce 2009

In 2009, Tracey Lincoln and Gerald Joyce at Scripps published an RNA system that replicates itself indefinitely without protein enzymes. Two “R3C” ribozymes (E and F) each catalyse a ligation that produces the other from two oligonucleotide substrates (A+B and A′+B′). The cross-catalytic kinetics:

\[ \frac{d[E]}{dt} = k_F [F][A][B] - d_E [E], \quad \frac{d[F]}{dt} = k_E [E][A'][B'] - d_F [F] \]

This system grows exponentially and supports Darwinian evolution under laboratory conditions: variants appear through ligation errors, compete for limiting substrates, and the fittest survive. In serial-transfer experiments, new recombinant ribozymes 3-4 fold faster than the parents emerged after 150 generations.

Fitness and selection equation

For a pool of variants with individual rate constants \(k_i\), the concentration of variant \(i\) follows:

\[ \frac{dx_i}{dt} = k_i x_i - \langle k \rangle x_i = (k_i - \langle k \rangle)x_i \]

This is Fisher's selection equation in disguise: variants grow faster than the pool average increase, slower ones decrease. Combined with mutation (imperfect replication) the system explores the fitness landscape.

2.4 Eigen's Quasispecies & the Error Catastrophe

Manfred Eigen (1971) derived a fundamental constraint on the length of any self-replicating molecule given its per-base fidelity. Let \(q\) be the probability that each base is copied correctly. For a sequence of length \(L\):

\[ Q = q^L = \text{probability of error-free copy} \]

The master sequence dominates the population only if the selective advantage \(s\) (fitness of master vs mutants) satisfies:

\[ s > \frac{1}{Q} = q^{-L} \quad \Leftrightarrow \quad L < \frac{\ln s}{-\ln q} \approx \frac{\ln s}{1-q} \]

Above the error threshold, the master sequence dissolves into a cloud of mutants and genetic information is lost: the error catastrophe. For typical ribozyme copying fidelities \((q \approx 0.99)\) and mild selection \((s \approx 10)\), the maximum length is \(L_\text{max} \approx 230\) bases — limits consistent with the shortest natural ribozymes.

The information crisis

This creates a bootstrap problem: a high-fidelity polymerase ribozyme needs \(L \gtrsim 200\) bases, but copying 200 bases reliably requires \(q \gtrsim 0.995\) — already better than the best in-vitro RNA polymerase. Proposed escapes:

  • Shorter proto-ribozymes functioning in modular cooperative networks
  • Compartmentalisation in lipid vesicles reducing parasite spread
  • Surface-bound replication with sequence-specific catalysis
  • Hypercycles: mutually reinforcing replicator pairs (Eigen & Schuster 1979)

2.5 The Chirality & Instability Problems

The RNA world faces serious chemical challenges:

Cross-inhibition by opposite chirality

Joyce et al. (1984) demonstrated that template-directed polymerisation of D-ribonucleotides is blocked by the presence of L-enantiomers — a chain-terminating effect. Homochirality of the monomer pool is thus a precondition, not a consequence, of the first RNA polymerisation. The Soai autocatalytic amplification (Module 1) or a chiral mineral surface (calcite, quartz) must solve this upstream.

Backbone instability

The 2′-OH of ribose attacks the adjacent phosphodiester, cleaving RNA in neutral aqueous solution with a half-life of \(\sim 4\) years at 25 C and pH 7. In hot hydrothermal environments ( \(T = 85\) C) the half-life collapses to roughly \(\sim 8\) days. This is part of the reason life eventually replaced RNA with the chemically far more stable DNA.

The chicken-and-egg problem

The first RNA molecule capable of self-replication requires a prior pool of activated nucleotides, a folded catalytic structure, and a favourable template — none of which are obvious outcomes of unguided chemistry. Progress depends on finding plausible minimalist systems below the error threshold.

2.6 Pre-RNA Worlds: TNA, PNA, GNA

Given the difficulty of prebiotic ribose synthesis and the chemical frailty of the RNA backbone, several authors have suggested that the first genetic polymer may not have been RNA but a chemically simpler precursor, later replaced by RNA in a “genetic takeover” (Cairns-Smith). Three leading candidates:

  • TNA (threose nucleic acid): Eschenmoser's group (2000) showed that TNA with threose replacing ribose forms stable Watson-Crick duplexes with itself, DNA, and RNA. Threose is four-carbon, arises directly from the formose reaction without requiring a specific stereoselection.
  • PNA (peptide nucleic acid): Nielsen (1991) designed PNA with a peptide-like N-aminoethylglycine backbone. PNA hybridises to DNA/RNA with higher affinity than the natural backbones and is chemically much more stable. PNA templates direct their own ligation from activated dimers.
  • GNA (glycol nucleic acid): Meggers (2005) showed propylene glycol-based nucleic acids, lacking stereogenic centres in the backbone altogether.

These alternative backbones suggest the first living chemistry could have used simpler, more prebiotically plausible components that were eventually overwritten by RNA after the biosynthetic machinery for ribose became available.

Python: Eigen Quasispecies Dynamics

Panel 1 shows a population of 12 sequence types evolving under high copying fidelity (\(q = 0.97\)): the master sequence (0) dominates, its near-mutants form a cloud. Panel 2 drops the fidelity to \(q = 0.58\), past the error threshold: the master sequence loses its grip and the population delocalises — the error catastrophe.

Python
script.py85 lines

Click Run to execute the Python code

Code will be executed with Python 3 on the server

Python: SELEX & Cross-Replicating Ribozymes

Panel 1 runs a simple SELEX-style in-vitro evolution: a pool of 200 ribozyme variants with log-normal activity undergoes selection and amplification for 15 rounds, converging on the best catalyst while diversity collapses. Panel 2 integrates the Lincoln-Joyce cross-replicator ODEs for three parameter sets, showing symmetric exponential growth, failure when one partner is too rare, and slower asymmetric growth under unbalanced kinetics.

Python
script.py97 lines

Click Run to execute the Python code

Code will be executed with Python 3 on the server

References

  1. Kruger, K. et al. (1982). Self-splicing RNA: autoexcision and autocyclization of the ribosomal RNA intervening sequence of Tetrahymena. Cell, 31, 147-157.
  2. Guerrier-Takada, C., Gardiner, K., Marsh, T., Pace, N. & Altman, S. (1983). The RNA moiety of ribonuclease P is the catalytic subunit. Cell, 35, 849-857.
  3. Nissen, P., Hansen, J., Ban, N., Moore, P.B. & Steitz, T.A. (2000). The structural basis of ribosome activity in peptide bond synthesis. Science, 289, 920-930.
  4. Lincoln, T.A. & Joyce, G.F. (2009). Self-sustained replication of an RNA enzyme. Science, 323, 1229-1232.
  5. Wochner, A. et al. (2011). Ribozyme-catalyzed transcription of an active ribozyme. Science, 332, 209-212.
  6. Eigen, M. (1971). Selforganization of matter and the evolution of biological macromolecules. Naturwissenschaften, 58, 465-523.
  7. Eigen, M. & Schuster, P. (1979). The Hypercycle. Springer.
  8. Bartel, D.P. & Szostak, J.W. (1993). Isolation of new ribozymes from a large pool of random sequences. Science, 261, 1411-1418.
  9. Schöning, K.-U. et al. (2000). Chemical etiology of nucleic acid structure: the α-threofuranosyl-(3′->2′) oligonucleotide system. Science, 290, 1347-1351.
  10. Nielsen, P.E., Egholm, M., Berg, R.H. & Buchardt, O. (1991). Sequence-selective recognition of DNA by strand displacement with a thymine-substituted polyamide. Science, 254, 1497-1500.
  11. Joyce, G.F. et al. (1984). Chiral selection in poly(C)-directed synthesis of oligo(G). Nature, 310, 602-604.
  12. Gilbert, W. (1986). The RNA world. Nature, 319, 618.