Module 0: Evolution & Phylogenomics

Proboscidea is one of the most spectacular mammalian radiations on record: from the otter-sized Moeritherium wading through Eocene swamps 60 million years ago, through the shovel-tusked gomphotheres, the cold-adapted woolly mammoth, and the largest terrestrial mammal ever to walk the planet — Palaeoloxodon antiquus — the order has repeatedly invaded every continent except Antarctica and Australia. This module reconstructs their phylogenomic history, derives coalescent TMRCA for Loxodonta and Elephas, and explains why 20 retrotransposed copies of the tumour-suppressor gene TP53 resolve Peto’s paradox in the elephant lineage.

1. The Proboscidean Radiation (60 Mya → Holocene)

The earliest unambiguous proboscidean, Eritherium azzouzorum from the Paleocene of Morocco, is only slightly younger than the K–Pg boundary. By the late Eocene (~37 Mya), Moeritherium— an amphibious, tapir-like animal roughly 230 kg — already exhibits the diagnostic second-incisor tusks and retracted nasal bones presaging the trunk. Stable isotope analyses of Moeritherium enamel (Liu et al. 2008) indicate an aquatic/marsh lifestyle, suggesting the trunk originated as a snorkel rather than as a feeding appendage.

The Oligocene and Miocene saw an explosive radiation of stem proboscideans: Deinotheriidae (with downward-curving mandibular tusks and a lineage persisting into the Pliocene), Gomphotheriidae (bunodont cheek teeth, four tusks, pan-continental distribution), and the Mammutidae (mastodons). These groups colonised Eurasia, North America, and — uniquely among terrestrial mammals of their size — repeatedly crossed between continents via Beringia and the Gomphothere Interchange of the Great American Biotic Interchange.

Elephantidae emerge

True elephantids — characterised by high-crowned (hypsodont) lamellar molars with horizontal tooth-replacement conveyor belts — appear in the late Miocene of Africa. The key transitional genus is Primelephas, whose fossils from the Lukeino Formation (Kenya, ~6 Mya) bracket the last common ancestor of the three extant elephant genera. From Primelephas three crown lineages diverge: Loxodonta (African bush and forest elephants), Elephas (Asian elephant), and the extinct Mammuthus (the mammoths).

Palaeoloxodon antiquus, the straight-tusked elephant of Pleistocene Europe and the Levant, is the largest terrestrial mammal with solid skeletal evidence — adult males exceeded 13 tonnes and 4 m shoulder height. Ancient DNA (Meyer et al. 2017) unexpectedly placed Palaeoloxodonnested within the African forest elephant clade (Loxodonta cyclotis), overturning previous morphology-based placements and revealing substantial inter-lineage gene flow before human arrival.

Proboscidean Radiation (schematic, not branch-length scaled)

60 Mya5040302010PresentEritherium / stem ProboscideaMoeritherium (amphibious)DeinotheriidaeGomphotheriidaeMammutidae (mastodons)Primelephas (~6 Mya)LoxodontaElephasMammuthus (ext. 4 kya)Palaeoloxodon (ext. 30 kya)PaleoceneEoceneOligoceneMiocenePlio.Pleist.

2. Phylogenomic Resolution of the Crown Elephantidae

Shoshani & Tassy (1996) reviewed >400 proboscidean fossil taxa and proposed an elephantid topology based on morphology. The phylogenomic era began with Rohland et al. (2007, 2010), who sequenced nuclear loci from mammoth, mastodon, Asian and both African elephant lineages, demonstrating a deep genetic split between Loxodonta africana (savannah) and L. cyclotis (forest) of 2.6–5.6 Mya — comparable to the human/chimpanzee divergence and firmly establishing the forest elephant as a distinct species rather than a subspecies.

Roca et al. (2014) and Palkopoulou et al. (2018) subsequently performed genome-scale analyses across Loxodonta, Elephas, Mammuthus, Palaeoloxodon and mastodon, resolving the tree as:

\[(\text{Mastodon},\ (\text{Loxodonta},\ (\text{Palaeoloxodon},\ (\text{Elephas},\ \text{Mammuthus}))))\]

with substantial ancient introgression between Palaeoloxodon and L. cyclotis

Genome-wide introgression statistics (D, f3, f4-ratio) show that the straight-tusked elephant acquired up to ~40% of its genome from a forest-elephant-like source, a striking example of reticulated evolution within megaherbivores. Similar introgression was later detected between woolly and Columbian mammoths.

Coalescent theory for elephant TMRCA

Under the neutral Kingman coalescent, if a sample of n chromosomes is drawn from a panmictic diploid population of constant effective size Ne, the time to the most recent common ancestor (TMRCA) is a sum of independent exponential waiting times:

\[T_{\mathrm{MRCA}} \;=\; \sum_{k=2}^{n} T_k, \qquad T_k \sim \mathrm{Exp}\!\left(\frac{k(k-1)}{4 N_e}\right)\]

giving E[TMRCA] = 4Ne(1 − 1/n) generations

For elephants, generation time is long (\(\tau_g \approx 25\) years) and Ne estimates vary: Rohland (2010) inferred NeLoxodonta ≈ 40 000, NeElephas ≈ 30 000 (with Pleistocene bottlenecks). The simulation below samples the joint coalescent for both lineages including a Late Pleistocene bottleneck for Elephas.

3. Ivory Genome Wars & Forensic Geo-assignment

Every year roughly 20 000 African elephants are killed for their tusks. Wasser et al. (2015, Science) pioneered a stable-isotope and microsatellite-based geographic-origin mapping of seized ivory: large ivory shipments are typically drawn from two or three concentrated poaching hotspots rather than distributed continent-wide. The assignment framework uses a composite likelihood over carbon (\(\delta^{13}\mathrm{C}\)), nitrogen (\(\delta^{15}\mathrm{N}\)), strontium (\(^{87}\mathrm{Sr}/^{86}\mathrm{Sr}\)), and hydrogen (\(\delta^{2}\mathrm{H}\)) isotope ratios plus 16 microsatellite loci.

\[\ell(\mathbf{x} \mid g) = \sum_{m=1}^{M}\left[-\tfrac{1}{2}\frac{(x_m - \mu_m(g))^2}{\sigma_m^2(g)}\right] + \sum_{\ell=1}^{L} \log P_{\ell}(a_{\ell} \mid g)\]

x = isotope vector, g = candidate geographic grid cell, a = microsatellite alleles

Hart et al. (2015) and Wasser et al. (2018) extended this approach using genome-wide SNP reference panels, achieving geographic assignment with\(\sim 300\) km precision for savannah elephants and finer resolution for forest elephants whose population structure is deeper. Elephantine ivory forensics is now admissible as evidence in U.S. federal wildlife-trafficking prosecutions.

4. TRPV1 Copy Number & African Heat Adaptation

TRPV1, the capsaicin and noxious-heat receptor (activated above ~43°C), normally occurs as a single gene in mammals. Weissenböck et al. (2010, 2012) showed that Loxodonta africanacarries five functional TRPV1 paralogs(TRPV1a–e), presumably duplicated in the early Loxodonta stem after the split from Elephas. Elephas maximus retains only the ancestral single copy.

The expansion is hypothesised to be an adaptive response to sustained savannah heat stress: higher copy number allows tissue-specific regulation of heat and pain sensation, particularly in the trunk tip and pharyngeal mucosa. The elephant skin-cooling system (covered in detail in M3) relies on exquisite thermal discrimination, and TRPV1 dosage may tune behavioural responses such as mud-wallowing and dust-bathing thresholds.

\[R_{\mathrm{heat}}(T) \;=\; \sum_{i=1}^{N_{\mathrm{TRPV1}}} \frac{1}{1 + e^{-\beta_i (T - T_{0,i})}}\]

summed receptor activation across N paralogs with tunable \(\beta_i, T_{0,i}\)

5. TP53 Retrogenes & Peto’s Paradox

Peto (1977) noted that if cancer were simply a stochastic accumulation of somatic mutations, then large long-lived animals — with orders of magnitude more cells and cell divisions per lifetime — should suffer vastly higher cancer rates than small mammals. Empirically they do not. This discrepancy is Peto’s paradox.

Under the Armitage-Doll multistage model, a cell requires k independent driver mutations to transform. With mutation rate \(\mu\) per division and D divisions in a lifetime, the probability a given cell becomes malignant is approximately

\[p_{\mathrm{cell}} \;\approx\; \frac{(\mu D)^k}{k!}\]

and the organismal lifetime cancer probability is\(P_{\mathrm{org}} = 1 - (1 - p_{\mathrm{cell}})^{N_{\mathrm{cells}}}\). A 6-tonne elephant has ~6 × 1015 cells vs. 3 × 1013for a human — 200× more. Without compensation, elephant cancer incidence should be essentially 100%.

Abegglen et al. (2015, JAMA) resolved the paradox by demonstrating that Loxodonta africana carries ~20 retrotransposed copies of TP53(“LA-TP53” retrogenes) compared with a single copy in humans. Elephant lymphocytes exposed to ionising radiation or doxorubicin show roughly twice the apoptotic response of human cells, effectively culling incipient cancer cells before they proliferate. The retrogenes are transcribed, translated, and functionally active, though several lack introns and appear to act dominantly to sensitise the apoptotic machinery.

\[p_{\mathrm{cell}}^{\mathrm{(elephant)}} \;=\; \frac{(\mu D)^k}{k!} \cdot \eta^{(n_{\mathrm{TP53}} - 1)}, \quad \eta \approx 0.75\]

each extra TP53 paralog lowers effective malignant-transformation probability by factor \(\eta\)

With n = 20 the per-cell cancer probability drops by a factor\(\eta^{19} \approx 4 \times 10^{-3}\), more than compensating for the 200× increase in cell count. Cancer incidence in zoo elephant necropsies is ~5% lifetime — lower than the 11%–25% reported in humans.

5b. Molecular Mechanism of Elephant TP53 Retrogenes

The ~20 TP53 retrogenes in Loxodonta (often called LA-TP53, for “L. africana TP53”) arose via LINE-1-mediated retrotransposition of the ancestral TP53 mRNA. Sulak et al. (2016) reconstructed the timing of these retrogene births by dating synonymous-site divergence between paralogs: the wave of duplications coincided with the onset of gigantism in the proboscidean lineage during the mid-Miocene (~15 Mya), consistent with strong positive selection for tumour suppression under increasing body-size pressure.

Biochemically, the retrogene products dimerise with the canonical full-length TP53, priming the apoptotic “trigger” at lower DNA-damage thresholds. Vazquez et al. (2018) further showed that a subset of the retrogenes encode a shortened “LIF6” (leukaemia inhibitory factor 6) transcript that localises to mitochondria and directly permeabilises the outer membrane in response to TP53 activation, providing an auxiliary apoptotic pathway absent in smaller mammals.

The net phenotype is that elephant cells preferentially undergo apoptosis rather than repair in response to DNA damage — a risk-averse strategy favoured when mutation-bearing clones carry the catastrophic risk of metastatic cancer in an individual with \(10^{15}\) cells. Mice and humans instead rely more heavily on repair pathways; the evolutionary trade-off mirrors the classical “disposable soma” model of senescence.

\[\frac{dN_{\text{damaged}}}{dt} = \mu D \cdot N_{\text{healthy}} - (k_{\text{repair}} + n_{\mathrm{TP53}} k_{\text{apop}}) N_{\text{damaged}}\]

steady-state reservoir of damaged cells shrinks linearly with TP53 dosage at the apoptotic rate term

6. Mitochondrial DNA, Bottlenecks & Matrilineal Structure

Because elephants live in strict matriarchal family units, mitochondrial DNA shows deep geographic structure: females rarely disperse, so mtDNA lineages are largely local while nuclear markers (via male dispersal) homogenise over longer timescales. Debruyne et al. (2003, 2007) identified two deeply divergent mtDNA clades (F and S) within Loxodonta africana with coalescence\(>\) 1 Mya — older than the nuclear split — likely reflecting incomplete lineage sorting and ancient introgression between forest and savannah populations.

Ancient DNA from mammoth specimens (Lynch et al. 2015; van der Valk 2021) revealed that the Pleistocene woolly mammoth (M. primigenius) underwent multiple severe bottlenecks, with Wrangel Island’s final refugium population (< 500 individuals) accumulating deleterious mutations indicative of terminal genomic meltdown before extinction ~4000 years ago.

The coalescent waiting time for mitochondrial lineages is a factor of four smaller than for nuclear autosomal lineages (haploid and maternally inherited, Ne,mt = Ne/4), making mtDNA a high-resolution probe of recent demographic history and an ideal tracer for matriline reconstruction within extant herds.

6b. Contemporary Population Structure

Current IUCN Red List assessments (2021) separate Loxodonta africana(savannah elephant, Endangered) and Loxodonta cyclotis (forest elephant, Critically Endangered) in recognition of the deep genetic divergence first documented by Rohland et al. Savannah elephants now number ~350 000 individuals; forest elephants ~100 000, with both populations declining ~30&percnt; and ~86&percnt; respectively over recent decades (Thouless et al. 2016; Poulsen et al. 2017).

Elephas maximus is sub-divided into three subspecies — E. m. maximus (Sri Lankan), E. m. indicus (mainland Asian), and E. m. sumatranus (Sumatran) — plus a reduced Bornean population of debated taxonomic status. Fernando et al. (2003) identified two mtDNA clades (α and β) with coalescence ~ 1.5 Mya, reflecting Pleistocene vicariance across the Sundaland corridor. Genome-wide STRUCTURE analyses confirm that mainland and Sri Lankan populations are genetically distinguishable at\(F_{ST} \approx 0.15\).

Population viability analyses (PVAs) for the smallest remnant populations — e.g. the Sumatran Way Kambas herd (~250 individuals) — project extinction probabilities above 50&percnt; within 100 years in the absence of gene flow augmentation. The long generation time (\(\tau_g = 25\) yr) and low reproductive rate (one calf every 4–5 years) compound the vulnerability.

7. Woolly Mammoth Genomics & De-extinction Horizons

Ancient-DNA sequencing from permafrost-preserved tissue has produced high-coverage woolly mammoth (Mammuthus primigenius) genomes (Palkopoulou et al. 2015; van der Valk 2021). The comparison with extant Asian elephant genomes reveals ~1.4 million single-nucleotide differences — of which roughly 2000 are candidate functional substitutions concentrated in genes for keratin, hair follicle development, circadian rhythm, insulin signalling, and lipid metabolism. Lynch et al. (2015) identified a suite of cold-adaptation fixations including EP300, PRPF8, and the haemoglobin β-chain variant that shifts the oxygen-binding P50 to lower temperatures.

The haemoglobin adaptation is particularly elegant: Campbell et al. (2010) showed that three amino-acid substitutions in mammoth β-globin reduce the enthalpy of oxygen binding (\(\Delta H\)) so that erythrocytes release oxygen efficiently even at near-freezing peripheral temperatures. Re-expressing the mammoth haemoglobin in vitro confirmed the biochemistry.

The Wrangel Island population, the last mammoths to survive until ~4000 years before present, shows genomic hallmarks of inbreeding collapse: excess non-synonymous variants, loss of olfactory receptor function, and heterozygosity one-tenth that of mainland populations. This terminal “meltdown” provides a cautionary template for modern small populations of African forest elephant.

8. Molecular Clock Calibration for Proboscidea

Node-age estimation in the elephant phylogeny relies on fossil-calibrated relaxed molecular clocks. The canonical calibration points are:

  • Paenungulata crown (Proboscidea + Sirenia + Hyracoidea): 65 Mya, soft upper bound.
  • Proboscidea stem: Eritherium, 60 Mya.
  • Elephantidae crown (mastodon split): 25–30 Mya.
  • Mammoth-Asian elephant split: 6.0 Mya (Rohland 2010).
  • Loxodonta africana / cyclotis split: 2.6–5.6 Mya.

Using a strict molecular clock on autosomal intergenic SNPs, the per-site substitution rate in elephantids is\(\mu_{\text{auto}} \approx 1.2 \times 10^{-9}\) per site per year — roughly half the rodent rate, reflecting long generation time. The mitochondrial rate is about 20× higher, making mtDNA the workhorse for resolving sub-million-year splits.

\[T_{\mathrm{split}} \;=\; \frac{d}{2\mu}\;\left(1 + \frac{\sigma_{\mathrm{ILS}}^2}{d^2}\right)^{-1/2}\]

d = observed genetic distance, \(\sigma_{\text{ILS}}\) = incomplete lineage sorting correction

Bayesian dating packages (MCMCtree, BEAST) implementing the above with relaxed clocks and fossil priors now routinely yield 95&percnt; HPD intervals below ±10&percnt; of point estimates for Miocene-bounded elephantid nodes.

Simulation 1: Kingman Coalescent for Loxodonta & Elephas

Monte Carlo simulation of the neutral coalescent within each lineage, joined at the Primelephas split (~6 Mya), with a Late-Pleistocene bottleneck episode for Elephas maximus. Outputs include within-lineage TMRCA histograms, the joint cross-lineage TMRCA distribution, expected waiting times E[Tk], and one realised lineage-through-time plot.

Python
script.py137 lines

Click Run to execute the Python code

Code will be executed with Python 3 on the server

Simulation 2: TP53 Dosage & Peto’s-Paradox Cancer-Risk Model

Armitage-Doll multistage carcinogenesis with body-mass-scaled cell count, lifespan-scaled mitotic divisions, and explicit TP53 dose response (\(\eta = 0.75\) per extra copy). Compares naive (single-copy) vs. observed copy number across mouse, human, elephant, and bowhead whale, and traces the dose-response curve that collapses the Peto scatter onto the Loxodonta point.

Python
script.py153 lines

Click Run to execute the Python code

Code will be executed with Python 3 on the server

9. Synthesis & Open Problems

Proboscidean evolution is a textbook example of iterative gigantism: across the 60 Myr record, many independent lineages — deinotheres, gomphotheres, mastodons, elephantids — converged on body masses above 5 tonnes. Each successful wave encountered the same physiological frontier: heat dissipation (Module 3), bone loading (Module 1), and reproductive constraints (Module 7). The recurring evolution of lamellar molars in Elephantidae is interpreted as an adaptation to C4 grass expansion during late-Miocene aridification.

Key outstanding questions include (i) the precise timing of TP53 retrogene dosage increases relative to body-mass thresholds along the proboscidean stem; (ii) how much nuclear vs. mitochondrial introgression between Loxodonta, Palaeoloxodon and Mammuthus persisted through the Pleistocene; (iii) whether Elephas experienced a cryptic third species-level radiation on Sundaland; and (iv) the biomechanical and neurological consequences of the forest/savannah split for the divergent ear, tusk, and trunk-tip morphologies of the two Loxodonta lineages.

The simulations below make these questions quantitative: the coalescent framework allows formal hypothesis tests on TMRCA given any proposed Ne trajectory, and the TP53 dose-response model permits prediction of cancer incidence under hypothetical gene-editing interventions — whether for elephant welfare or, more controversially, for de-extinction of woolly mammoth-derived cold-adapted elephants.

Key References

• Shoshani, J. & Tassy, P. (1996). The Proboscidea: Evolution and Palaeoecology of Elephants and their Relatives. Oxford University Press.

• Rohland, N. et al. (2010). “Genomic DNA sequences from mastodon and woolly mammoth reveal deep speciation of forest and savanna elephants.” PLoS Biology, 8, e1000564.

• Roca, A. L. et al. (2014). “Elephant natural history: a genomic perspective.” Annual Review of Animal Biosciences, 3, 139–167.

• Palkopoulou, E. et al. (2018). “A comprehensive genomic history of extinct and living elephants.” PNAS, 115, E2566–E2574.

• Meyer, M. et al. (2017). “Palaeogenomes of Eurasian straight-tusked elephants challenge the current view of elephant evolution.” eLife, 6, e25413.

• Abegglen, L. M. et al. (2015). “Potential mechanisms for cancer resistance in elephants and comparative cellular response to DNA damage in humans.” JAMA, 314, 1850–1860.

• Weissenböck, N. M. et al. (2010). “Thermal windows on the body surface of African elephants (Loxodonta africana) studied by infrared thermography.” Journal of Thermal Biology, 35, 182–188.

• Wasser, S. K. et al. (2015). “Genetic assignment of large seizures of elephant ivory reveals Africa’s major poaching hotspots.” Science, 349, 84–87.

• Hart, J. et al. (2015). “Forensic genomics of ivory: progress and limitations.” Conservation Genetics Resources, 7, 345–351.

• van der Valk, T. et al. (2021). “Million-year-old DNA sheds light on the genomic history of mammoths.” Nature, 591, 265–269.

• Peto, R. (1977). “Epidemiology, multistage models and short-term mutagenicity tests.” Cold Spring Harbor Conferences on Cell Proliferation, 4, 1403–1428.

• Sulak, M. et al. (2016). “TP53 copy number expansion is associated with the evolution of increased body size and an enhanced DNA damage response in elephants.” eLife, 5, e11994.

• Vazquez, J. M. et al. (2018). “A zombie LIF gene in elephants is upregulated by TP53 to induce apoptosis in response to DNA damage.” Cell Reports, 24, 1765–1776.

• Lynch, V. J. et al. (2015). “Elephantid genomes reveal the molecular bases of woolly mammoth adaptations to the Arctic.” Cell Reports, 12, 217–228.

• Campbell, K. L. et al. (2010). “Substitutions in woolly mammoth hemoglobin confer biochemical properties adaptive for cold tolerance.” Nature Genetics, 42, 536–540.

• Fernando, P. et al. (2003). “DNA analysis indicates that Asian elephants are native to Borneo and are therefore a high priority for conservation.” PLoS Biology, 1, e6.

• Thouless, C. R. et al. (2016). African Elephant Status Report 2016: An update from the African Elephant Database. IUCN/SSC African Elephant Specialist Group.

• Poulsen, J. R. et al. (2017). “Poaching empties critical central African wilderness of forest elephants.” Current Biology, 27, R134–R135.

• Palkopoulou, E. et al. (2015). “Complete genomes reveal signatures of demographic and genetic declines in the woolly mammoth.” Current Biology, 25, 1395–1400.

• Debruyne, R. et al. (2003). “Mitochondrial cytochrome b of the Lyakhov mammoth confirms the presence of two haplogroups in the late Pleistocene.” Comptes Rendus Biologies, 326, 687–697.