Evolution & Domestication
From Felis silvestris lybica to Felis catus — phylogenetics, population genetics, and the feline genome
1. From Wildcat to House Cat
The domestic cat (Felis catus) descends from the African wildcat (Felis silvestris lybica), a small solitary felid native to the Near East, North Africa, and the Arabian Peninsula. Unlike virtually every other domestic animal, cats were not deliberately selected for any human purpose — they self-domesticated.
The Commensal Pathway
Approximately 10,000 years ago, the Neolithic agricultural revolution in the Fertile Crescent created the ecological niche that drew wildcats to human settlements. Grain stores attracted rodents (Mus musculus); wildcats followed the mice. Humans tolerated and eventually encouraged the cats' presence because of their rodent control services. This “commensal pathway” to domestication contrasts sharply with the directed pathway used for dogs, cattle, and horses.
Key Archaeological Evidence
- • Cyprus burial (Vigne et al., 2004): A cat was deliberately buried alongside a human at Shillourokambos, ~9,500 years ago. Since there are no native felids on Cyprus, the cat must have been transported by boat — implying a valued human-cat relationship.
- • Egyptian domestication (Driscoll et al., 2007): Mitochondrial DNA analysis of 979 cats showed all domestic cats cluster within clade IV (F. s. lybica), with genetic signatures pointing to Near Eastern origin.
- • Chinese millet farmers (Hu et al., 2014): Isotopic analysis of 5,300-year-old cat bones from Quanhucun showed cats were eating millet-fed rodents, confirming the commensal relationship.
Population Genetics of Domestication
The transition from wild to domestic involved several population genetic processes:
Founder Effect
When a small number of wildcats established the founding population around human settlements, the allele frequencies in this subpopulation differed from the source population simply by chance. If the founding group contained \(N_f\) individuals drawn from a population with allele frequency \(p\), the expected variance in allele frequency is:
\[\text{Var}(p_f) = \frac{p(1-p)}{2N_f}\]
With a founding population of perhaps \(N_f = 20\text{--}50\) individuals, the variance is substantial, leading to random fixation or loss of alleles unrelated to selection.
Genetic Drift in Small Populations
The probability that a neutral allele with current frequency \(p\) eventually reaches fixation in a population of effective size \(N_e\) is simply:
\[P(\text{fixation}) = p\]
The expected time to fixation (given it occurs) is:
\[\bar{t}_{\text{fix}} = -\frac{4N_e(1-p)}{p} \ln(1-p)\]
In small populations associated with early agricultural villages (\(N_e \sim 50\text{--}200\)), drift operates rapidly, potentially fixing coat colour variants and behavioural alleles (e.g., reduced fear of humans) within hundreds of generations.
Relaxed Selection
In the commensal environment, predation pressure on cats was reduced and food was abundant (mice). Traits under strong purifying selection in the wild — such as cryptic coloration, extreme neophobia, and solitary behaviour — experienced relaxed selection. This allowed previously deleterious variants (non-agouti coat, increased tameness) to persist and spread. Molecular signatures of selection on the domestic cat genome include regions near genes for neural crest cell development (linked to the “domestication syndrome”).
2. Feline Phylogenetics
The family Felidae comprises 37 extant species arranged in 8 lineages that diverged from a common ancestor approximately 11 million years ago (MYA) in Asia (Johnson et al., 2006; Li et al., 2016).
Molecular Clock Analysis
Divergence times are estimated using molecular clock methods applied to mitochondrial (cytochrome b, ND5) and nuclear gene sequences. The molecular clock assumes that mutations accumulate at an approximately constant rate \(\mu\) per site per year. The divergence time between two lineages is:
\[T = \frac{d}{2\mu}\]
where \(d\) is the observed genetic distance (substitutions per site) and the factor of 2 accounts for divergence along both lineages since their common ancestor. For feline cytochrome b, the calibrated rate is approximately\(\mu \approx 2\text{--}4 \times 10^{-8}\) substitutions/site/year.
Coalescent Theory
Modern phylogenetic inference uses coalescent theory to account for incomplete lineage sorting (ILS) — the phenomenon where gene trees differ from the species tree due to ancestral polymorphism. For two gene copies in a population of effective size \(N_e\), the expected coalescence time is:
\[E[T_{\text{MRCA}}] = 2N_e \text{ generations}\]
The probability that two lineages fail to coalesce before a speciation event (causing ILS) depends on the ratio of population size to divergence time. For rapid radiations (such as the felid diversification ~8–10 MYA), ILS is common. For \(n\) gene copies:
\[P(\text{no coalescence in } t \text{ gens}) = \exp\!\left(-\frac{\binom{n}{2}t}{N_e}\right)\]
The Eight Felid Lineages
Panthera Lineage (7 spp)
Divergence: ~10.8 MYA
Lion, tiger, leopard, jaguar, snow leopard
Bay Cat Lineage (3 spp)
Divergence: ~9.4 MYA
Bay cat, Asian golden cat, marbled cat
Caracal Lineage (3 spp)
Divergence: ~8.5 MYA
Caracal, African golden cat, serval
Ocelot Lineage (8 spp)
Divergence: ~8.0 MYA
Ocelot, margay, oncilla, and 5 others
Lynx Lineage (4 spp)
Divergence: ~7.2 MYA
Eurasian, Canada, Iberian lynx, bobcat
Puma Lineage (3 spp)
Divergence: ~6.7 MYA
Puma, jaguarundi, cheetah
Leopard Cat Lineage (5 spp)
Divergence: ~5.9 MYA
Leopard cat, fishing cat, Pallas cat, and 7 others
Felis Lineage (5 spp)
Divergence: ~3.4 MYA
Wildcat, sand cat, black-footed cat, jungle cat, Chinese mountain cat
The domestic cat belongs to the Felis lineage, the most recently diverged group. Within this lineage, F. catus and F. s. lybica are so closely related that many taxonomists consider the domestic cat a subspecies rather than a separate species. The genetic distance between them is only ~0.2% at the nuclear level.
3. Population Genetics of Breeds
There are approximately 73 recognised cat breeds, but most were established less than 150 years ago during the Victorian cat fancy movement. This is dramatically different from dog breeds, many of which have centuries-old histories tied to specific working functions.
Heterozygosity Decay Under Drift
When a breed is founded from a small number of individuals and maintained as a closed population, heterozygosity declines each generation due to genetic drift. The expected heterozygosity after \(t\) generations is:
\[H_t = H_0 \left(1 - \frac{1}{2N_e}\right)^t\]
where \(H_0\) is the initial heterozygosity and \(N_e\) is the effective population size. The half-life of heterozygosity (time to lose 50%) is:
\[t_{1/2} = \frac{\ln 2}{\ln\!\left(\frac{2N_e}{2N_e - 1}\right)} \approx 2N_e \ln 2 \approx 1.39 N_e\]
Breed Bottlenecks
Many cat breeds experienced extreme genetic bottlenecks at foundation:
Persian
Founder population estimated at ~50 individuals. Effective population size\(N_e \approx 30\text{--}40\). High prevalence of polycystic kidney disease (PKD1, autosomal dominant, ~38% carrier frequency in some populations). Single C→A transversion in exon 29 of PKD1.
Maine Coon
North American breed with moderate genetic diversity. However, hypertrophic cardiomyopathy (HCM) affects ~30% due to a single missense mutation (A31P) in the MYBPC3 gene (myosin-binding protein C).
Siamese
Originally imported from Thailand (Siam) in the 1880s. Modern show Siamese have diverged significantly from the traditional type. Prone to amyloidosis (hepatic), progressive retinal atrophy, and the cs tyrosinase allele (Module 6) is fixed in the breed.
Scottish Fold
All descended from a single farm cat (Susie, 1961, Scotland). The fold ear is caused by a dominant mutation in TRPV4 (previously mapped toFd). Homozygotes (Fd/Fd) develop severe osteochondrodysplasia — a strong argument against Fd/Fd breeding.
Inbreeding Coefficient
The inbreeding coefficient \(F\) measures the probability that two alleles at a locus are identical by descent:
\[F_t = 1 - \left(1 - \frac{1}{2N_e}\right)^t \approx 1 - e^{-t/(2N_e)}\]
For a breed with \(N_e = 40\) maintained for 50 generations (~100 years with 2-year generation interval): \(F_{50} \approx 1 - e^{-50/80} \approx 0.47\). This extreme inbreeding coefficient (compared to \(F < 0.05\) in outbred populations) explains the high prevalence of recessive genetic diseases in pedigreed cats.
4. The Feline Genome
The domestic cat genome was first sequenced in 2007 (Pontius et al.) from an Abyssinian cat named Cinnamon. Subsequent assemblies have refined the picture:
2.7 Gb
Genome size
19,493
Protein-coding genes
38
Chromosomes (2n)
~60%
Repetitive elements
Comparative Genomics
Cat
2n = 38
2.7 Gb
~19,500 genes
Dog
2n = 78
2.4 Gb
~19,300 genes
Human
2n = 46
3.1 Gb
~20,400 genes
Notable Gene Losses
Several pseudogenizations reveal the molecular basis of distinctly feline traits:
- • TAS1R2 (sweet taste receptor): A 247-bp microdeletion renders this gene non-functional. Cats cannot taste sweet — they are the only known mammalian order with a universally pseudogenised sweet receptor. This is consistent with their obligate carnivore ecology: no selective advantage to detecting sugars.
- • GCKR (glucokinase regulatory protein): Loss of this hepatic enzyme regulator contributes to the cat's inability to efficiently metabolise carbohydrates. Feline hepatocytes have constitutively active glucokinase, driving continuous gluconeogenesis (Module 4).
- • SLC6A18 (renal amino acid transporter): Pseudogenised in cats, potentially explaining their high protein requirement and susceptibility to amino acid imbalances.
Notable Gene Expansions
- • CYP2A6 (cytochrome P450 family): Expanded in cats, providing enhanced detoxification of plant alkaloids. This compensates for their reduced UDP-glucuronosyltransferase activity and helps explain why cats are sensitive to some drugs (paracetamol) but resistant to others.
- • Vomeronasal receptor genes (V1R): Cats retain ~30 functional V1R genes (vs ~5 in humans), consistent with their reliance on pheromone detection via the vomeronasal organ (Module 5).
- • Olfactory receptor genes: ~800 functional OR genes (~3% of the genome), supporting their excellent sense of smell.
5. Phylogenetic Tree of Felidae
The phylogeny below shows the 8 major felid lineages with approximate divergence times, based on Johnson et al. (2006) and Li et al. (2016).
6. Simulations
Population Genetics & Phylogenetic Analysis
This simulation produces three panels: (1) heterozygosity decay under genetic drift for different effective population sizes, simulating breed formation, (2) stochastic simulation of allele frequency drift in a small breed population, and (3) a phylogenetic distance matrix heatmap for the 8 felid lineages.
Click Run to execute the Python code
Code will be executed with Python 3 on the server
References
- Driscoll, C. A. et al. (2007). “The Near Eastern origin of cat domestication.” Science, 317(5837), 519–523.
- Vigne, J.-D. et al. (2004). “Early taming of the cat in Cyprus.” Science, 304(5668), 259.
- Johnson, W. E. et al. (2006). “The Late Miocene radiation of modern Felidae: A genetic assessment.” Science, 311(5757), 73–77.
- Li, G., Davis, B. W., Eizirik, E., & Murphy, W. J. (2016). “Phylogenomic evidence for ancient hybridization in the genomes of living cats (Felidae).” Genome Research, 26(1), 1–11.
- Hu, Y. et al. (2014). “Earliest evidence for commensal processes of cat domestication.” Proceedings of the National Academy of Sciences, 111(1), 116–120.
- Pontius, J. U. et al. (2007). “Initial sequence and comparative analysis of the cat genome.” Genome Research, 17(11), 1675–1689.
- Lyons, L. A. (2010). “Feline genetics: Clinical applications and genetic testing.” Topics in Companion Animal Medicine, 25(4), 203–212.
- Li, X. et al. (2005). “Pseudogenization of a sweet-receptor gene accounts for cats' indifference toward sugar.” PLoS Genetics, 1(1), e3.
- Montague, M. J. et al. (2014). “Comparative analysis of the domestic cat genome reveals genetic signatures underlying feline biology and domestication.” Proceedings of the National Academy of Sciences, 111(48), 17230–17235.
- Gandolfi, B. et al. (2013). “A splice variant in KIT is associated with the Dominant White and White Spotting phenotypes in the domestic cat.” Animal Genetics, 44(5), 586–603.
- Meurs, K. M. et al. (2005). “A cardiac myosin binding protein C mutation in the Maine Coon cat with familial hypertrophic cardiomyopathy.” Human Molecular Genetics, 14(23), 3587–3593.