Module 1: Flower Development & the ABC Model
In 1991 Enrico Coen and Elliot Meyerowitz proposed a three-letter code for flower identity that still dominates the field. The ABC model - now expanded to ABCDE with the quartet hypothesis - explains why there are four concentric whorls, why homeotic mutants convert whole whorls into other organs, and why mutants in agamous produce indeterminate double flowers prized by horticulturists. This module develops the model rigorously, connects it to upstream meristem-identity genes (LEAFY, APETALA1), and then places flowering-time control (florigen, vernalization) upstream of the identity genes.
1. From Homeotic Mutants to the ABC Code
Three classes of homeotic mutations affect Arabidopsis thaliana flowers:
- Class A mutants (e.g. apetala1, apetala2): whorl 1 becomes carpel-like, whorl 2 becomes stamen-like (i.e. whorls acquire whorl-4 and whorl-3 identity).
- Class B mutants (e.g. apetala3, pistillata): whorl 2 becomes sepal, whorl 3 becomes carpel.
- Class C mutants (e.g. agamous): whorl 3 becomes petal, whorl 4 becomes sepal, and the meristem is indeterminate - producing more whorls of petals (the "double flower" of roses and peonies).
The parsimonious interpretation: three gene activities A, B, C are expressed in overlapping domains along the radial axis, specifying:
\[ \text{Whorl 1: A} \rightarrow \text{sepal},\quad \text{Whorl 2: A+B} \rightarrow \text{petal} \]
\[ \text{Whorl 3: B+C} \rightarrow \text{stamen},\quad \text{Whorl 4: C} \rightarrow \text{carpel} \]
An additional rule: A and C are mutually antagonistic. If A is lost, C spreads into whorls 1-2; if C is lost, A spreads into whorls 3-4. This single constraint is sufficient to predict every homeotic phenotype above.
1.1 The ABC Diagram
Below, the three domains are drawn as overlapping rectangles across the four whorls, with the organ identity determined by which combination of letters is present.
2. The Quartet Model & MADS-Box TFs
All ABCDE genes except APETALA2 are MADS-boxtranscription factors. The acronym comes from the founding members - MCM1 (yeast), AGAMOUS (Arabidopsis), DEFICIENS (snapdragon), and SRF (humans). The MADS domain is a conserved ~60-residue N-terminal DNA-binding module that recognises a CArG box (CC(A/T)6GG) as a dimer.
Theissen's quartet hypothesis (2001) posits that floral organ identity is specified not by pairs but by tetrameric protein complexes bound to two CArG boxes that loop chromatin. Each whorl has its own tetramer composition:
Sepal (W1)
(AP1)2 (SEP)2
Petal (W2)
AP1 AP3 PI SEP
Stamen (W3)
AP3 PI AG SEP
Carpel (W4)
(AG)2 (SEP)2
2.1 D-class (ovule identity)
SEEDSTICK (STK, formerly AGL11) and SHATTERPROOF (SHP1/2) are D-class MADS-box genes required specifically for ovule identity within the carpel. A quadruple stk shp1 shp2 ag mutant converts ovules into leaf-like organs.
2.2 A Quantitative Identity Rule
We can formalise the ABC logic as a majority-wins function on the tetramer composition. Let \(A, B, C, E \in [0,1]\) be the local expression levels. The organ identity score for each whorl type is:
\[ S_{\text{sepal}} = A E (1-B)(1-C), \quad S_{\text{petal}} = A B E (1-C) \]
\[ S_{\text{stamen}} = B C E (1-A), \quad S_{\text{carpel}} = C E (1-A)(1-B) \]
The predicted organ is the one with the highest score. The simulation below computes these scores, animates their emergence during development, and then reproduces the three canonical mutant phenotypes by setting A, B or C to zero.
Click Run to execute the Python code
Code will be executed with Python 3 on the server
Note the elegance of the C-class loss-of-function simulation: because the A domain expands into whorls 3-4 (no antagonism), but B is unchanged, whorl 3 becomes A+B = petal and whorl 4 becomes A = sepal. Exactly the phenotype of agamous / double roses. The meristem also fails to terminate, explaining indeterminate growth of double flowers.
3. Meristem Identity: LEAFY & APETALA1
Before the ABC genes can act, a vegetative shoot apical meristem (SAM) must convert to an inflorescence/floral meristem. Two genes act at the top of this hierarchy:
- LEAFY (LFY) - plant-specific transcription factor that integrates photoperiod, gibberellin and age cues. Mis-expression in vegetative tissue can transform leaf-axillary buds into terminal flowers.
- APETALA1 (AP1) - class-A MADS-box but also a meristem-identity gene; LFY directly activates AP1, which in turn represses the inflorescence gene TERMINAL FLOWER 1 (TFL1).
The circuit is a classical mutual exclusion:
\[ \text{LFY} \rightarrow \text{AP1} \dashv \text{TFL1} \dashv \text{LFY,AP1} \]
TFL1 represses LFY/AP1 and keeps the apex inflorescence-like (indeterminate); when LFY/AP1 win, a terminal flower is specified and the meristem becomes determinate (except in agamous mutants, see above).
3.1 Two-Step Selector Logic
LFY activates the ABC genes region-specifically: in conjunction with WUSCHEL (WUS) it directly binds the AG locus (Lenhard 2001). In combination with UFO (UNUSUAL FLORAL ORGANS) it activates AP3 (class B). The system is a beautiful example of combinatorial gene regulation: the same LFY protein specifies different downstream targets depending on partner cofactors.
4. Flowering Time: Florigen & FLC
Grafting experiments in the 1930s (Chailakhyan) showed that a long-day plant, once induced, transmits a flowering signal through its phloem. This signal was christened florigen. Eighty years later it was identified: the FT protein, product of the FLOWERING LOCUS T gene.
4.1 Photoperiodic Induction
In long-day plants, CONSTANS (CO) mRNA accumulates at the end of the light period. CO is stabilised only in the presence of the blue-light photoreceptor CRY2 and the phytochrome PhyA. Stable CO activates FT transcription in the leaf vasculature. FT protein moves via phloem to the shoot apex where it binds FD (a bZIP TF) to activate AP1 and SOC1 - initiating the ABC cascade.
\[ \text{Light} \rightarrow \text{CO}_{\text{leaf}} \rightarrow \text{FT} \xrightarrow{\text{phloem}} \text{FT:FD}_{\text{apex}} \rightarrow \text{AP1, SOC1} \rightarrow \text{flower} \]
4.2 Vernalization: Winter Memory
Many biennials (Arabidopsis winter annuals, beets, winter wheat) must experience prolonged cold (weeks) before they can flower. The cold-repressed gene FLOWERING LOCUS C (FLC) is a MADS-box repressor of FT. Prolonged cold deposits repressive Polycomb\( \text{H3K27me3} \) marks at the FLC locus; silencing is cell-heritable and persists through warm spring, creating an epigenetic memory of winter.
The quantitative model (Angel et al. 2011) treats FLC silencing as a stochastic bistable switch: each cell's FLC flips OFF with rate \(k(T)\) depending on temperature, and the plant-level competence emerges from the fraction of silenced cells. We simulate a deterministic mean-field version below.
\[ \frac{\text{d FLC}}{\text{d}t} = -k(T)\,\text{FLC}, \qquad k(T) = k_{\max}\, e^{-(T-T_{\text{opt}})^2/\sigma^2} \]
Click Run to execute the Python code
Code will be executed with Python 3 on the server
Notice the bell-shaped temperature dependence of silencing rate: 5 ยฐC is optimum, neither freezing nor warm spells help. This matches published data: winter-grown Arabidopsis at Haell (Sweden, ~4 ยฐC mean) silences FLC more rapidly than plants grown at 10 ยฐC.
4.3 Gibberellin Pathway
A fourth flowering pathway converges on SOC1: gibberellin biosynthesis increases LEAFY expression and overrides FLC in older plants. This is why biennials eventually flower without vernalization: endogenous GA accumulates with age, and GA-deficient mutants (ga1-3) stay vegetative forever under short days.
4.5 The Full Flowering Network
Integrating four pathways (photoperiod, vernalization, autonomous, gibberellin) the complete flowering-time circuit in Arabidopsis has more than 200 known genes. The core integrators - FT, SOC1, AP1, LFY - are the convergence points:
4.6 The MADS-Box Structure
The MADS-box is a ~60 amino-acid DNA-binding domain at the N-terminus. The canonical structure (Pellegrini et al. 1995) has a \(\beta\)-sheet lying in the DNA minor groove, followed by two \(\alpha\)-helices that dimerise via an antiparallel coiled-coil. The consensus CArG-box recognition sequence is\(\text{CC(A/T)}_6\text{GG}\). Downstream are the I (intervening), K (keratin-like coiled-coil, responsible for tetramerisation), and C (variable C-terminal) domains:
\[ \text{MADS-box TF: N-[MADS(60)]-[I(30)]-[K-domain(70)]-[C-term(variable)]-C} \]
Tetramerisation via the K domain is what allows quartet-model complexes to form: two MADS dimers each recognising one CArG box loop the DNA between them, contacting target promoters cooperatively.
5. Evolution of the ABC System
MADS-box genes predate angiosperms - they are present in mosses and ferns, where they control gametophyte development. The angiosperm innovation was to recruit paralogues into a strict spatial code across whorls. In basal angiosperms (Amborella, Nuphar) the boundaries between A, B, C are fuzzy - accounting for "undifferentiated" perianths of tepals. Eudicot and monocot core-eudicot lineages sharpen the boundaries by gene duplications: FUL (fruitful) is the euAPETALA1 sister gene.
The fading borders model (Buzgo et al. 2004) explains the continuous perianth of basal angiosperms without invoking failure of the ABC system: gradients of B-class expression yield gradual transitions from sepal to petal to stamen. In core eudicots the gradients have been replaced by sharp step functions.
6. Summary Table
ABC Model
Combinatorial code: A -> sepal, A+B -> petal, B+C -> stamen, C -> carpel
Antagonism
A and C mutually exclude; loss of A expands C and vice versa
Class A
APETALA1 (MADS-box), APETALA2 (AP2 domain, not MADS)
Class B
APETALA3 + PISTILLATA; obligate heterodimer
Class C
AGAMOUS; also provides determinacy - loss yields double flowers
Class D
SEEDSTICK, SHATTERPROOF 1/2 - ovule identity
Class E
SEPALLATA1-4; required in every whorl (quartet hypothesis)
Quartet Model
Tetrameric MADS complexes loop two CArG-box-containing enhancers
Meristem Identity
LEAFY -> APETALA1 -| TERMINAL FLOWER 1; converts SAM to floral meristem
Florigen
FT protein - moves from leaf phloem to apex, activates AP1/SOC1
Vernalization
Cold deposits H3K27me3 at FLC; silences MADS repressor; cell-heritable winter memory
Integration
FT + GA + LFY + (absence of FLC) converge on the floral transition
References
- Coen, E. S. & Meyerowitz, E. M. (1991). The war of the whorls: genetic interactions controlling flower development. Nature, 353, 31-37.
- Theissen, G. (2001). Development of floral organ identity: stories from the MADS house. Current Opinion in Plant Biology, 4, 75-85.
- Pelaz, S., Ditta, G. S., Baumann, E., Wisman, E. & Yanofsky, M. F. (2000). B and C floral organ identity functions require SEPALLATA MADS-box genes. Nature, 405, 200-203.
- Honma, T. & Goto, K. (2001). Complexes of MADS-box proteins are sufficient to convert leaves into floral organs. Nature, 409, 525-529.
- Lenhard, M. et al. (2001). Termination of stem cell maintenance in Arabidopsis floral meristems by interactions between WUSCHEL and AGAMOUS. Cell, 105, 805-814.
- Corbesier, L. et al. (2007). FT protein movement contributes to long-distance signaling in floral induction of Arabidopsis. Science, 316, 1030-1033.
- Michaels, S. D. & Amasino, R. M. (1999). FLOWERING LOCUS C encodes a novel MADS domain protein that acts as a repressor of flowering. The Plant Cell, 11, 949-956.
- Angel, A., Song, J., Dean, C. & Howard, M. (2011). A Polycomb-based switch underlying quantitative epigenetic memory. Nature, 476, 105-108.
- Buzgo, M., Soltis, P. S. & Soltis, D. E. (2004). Floral developmental morphology of Amborella trichopoda. International Journal of Plant Sciences, 165, 925-947.
- Theissen, G. & Saedler, H. (2001). Floral quartets. Nature, 409, 469-471.
- Schmid, M. et al. (2003). Dissection of floral induction pathways using global expression analysis. Development, 130, 6001-6012.