Fluorescence & FRET in Biophysics
Jablonski diagrams, fluorescence lifetime, Förster theory, FCS autocorrelation, and super-resolution microscopy
Fluorescence as a Biophysical Tool
Fluorescence is arguably the most versatile tool in modern biophysics. By attaching fluorescent labels to specific sites on biomolecules, we can track their location, measure distances, probe local environments, and monitor dynamics — all with extraordinary sensitivity down to single molecules. This chapter develops the photophysics of fluorescence from the Jablonski diagram through to advanced techniques including FRET, FCS, and super-resolution microscopy.
The quantitative framework we develop here underpins nearly every modern fluorescence experiment in cell biology, structural biology, and biophysics. We derive the key equations from first principles, connecting photophysical rates to measurable quantities such as lifetime, quantum yield, FRET efficiency, and diffusion coefficients.
1. The Jablonski Diagram
The Jablonski diagram summarises the electronic states and photophysical processes of a fluorophore. The key states and transitions are:
Electronic States
$S_0$: Ground singlet state. The molecule resides here before excitation.
$S_1$: First excited singlet state. Reached by absorption of a photon ($\sim 10^{-15}$ s). The molecule rapidly relaxes to the lowest vibrational level of $S_1$ via internal conversion ($\sim 10^{-12}$ s).
$T_1$: First excited triplet state. Reached from $S_1$ via intersystem crossing (ISC). Transition$T_1 \to S_0$ (phosphorescence) is spin-forbidden and therefore slow ($\sim \mu$s to s).
Kasha's rule states that fluorescence emission occurs from the lowest vibrational level of$S_1$, regardless of which vibrational level was initially excited. This is because internal conversion and vibrational relaxation ($\sim 10^{-12}$ s) are much faster than fluorescence ($\sim 10^{-9}$ s). A key consequence is that the emission spectrum is independent of the excitation wavelength, and the Stokes shift (energy difference between absorption and emission maxima) reflects vibrational relaxation in both ground and excited states.
Timescales of Photophysical Processes
Absorption: $\sim 10^{-15}$ s (femtoseconds). The Franck-Condon principle states that the electronic transition occurs so rapidly that the nuclear positions are unchanged. The molecule is promoted vertically on the potential energy surface, typically to a vibrationally excited level of $S_1$.
Vibrational relaxation: $\sim 10^{-12}$ s (picoseconds). Energy is redistributed among vibrational modes and dissipated to the solvent. This rapid thermalisation ensures fluorescence originates from the lowest vibrational level of $S_1$.
Fluorescence: $\sim 10^{-9}$ s (nanoseconds). Radiative transition from $S_1 \to S_0$ with emission of a photon. The emitted photon has lower energy (longer wavelength) than the absorbed photon — this is the Stokes shift.
Intersystem crossing: $\sim 10^{-8}$ to$10^{-6}$ s. Spin-forbidden $S_1 \to T_1$ transition, enhanced by spin-orbit coupling (heavy atom effect). Molecules containing heavy atoms (Br, I) or transition metals show enhanced ISC rates.
Phosphorescence: $\sim 10^{-6}$ to$10^{0}$ s (microseconds to seconds). Spin-forbidden $T_1 \to S_0$transition. Rarely observed in solution at room temperature due to efficient non-radiative quenching of the long-lived triplet state, but important in oxygen sensing and photodynamic therapy.
The Stokes Shift and Mirror Image Rule
The Stokes shift $\Delta\tilde{\nu}$ is the difference (in wavenumber) between the absorption and emission maxima. It arises from vibrational relaxation in both the excited state (after absorption) and the ground state (after emission). Larger Stokes shifts indicate greater structural reorganisation upon excitation and are advantageous for microscopy because they allow clean separation of excitation and emission light.
The mirror image rule states that the emission spectrum is often an approximate mirror image of the absorption spectrum (plotted on a wavenumber scale). This arises when the vibrational mode structure is similar in $S_0$ and $S_1$. Deviations from the mirror image rule indicate significant excited-state structural changes (e.g., charge transfer, proton transfer, or excimer formation).
Common fluorophore Stokes shifts: Fluorescein (20 nm), Rhodamine B (25 nm), Cy3 (20 nm), Cy5 (20 nm), GFP (20 nm), DAPI (80 nm), Alexa Fluor 488 (20 nm). Environment-sensitive dyes like Laurdan and Prodan exhibit Stokes shifts that change dramatically ($\sim 50$–120 nm) with solvent polarity, making them valuable membrane probes.
2. Derivation: Fluorescence Lifetime and Quantum Yield
Rate Equations for the Excited State
The population of $S_1$ decays via competing pathways. Let $k_r$ be the radiative (fluorescence) rate constant, $k_{nr}$ the non-radiative decay rate (including internal conversion, collisional quenching, and intersystem crossing). The rate equation for the excited-state population $n^*(t)$ after a delta-function excitation pulse is:
$$\frac{dn^*(t)}{dt} = -(k_r + k_{nr})\, n^*(t)$$
This first-order ODE has the solution:
$$n^*(t) = n^*(0)\, e^{-t/\tau}$$
where the fluorescence lifetime is defined as:
$$\boxed{\tau = \frac{1}{k_r + k_{nr}}}$$
The lifetime $\tau$ is the average time a molecule spends in the excited state before returning to the ground state. Typical values for common fluorophores range from$\sim 1$ ns (quenched dyes) to $\sim 10$ ns (fluorescein, rhodamine).
Quantum Yield
The fluorescence quantum yield $\Phi$ is the fraction of absorbed photons that are re-emitted as fluorescence:
$$\boxed{\Phi = \frac{k_r}{k_r + k_{nr}} = \frac{\tau}{\tau_{\text{nat}}}}$$
where $\tau_{\text{nat}} = 1/k_r$ is the natural (or radiative) lifetime — the lifetime the fluorophore would have if there were no non-radiative decay. Since$k_{nr} \geq 0$, we always have $\tau \leq \tau_{\text{nat}}$ and$\Phi \leq 1$.
Measuring both $\tau$ (via time-correlated single photon counting, TCSPC) and$\Phi$ (via comparative or integrating sphere methods) allows separate determination of $k_r$ and $k_{nr}$.
Measuring Lifetimes: TCSPC and Frequency-Domain Methods
Time-correlated single photon counting (TCSPC): The sample is excited with a pulsed laser (typically $\sim 100$ ps pulse width at$\sim 40$–80 MHz repetition rate). A fast detector (avalanche photodiode or photomultiplier tube) records the arrival time of individual fluorescence photons relative to the excitation pulse. After millions of excitation cycles, a histogram of arrival times is built up, which represents the fluorescence decay curve $I(t) = I_0\, e^{-t/\tau}$.
The measured decay is the convolution of the true decay with the instrument response function (IRF):
$$I_{\text{measured}}(t) = \text{IRF}(t) \otimes I_{\text{true}}(t) = \int_0^t \text{IRF}(t') \cdot I_{\text{true}}(t-t')\, dt'$$
Deconvolution or iterative reconvolution fitting is used to extract the true lifetime. For multi-exponential decays (indicating heterogeneous environments or multiple conformational states):
$$I(t) = \sum_i \alpha_i\, e^{-t/\tau_i}$$
where $\alpha_i$ are the pre-exponential amplitudes (fractional populations) and$\tau_i$ are the individual lifetimes. The average lifetime is$\langle\tau\rangle = \sum_i \alpha_i \tau_i^2 / \sum_i \alpha_i \tau_i$.
Frequency-domain (phase modulation) method: The excitation is sinusoidally modulated at angular frequency $\omega$. The emitted fluorescence is phase-shifted by $\phi$ and demodulated by a factor $m$ relative to the excitation. For a single-exponential decay: $\tan\phi = \omega\tau$ and$m = (1 + \omega^2\tau^2)^{-1/2}$. Multi-frequency measurements allow resolution of multi-exponential decays.
FLIM: Fluorescence Lifetime Imaging
Fluorescence lifetime imaging microscopy (FLIM) maps the fluorescence lifetime at each pixel of an image. Since $\tau$ is sensitive to the local environment (pH, ion concentration, viscosity, FRET) but independent of fluorophore concentration and excitation intensity, FLIM provides contrast mechanisms that are inaccessible to intensity-based imaging.
FRET-FLIM: The most powerful application of FLIM is detecting FRET via changes in donor lifetime. In the presence of acceptor, the donor lifetime decreases from$\tau_D$ to $\tau_{DA} = \tau_D(1-E)$. FRET-FLIM is superior to intensity-based FRET because it is insensitive to donor/acceptor stoichiometry, spectral bleedthrough, and direct acceptor excitation.
Phasor analysis: An alternative to multi-exponential fitting, the phasor approach maps each pixel to a point on the phasor plot (a semicircle in the $g$–$s$plane). Single-exponential decays fall on the universal semicircle; multi-exponential or FRET populations appear inside the semicircle. This model-free approach is particularly useful for complex biological samples with heterogeneous lifetimes.
The Strickler-Berg Equation
The radiative rate $k_r$ can be predicted from the absorption spectrum using the Strickler-Berg equation (1962), which connects $k_r$ to the integrated absorption cross-section:
$$k_r = 2.88 \times 10^{-9}\, n^2 \frac{\langle \tilde{\nu}_f^{-3} \rangle^{-1}}{\langle \tilde{\nu}_a^{-1} \rangle} \int \epsilon(\tilde{\nu})\, d\tilde{\nu}$$
where $n$ is the refractive index, $\tilde{\nu}$ is the wavenumber,$\epsilon$ is the molar extinction coefficient, and the averages are over the fluorescence and absorption spectra respectively. This equation allows prediction of the natural lifetime from steady-state spectral data alone, providing a crucial check on experimental lifetime measurements.
Stern-Volmer Quenching
When a quencher molecule Q is present at concentration $[Q]$, it provides an additional de-excitation pathway with rate $k_q[Q]$. The modified lifetime and quantum yield become:
$$\tau' = \frac{1}{k_r + k_{nr} + k_q[Q]}$$
Dividing the unquenched by the quenched quantities gives the Stern-Volmer equation:
$$\boxed{\frac{\tau_0}{\tau} = \frac{\Phi_0}{\Phi} = 1 + k_q \tau_0 [Q] = 1 + K_{SV}[Q]}$$
where $K_{SV} = k_q \tau_0$ is the Stern-Volmer constant. A linear plot of$\tau_0/\tau$ vs $[Q]$ confirms dynamic (collisional) quenching and yields $k_q$. For oxygen quenching of typical fluorophores,$k_q \approx 10^{10}$ M$^{-1}$s$^{-1}$, near the diffusion-controlled limit.
3. Derivation: FRET Efficiency (Förster Theory)
Förster resonance energy transfer (FRET) is the non-radiative transfer of excitation energy from a donor fluorophore (D) to an acceptor chromophore (A) via a dipole-dipole coupling mechanism. The donor and acceptor need not be in contact; FRET operates over distances of 1–10 nm, making it an exquisite molecular ruler for biophysics.
Dipole-Dipole Coupling and the Transfer Rate
The excited donor can be modelled as an oscillating electric dipole that generates a near-field electric field. The acceptor, at distance $r$, couples to this field through its transition dipole moment. Using Fermi's golden rule, the energy transfer rate is:
$$k_T = \frac{1}{\tau_D}\left(\frac{R_0}{r}\right)^6$$
where $\tau_D$ is the donor lifetime in the absence of acceptor, and$R_0$ is the Förster radius. The crucial $r^{-6}$ dependence arises because the dipole-dipole interaction energy scales as $r^{-3}$, and the transfer rate (proportional to the square of the coupling matrix element) therefore scales as$r^{-6}$.
The physical picture: the near-field of an oscillating dipole falls off as $E \propto r^{-3}$. The rate of energy transfer, being proportional to the squared interaction energy ($|V|^2 \propto E^2 \propto r^{-6}$), gives the characteristic sixth-power distance dependence that makes FRET so sensitive to nanometre-scale distance changes.
Step-by-Step: From Fermi's Golden Rule to the Transfer Rate
The energy transfer rate is obtained from Fermi's golden rule:
$$k_T = \frac{2\pi}{\hbar} |V_{DA}|^2 \rho_A$$
where $V_{DA}$ is the electronic coupling between donor and acceptor transition dipole moments, and $\rho_A$ is the density of acceptor states at the transition energy. The coupling matrix element for dipole-dipole interaction is:
$$V_{DA} = \frac{\kappa |\boldsymbol{\mu}_D||\boldsymbol{\mu}_A|}{4\pi\epsilon_0 n^2 r^3}$$
where $\boldsymbol{\mu}_D$ and $\boldsymbol{\mu}_A$ are the transition dipole moments of donor and acceptor, $\kappa$ is the orientation factor, $n$is the refractive index, and $r$ is the inter-chromophore distance. Squaring gives$|V_{DA}|^2 \propto r^{-6}$.
The transition dipole magnitude can be related to the radiative rate via the oscillator strength, and the density of states is expressed through the spectral overlap integral. Combining these, and expressing the donor emission and acceptor absorption in terms of experimentally measurable spectra:
$$k_T = \frac{9000\,(\ln 10)\, \kappa^2 \Phi_D}{128\pi^5 N_A n^4 \tau_D r^6}\, J(\lambda)$$
This is the full Förster transfer rate equation. The dependence on $\Phi_D$(rather than $k_r$ directly) is because the donor emission spectrum shape determines the overlap integral, and the quantum yield normalises for competing decay pathways.
Deriving the Förster Radius $R_0$
The Förster radius $R_0$ is the distance at which the transfer rate equals the donor's intrinsic decay rate ($k_T = 1/\tau_D$), meaning transfer efficiency is 50%. Starting from Fermi's golden rule and integrating over the spectral overlap between the donor emission and acceptor absorption:
$$k_T = \frac{9000\, (\ln 10)\, \kappa^2 \Phi_D}{128\pi^5 N_A n^4 \tau_D r^6} \int_0^\infty F_D(\lambda)\, \epsilon_A(\lambda)\, \lambda^4\, d\lambda$$
where $F_D(\lambda)$ is the normalised donor emission spectrum (area = 1),$\epsilon_A(\lambda)$ is the acceptor molar extinction coefficient,$\kappa^2$ is the orientation factor, $n$ is the refractive index of the medium, $\Phi_D$ is the donor quantum yield, and$N_A$ is Avogadro's number. The spectral overlap integral is:
$$J(\lambda) = \int_0^\infty F_D(\lambda)\, \epsilon_A(\lambda)\, \lambda^4\, d\lambda$$
Setting $k_T = 1/\tau_D$ at $r = R_0$ and solving for $R_0$:
$$\boxed{R_0 = 0.211 \left[\kappa^2 n^{-4} \Phi_D J(\lambda)\right]^{1/6} \quad \text{(in nm, with } J \text{ in M}^{-1}\text{cm}^{-1}\text{nm}^4\text{)}}$$
Typical $R_0$ values for common FRET pairs: Cy3-Cy5 ($\sim 5.4$ nm), CFP-YFP ($\sim 4.9$ nm), Alexa488-Alexa594 ($\sim 5.4$ nm), ECFP-EYFP ($\sim 4.9$ nm).
FRET Efficiency
The FRET efficiency $E$ is the probability that an excited donor transfers its energy to the acceptor rather than decaying by its own pathways. It is the ratio of the transfer rate to the total decay rate of the donor:
$$E = \frac{k_T}{k_T + \tau_D^{-1}} = \frac{(R_0/r)^6}{1 + (R_0/r)^6}$$
Simplifying:
$$\boxed{E = \frac{1}{1 + (r/R_0)^6}}$$
This equation has the following properties: at $r = R_0$, $E = 0.5$; for $r \ll R_0$, $E \to 1$ (complete transfer); for$r \gg R_0$, $E \to 0$ (no transfer). The steep distance dependence means FRET is most sensitive to distance changes near $r \approx R_0$.
Experimentally, $E$ can be measured from (i) the donor intensity ratio$E = 1 - I_{DA}/I_D$, (ii) the donor lifetime ratio$E = 1 - \tau_{DA}/\tau_D$, or (iii) the acceptor-to-donor intensity ratio (sensitised emission).
The Orientation Factor $\kappa^2$
The orientation factor describes the relative alignment of the donor emission and acceptor absorption transition dipole moments:
$$\kappa^2 = (\cos\theta_T - 3\cos\theta_D\cos\theta_A)^2$$
where $\theta_T$ is the angle between the two dipoles,$\theta_D$ and $\theta_A$ are the angles each dipole makes with the inter-chromophore vector. The range is $0 \leq \kappa^2 \leq 4$:
- $\kappa^2 = 0$: perpendicular dipoles (no transfer)
- $\kappa^2 = 1$: parallel dipoles, perpendicular to the connecting vector
- $\kappa^2 = 4$: head-to-tail collinear dipoles (maximum transfer)
- $\kappa^2 = 2/3$: isotropic dynamic averaging (commonly assumed when both chromophores rotate freely on timescales faster than $\tau_D$)
The assumption $\kappa^2 = 2/3$ is validated by measuring fluorescence anisotropy of both donor and acceptor; low anisotropy values ($r < 0.1$) indicate rapid isotropic rotation and justify this assumption.
Single-Molecule FRET (smFRET)
Ensemble FRET measurements report the average efficiency over all molecules, which can mask conformational heterogeneity. Single-molecule FRET (smFRET) resolves the efficiency distribution of individual molecules, revealing distinct conformational states and dynamic transitions between them.
In a typical smFRET experiment, a confocal microscope or total internal reflection fluorescence (TIRF) microscope excites individual doubly-labelled molecules. The donor and acceptor emission are spectrally separated and detected simultaneously. The apparent FRET efficiency for each burst or time bin is:
$$E^* = \frac{n_A}{n_A + n_D}$$
where $n_A$ and $n_D$ are the detected acceptor and donor photon counts. Corrections for background, crosstalk (donor emission leaking into the acceptor channel), direct excitation of the acceptor, and differences in detection efficiencies and quantum yields are required to obtain the true efficiency $E$.
Histograms of $E$ values from thousands of individual molecules reveal the number and population of distinct conformational states. Hidden Markov model (HMM) analysis of time traces from surface-immobilised molecules yields transition rates between states, providing a complete kinetic description of conformational dynamics.
FRET-Based Distance Measurements: Practical Considerations
Converting FRET efficiency to distance requires careful attention to several factors:
- Linker flexibility: Fluorophores are attached via flexible linkers (typically 10–20 atoms), so the measured distance is an average over the accessible volume of each dye, not the backbone-to-backbone distance
- Dynamic averaging: If the donor-acceptor distance fluctuates on timescales faster than $\tau_D$, the measured $E$ corresponds to $\langle r^{-6}\rangle$, not $\langle r \rangle^{-6}$
- Photobleaching: Acceptor photobleaching produces an apparent decrease in $E$; donor-only populations must be identified and excluded
- Multi-label stoichiometry: Incomplete labelling gives donor-only or acceptor-only populations that must be separated from genuine FRET populations
Despite these complications, smFRET has been successfully applied to measure distances in protein folding (denatured state dimensions, folding intermediates), nucleic acid structures (DNA bending, RNA folding), molecular machines (ribosome dynamics, polymerase translocation), and intrinsically disordered proteins (distance distributions in the unfolded ensemble).
4. Derivation: FCS Autocorrelation Function
Fluorescence correlation spectroscopy (FCS) analyses the temporal fluctuations of fluorescence intensity from a small (<1 fL) observation volume. These fluctuations arise from molecules diffusing in and out of the focal volume, chemical reactions, or photophysical transitions. FCS extracts molecular concentrations, diffusion coefficients, and reaction rates without needing single-molecule sensitivity in the traditional sense.
Setting Up the Autocorrelation
The fluorescence intensity $F(t)$ fluctuates around its mean$\langle F \rangle$. We define the fluctuation$\delta F(t) = F(t) - \langle F \rangle$ and the normalised autocorrelation function:
$$G(\tau) = \frac{\langle \delta F(t)\, \delta F(t+\tau) \rangle}{\langle F(t) \rangle^2}$$
The fluorescence signal is $F(t) = \kappa \int I(\mathbf{r})\, C(\mathbf{r},t)\, d^3r$, where $I(\mathbf{r})$ is the molecule detection function (product of excitation intensity and collection efficiency), $C(\mathbf{r},t)$ is the local concentration, and $\kappa$ is a proportionality constant including the molecular brightness.
3D Gaussian Observation Volume
For a 3D Gaussian observation volume with lateral radius $w_0$ and axial half-length $z_0$:
$$I(\mathbf{r}) = I_0 \exp\left(-\frac{2(x^2+y^2)}{w_0^2} - \frac{2z^2}{z_0^2}\right)$$
The effective observation volume is $V_{\text{eff}} = \pi^{3/2} w_0^2 z_0$ and the structure parameter is $s = z_0/w_0$ (typically $s \approx 3$–8 for confocal setups). The mean number of molecules in the volume is$\langle N \rangle = C \cdot V_{\text{eff}}$.
Using the diffusion propagator for free 3D Brownian motion:
$$P(\mathbf{r}, \tau | \mathbf{r}_0) = \frac{1}{(4\pi D\tau)^{3/2}} \exp\left(-\frac{|\mathbf{r}-\mathbf{r}_0|^2}{4D\tau}\right)$$
and substituting into the autocorrelation integral, we obtain (after evaluating Gaussian integrals):
$$\boxed{G(\tau) = \frac{1}{N}\left(1 + \frac{\tau}{\tau_D}\right)^{-1}\left(1 + \frac{\tau}{s^2\tau_D}\right)^{-1/2}}$$
where $\tau_D = w_0^2/(4D)$ is the characteristic diffusion time through the focal volume, and $N = \langle N \rangle$ is the mean number of molecules. At$\tau = 0$, $G(0) = 1/N$, providing a direct measure of concentration.
Extracting Physical Parameters from FCS
From a fit of $G(\tau)$ to the measured autocorrelation:
- Concentration: $C = N/V_{\text{eff}}$ from $G(0) = 1/N$
- Diffusion coefficient: $D = w_0^2/(4\tau_D)$ from the half-decay time of $G(\tau)$
- Hydrodynamic radius: $R_H = k_BT/(6\pi\eta D)$ via the Stokes-Einstein relation
- Molecular interactions: Changes in $\tau_D$ upon binding report on complex formation (larger complexes diffuse more slowly)
For two-component diffusion (e.g., free vs. bound ligand), the autocorrelation becomes a weighted sum of two diffusion terms with different $\tau_D$ values. Fluorescence cross-correlation spectroscopy (FCCS) uses two spectrally distinct labels to detect co-diffusion of interacting partners.
Triplet-State and Photophysical Corrections to FCS
Real fluorophores undergo transitions to the triplet state, producing fast intensity fluctuations (microsecond timescale) that must be accounted for in the FCS model. The corrected autocorrelation includes a triplet blinking term:
$$G(\tau) = \frac{1}{N}\left(1 + \frac{\tau}{\tau_D}\right)^{-1}\left(1 + \frac{\tau}{s^2\tau_D}\right)^{-1/2}\left(1 + \frac{T}{1-T}\exp\left(-\frac{\tau}{\tau_T}\right)\right)$$
where $T$ is the equilibrium triplet-state fraction and $\tau_T$ is the triplet relaxation time (typically 1–10 $\mu$s). The triplet term produces a shoulder at short lag times that must be fitted before extracting reliable diffusion parameters.
Additional photophysical effects that can affect FCS measurements include photobleaching (loss of fluorophores during measurement, leading to apparent slow fluctuations), detector afterpulsing (producing artifactual correlations at very short lag times, corrected by cross-correlating signals split between two detectors), and saturation effects at high excitation intensities.
Practical Considerations for FCS Experiments
Calibration: The focal volume dimensions ($w_0$and $z_0$) must be calibrated using a dye with known diffusion coefficient (e.g., Alexa 488 in water, $D = 435\;\mu$m$^2$/s at 25°C). The structure parameter $s$ and $w_0$ are obtained from a fit to the autocorrelation.
Concentration range: FCS works best at nanomolar concentrations (1–100 nM), corresponding to $N \sim 0.1$–10 molecules in the focal volume. At lower concentrations, data acquisition takes prohibitively long; at higher concentrations, the relative fluctuations $\delta F/\langle F \rangle \propto 1/\sqrt{N}$ become too small to detect reliably.
Two-component analysis: For a mixture of free and bound species with diffusion times $\tau_{D,1}$ and $\tau_{D,2}$, the autocorrelation becomes a weighted sum: $G(\tau) = f_1 G_1(\tau) + f_2 G_2(\tau)$, where the weights$f_i$ depend on both the fraction and the molecular brightness of each species. Binding-induced changes in diffusion time are detectable when the mass ratio of free to bound species is at least $\sim 8$:1 (since $D \propto M^{-1/3}$ for globular proteins).
5. Derivation: Super-Resolution Microscopy (PALM/STORM)
Conventional fluorescence microscopy is limited by diffraction to a resolution of$\sim \lambda/(2\text{NA}) \approx 200$ nm (Abbe limit). Super-resolution techniques circumvent this limit by exploiting the photophysics of individual fluorophores.
The Abbe Diffraction Limit
Ernst Abbe (1873) showed that the minimum resolvable distance for a conventional microscope is:
$$d_{\text{Abbe}} = \frac{\lambda}{2n\sin\theta} = \frac{\lambda}{2\,\text{NA}}$$
where $\lambda$ is the wavelength of light and NA is the numerical aperture of the objective. For visible light ($\lambda \approx 500$ nm) and a high-NA oil-immersion objective (NA = 1.4), this gives $d_{\text{Abbe}} \approx 180$ nm. This limit stood for over a century and was widely believed to be a fundamental barrier to optical microscopy.
The key insight that enabled super-resolution was recognising that the diffraction limit applies to the resolution of two simultaneously emitting fluorophores, but not to the precision with which a single isolated emitter can be localised. This distinction between "resolution" and "localisation precision" is the foundation of PALM/STORM.
Localisation Precision
A single fluorophore produces a diffraction-limited spot (point spread function, PSF) of width$s \approx 0.21\lambda/\text{NA}$. However, the centre of this spot can be determined by fitting a 2D Gaussian with precision far better than the diffraction limit. The localisation precision depends on the number of detected photons $N$:
$$\boxed{\sigma_{\text{loc}} \approx \frac{s}{\sqrt{N}}}$$
More precisely, the Thompson-Larson-Webb formula (2002) gives:
$$\sigma_{\text{loc}}^2 = \frac{s^2 + a^2/12}{N} + \frac{8\pi s^4 b^2}{a^2 N^2}$$
where $a$ is the pixel size and $b$ is the background standard deviation per pixel. With $N \sim 1000$ photons, $\sigma_{\text{loc}} \approx 10$ nm is routinely achieved.
Temporal Separation Strategy
The key insight of PALM (Betzig, 2006) and STORM (Zhuang, 2006) is to use photoswitchable or photoactivatable fluorophores to ensure that only a sparse subset of molecules is fluorescent in each frame. This allows individual PSFs to be isolated and fitted without overlap.
The process: (1) activate a random sparse subset of fluorophores; (2) image until they photobleach, fitting each PSF to determine its centre; (3) repeat thousands of times; (4) reconstruct a super-resolution image from all localised positions. The final resolution is determined by $\sigma_{\text{loc}}$, not the diffraction limit.
Nyquist Criterion for Super-Resolution
Achieving high localisation precision is necessary but not sufficient. The Nyquist-Shannon sampling theorem requires that the density of localisations must be high enough to resolve the structure of interest:
$$d_{\text{Nyquist}} = \frac{2}{\sqrt{\rho}}$$
where $\rho$ is the localisation density (localisations per unit area). The effective resolution of a PALM/STORM image is the larger of $\sigma_{\text{loc}}$and $d_{\text{Nyquist}}$. Dense labelling is therefore essential: a structure with features at 20 nm spacing requires localisations at least every 10 nm (Nyquist criterion), corresponding to $\rho \geq 10^4$ localisations/$\mu$m$^2$.
STED Microscopy Resolution
Stimulated emission depletion (STED) microscopy (Hell, 1994) uses a doughnut-shaped depletion beam to suppress fluorescence from the periphery of the excitation spot. The effective PSF is narrowed to:
$$\boxed{d_{\text{STED}} \approx \frac{\lambda}{2\text{NA}\sqrt{1 + I/I_{\text{sat}}}}}$$
where $I$ is the peak STED beam intensity and $I_{\text{sat}}$ is the saturation intensity of the fluorophore. In principle, resolution is unlimited as$I/I_{\text{sat}} \to \infty$; in practice, STED achieves 20–50 nm resolution with live-cell compatible intensities.
Structured Illumination Microscopy (SIM)
SIM uses patterned illumination (typically sinusoidal stripes) to encode high-spatial-frequency information into the detected image via Moiré fringes. By acquiring images with different pattern orientations and phases, then computationally reconstructing the image in Fourier space, SIM extends the observable spatial frequency bandwidth by a factor of 2, yielding a resolution of:
$$d_{\text{SIM}} \approx \frac{\lambda}{4\,\text{NA}} \approx 100\;\text{nm}$$
While the resolution improvement is more modest than PALM/STORM or STED, SIM has major practical advantages: it works with any fluorophore, requires only moderate illumination intensities, and can image large fields of view quickly. This makes SIM particularly suitable for live-cell imaging of dynamic processes. Nonlinear SIM (using saturated excitation) can theoretically achieve unlimited resolution, similar to STED, but requires higher intensities.
3D Super-Resolution and Axial Localisation
Extending super-resolution to three dimensions requires encoding axial ($z$) position information. Several strategies have been developed:
- Astigmatic imaging: A cylindrical lens introduces astigmatism, making the PSF elliptical with an orientation that depends on the $z$-position. The ellipticity encodes $z$ with $\sim 50$ nm precision over a $\sim 1 \;\mu$m range
- Biplane detection: Two focal planes separated by $\sim 350$ nm are imaged simultaneously; comparing the defocus of each molecule in the two planes yields $z$
- Double-helix PSF: A phase mask creates a PSF consisting of two lobes that rotate with $z$, encoding axial position over a $\sim 2 \;\mu$m range
- Interferometric PALM (iPALM): Two opposing objectives and interferometric detection achieve $\sim 10$ nm axial resolution, but require specialised hardware
Comparison of Super-Resolution Techniques
PALM/STORM: Resolution $\sim 10$–20 nm (lateral), $\sim 50$ nm (axial). Requires photoswitchable fluorophores. Slow (minutes per image). Best for fixed samples with high label density.
STED: Resolution $\sim 20$–50 nm. Works with many conventional fluorophores. Fast scanning ($\sim 1$ frame/s). Compatible with live-cell imaging. Limited by photobleaching and phototoxicity.
SIM: Resolution $\sim 100$ nm. Works with any fluorophore. Fast ($\sim 10$ frames/s). Lowest phototoxicity. Ideal for live-cell dynamics.
MINFLUX: Resolution $\sim 1$–5 nm. Requires photoswitchable fluorophores. Currently limited to small fields of view. Minimal photon budget ($\sim 100$ photons per localisation).
6. Derivation: FRAP (Fluorescence Recovery After Photobleaching)
FRAP measures the mobility of fluorescent molecules in membranes or cytoplasm. A defined region is irreversibly photobleached with an intense laser pulse, and the subsequent recovery of fluorescence as unbleached molecules diffuse into the bleached area is monitored.
Recovery Curve Derivation
For a circular bleach spot of radius $w$ with a Gaussian bleaching profile, the recovery is governed by the 2D diffusion equation. The initial condition after bleaching is a concentration profile $C(r,0)$ with a "hole" in the bleached region.
The Soumpasis solution (1983) for uniform circular bleaching gives the recovery function:
$$\boxed{f(t) = e^{-2\tau_D/t}\left[I_0\left(\frac{2\tau_D}{t}\right) + I_1\left(\frac{2\tau_D}{t}\right)\right]}$$
where $I_0$ and $I_1$ are modified Bessel functions of the first kind, and $\tau_D = w^2/(4D)$ is the characteristic diffusion time. The recovery half-time is:
$$\boxed{t_{1/2} = \frac{0.224\, w^2}{D}}$$
This provides a simple way to extract the diffusion coefficient: measure $t_{1/2}$ and the bleach spot radius $w$, then compute $D = 0.224\, w^2/t_{1/2}$.
Mobile Fraction
In practice, not all molecules are mobile. The recovery curve typically plateaus below the pre-bleach intensity:
$$F(t) = F_0 + (F_{\infty} - F_0)\, f(t)$$
The mobile fraction $M_f$ is:
$$\boxed{M_f = \frac{F_{\infty} - F_0}{F_{\text{pre}} - F_0}}$$
where $F_{\text{pre}}$ is the pre-bleach intensity, $F_0$ is the intensity immediately after bleaching, and $F_{\infty}$ is the plateau intensity. The immobile fraction ($1 - M_f$) may represent molecules bound to immobile structures (cytoskeleton, nuclear matrix), aggregated species, or molecules in membrane microdomains.
Reaction-Dominated FRAP
When molecular interactions (binding/unbinding) are important in addition to diffusion, the FRAP recovery is governed by a reaction-diffusion equation. In the reaction-dominant limit (where binding/unbinding is slower than diffusion), the recovery kinetics report primarily on the binding rate constants:
$$F(t) \approx F_{\infty}\left(1 - C_{\text{eq}} e^{-k_{\text{off}}t}\right)$$
where $k_{\text{off}}$ is the dissociation rate constant and $C_{\text{eq}}$is the bound fraction at equilibrium. This regime is identified by testing whether the recovery half-time is independent of the bleach spot size: if $t_{1/2}$ does not change when $w$ is varied, the recovery is reaction-limited rather than diffusion-limited.
FRAP variants: Half-FRAP (bleaching half the cell) tests for barriers to diffusion. iFRAP (inverse FRAP, bleaching everything except a small region) monitors dissipation rather than recovery. FLIP (fluorescence loss in photobleaching) continuously bleaches one region and monitors fluorescence loss elsewhere, probing connectivity between cellular compartments.
Typical FRAP Results in Biology
FRAP has yielded diffusion coefficients for a wide range of biological molecules:
- Lipids in membranes: $D \approx 1$–$5 \;\mu$m$^2$/s for free lipids, reduced in ordered domains
- Membrane proteins: $D \approx 0.01$–$0.1 \;\mu$m$^2$/s, much slower than lipids due to cytoskeletal interactions
- Cytoplasmic GFP: $D \approx 25$–$30 \;\mu$m$^2$/s (3–4x slower than in water due to macromolecular crowding)
- Nuclear proteins: $D \approx 0.5$–$5 \;\mu$m$^2$/s for freely diffusing; much slower ($< 0.01 \;\mu$m$^2$/s) for chromatin-bound factors
- Histones: Mobile fraction $\sim 3$% for H2B (most are stably bound to chromatin), with recovery times of minutes to hours
7. Applications in Modern Biophysics
Live Cell Imaging
Genetically encoded fluorescent proteins (GFP and derivatives) revolutionised cell biology by enabling visualisation of protein localisation, trafficking, and dynamics in living cells. Multi-colour imaging with spectrally distinct fluorescent proteins allows simultaneous tracking of multiple cellular components. Light-sheet fluorescence microscopy (LSFM) reduces phototoxicity and enables long-term 3D imaging of developing embryos and organoids.
Key fluorescent protein families: The original Aequorea GFP ($\lambda_{\text{ex}} = 395/475$ nm, $\lambda_{\text{em}} = 509$ nm) was engineered into enhanced variants (EGFP, EYFP, ECFP, mCherry, mKate2) spanning the entire visible spectrum. Self-labelling tags (HaloTag, SNAP-tag) allow attachment of synthetic dyes with superior brightness and photostability to genetically encoded protein targets.
Photoconvertible and photoactivatable proteins: Proteins such as mEos, Dendra2, and PA-GFP can be irreversibly converted from one spectral form to another by UV illumination. Reversibly switchable fluorescent proteins (rsEGFP, Dreiklang) can be toggled between fluorescent and dark states, enabling RESOLFT super-resolution microscopy with low illumination intensities.
Single-Molecule Tracking
Single-particle tracking (SPT) follows individual fluorescently labelled molecules in real time. By recording trajectories of membrane receptors, motor proteins, or transcription factors, researchers extract diffusion coefficients, confinement zones, binding kinetics, and transport mechanisms. Mean squared displacement (MSD) analysis distinguishes free diffusion ($\langle r^2\rangle \propto t$), confined diffusion ($\langle r^2\rangle \to \text{const}$), and directed transport ($\langle r^2\rangle \propto t^2$).
Protein-Protein Interactions
FRET-based assays are the gold standard for detecting protein-protein interactions in vivo. Bimolecular fluorescence complementation (BiFC), FRET-FLIM, and fluorescence cross-correlation spectroscopy (FCCS) each provide complementary information. FRET between labelled binding partners reports on the proximity ($< 10$ nm) and stoichiometry of complexes. Time-resolved FRET measurements distinguish static from dynamic heterogeneity in conformational ensembles.
Super-Resolution Microscopy (Nobel Prize 2014)
The 2014 Nobel Prize in Chemistry was awarded to Eric Betzig, Stefan Hell, and William Moerner for the development of super-resolution fluorescence microscopy. PALM/STORM has revealed the nanoscale organisation of the cytoskeleton (actin filaments, microtubules), chromatin structure, synaptic vesicle distributions, and nuclear pore complexes. STED has been applied to live-cell imaging of synaptic dynamics and viral entry pathways. The MINFLUX technique (Hell, 2017) achieves 1–5 nm resolution by minimising the number of photons needed for localisation.
Fluorescent Biosensors
Genetically encoded biosensors exploit FRET, circular permutation, or environment-sensitive fluorophores to report on intracellular signals in real time. Examples include: calcium sensors (GCaMP family), voltage indicators (ASAP, Voltron), cAMP sensors (Epac-based FRET), kinase activity reporters (AKAR), and redox sensors (roGFP). These tools enable spatiotemporal mapping of signalling dynamics with subcellular resolution.
Fluorescence Anisotropy
When a sample is excited with polarised light, only fluorophores with transition dipoles aligned with the polarisation direction are preferentially excited (photoselection). The emitted fluorescence is partially polarised, and the degree of polarisation is quantified by the fluorescence anisotropy:
$$r = \frac{I_{\parallel} - I_{\perp}}{I_{\parallel} + 2I_{\perp}}$$
where $I_{\parallel}$ and $I_{\perp}$ are the fluorescence intensities polarised parallel and perpendicular to the excitation polarisation. The fundamental anisotropy$r_0$ (at $t = 0$, before any rotation) is $r_0 = 0.4$for a parallel absorption and emission dipole.
The anisotropy decays due to rotational diffusion:
$$r(t) = r_0\, e^{-t/\theta}$$
where $\theta$ is the rotational correlation time, related to the molecular volume$V$ by $\theta = \eta V/(k_BT)$ (Stokes-Einstein-Debye relation). The steady-state anisotropy follows the Perrin equation:
$$r = \frac{r_0}{1 + \tau/\theta}$$
Anisotropy measurements are widely used to study: protein-protein binding (increase in $\theta$upon complex formation), membrane fluidity (lipid order restricts rotation), and local flexibility of labelled sites on biomolecules. Time-resolved anisotropy can resolve multiple rotational modes (e.g., local dye rotation vs. global protein tumbling).
8. Historical Development of Fluorescence Biophysics
Key Milestones
Theodor Förster (1948): Published the foundational theory of resonance energy transfer, deriving the $r^{-6}$distance dependence from dipole-dipole coupling. Förster's original paper ("Zwischenmolekulare Energiewanderung und Fluoreszenz") established FRET as a spectroscopic ruler for molecular distances, long before the technique could be applied to biological systems. His work built on earlier contributions by Jean Perrin (1927) and Francis Perrin (1932) on energy transfer in solutions.
Stryer and Haugland (1967): Provided the first experimental verification of the Förster $r^{-6}$ dependence using oligoproline spacers of defined length between donor and acceptor dyes, establishing FRET as a practical "spectroscopic ruler" for biology.
Daniel Axelrod (1970s–1980s): Pioneered FRAP (fluorescence recovery after photobleaching) for measuring lateral diffusion of membrane components, and invented total internal reflection fluorescence (TIRF) microscopy for selective imaging of surfaces and cell membranes with $\sim 100$ nm axial resolution.
Elson and Magde (1974): Introduced fluorescence correlation spectroscopy (FCS), demonstrating that intensity fluctuations from a small observation volume encode information about molecular diffusion and chemical kinetics. The technique was later refined by Rigler and colleagues in the 1990s with confocal optics and single-molecule sensitivity.
Single-molecule fluorescence revolution (1990s): Moerner and Kador (1989) detected single molecules via absorption at cryogenic temperatures. Betzig and Chichester (1993) detected single molecules by near-field fluorescence. Xie and Trautman (1998) established single-molecule FRET (smFRET). Ha et al. (1996) measured FRET between single donor-acceptor pairs, opening the door to studying conformational dynamics of individual biomolecules without ensemble averaging.
Green Fluorescent Protein (GFP): The discovery by Shimomura (1962), cloning by Prasher (1992), and optimisation by Tsien and Chalfie (Nobel Prize in Chemistry, 2008) transformed fluorescence from an in vitro technique to a universal in vivo tool. The development of the "fluorescent protein palette" (CFP, YFP, mCherry, etc.) enabled multi-colour imaging and FRET in living cells.
Betzig, Hell, and Moerner (Nobel Prize 2014): Eric Betzig developed PALM (photoactivated localisation microscopy), Stefan Hell invented STED (stimulated emission depletion) microscopy, and William Moerner discovered the photoswitching of single GFP molecules that made PALM/STORM possible. Their work broke the century-old Abbe diffraction limit and opened the nanoscale world to optical microscopy.
Recent advances: MINFLUX (Hell, 2017) achieves 1–5 nm resolution with minimal photon budgets. DNA-PAINT uses transient binding of fluorescent oligonucleotides for unlimited multiplexing. Expansion microscopy (Boyden, 2015) physically enlarges specimens for effective super-resolution on conventional microscopes. Lattice light-sheet microscopy (Betzig, 2014) enables fast 3D super-resolution imaging of living cells with minimal phototoxicity.
9. Python Simulation
The following simulation visualises the key quantitative relationships derived in this chapter: FRET efficiency vs. distance for several Förster radii, fluorescence decay curves with different lifetimes and quenching, and FCS autocorrelation functions for different diffusion coefficients.
Fluorescence & FRET Simulations
PythonClick Run to execute the Python code
Code will be executed with Python 3 on the server
10. Chapter Summary
Key Results
- • The Jablonski diagram organises the photophysics: absorption, vibrational relaxation, fluorescence ($k_r$), non-radiative decay ($k_{nr}$), and intersystem crossing ($k_{ISC}$).
- • Fluorescence lifetime $\tau = 1/(k_r + k_{nr})$ and quantum yield $\Phi = k_r/(k_r + k_{nr}) = \tau/\tau_{\text{nat}}$ fully characterise the excited-state decay. The Strickler-Berg equation predicts $k_r$ from the absorption spectrum.
- • Stern-Volmer quenching $\tau_0/\tau = 1 + K_{SV}[Q]$ provides a linear diagnostic for dynamic quenching and yields the bimolecular quenching rate constant.
- • FRET efficiency $E = 1/(1+(r/R_0)^6)$ arises from dipole-dipole coupling; the Förster radius $R_0 = 0.211[\kappa^2 n^{-4}\Phi_D J(\lambda)]^{1/6}$ depends on spectral overlap, orientation, and quantum yield.
- • FCS autocorrelation $G(\tau) = (1/N)(1+\tau/\tau_D)^{-1}(1+\tau/s^2\tau_D)^{-1/2}$ yields concentration ($N$) and diffusion coefficient ($D = w_0^2/4\tau_D$) from intensity fluctuations.
- • PALM/STORM achieve $\sim 10$ nm resolution by localising individual fluorophores with precision $\sigma_{\text{loc}} \approx s/\sqrt{N}$. STED resolution scales as $d \propto 1/\sqrt{1+I/I_{\text{sat}}}$.
- • FRAP recovery yields diffusion coefficients via $t_{1/2} = 0.224 w^2/D$ (Soumpasis solution) and the mobile fraction quantifies the proportion of freely diffusing molecules.
Connections to Other Chapters
• Single-Molecule Techniques (Chapter 1): Optical tweezers and AFM complement fluorescence by providing force measurements; smFRET bridges both chapters.
• Diffusion & Transport (Part V): FCS and FRAP directly measure diffusion coefficients that are treated theoretically in the transport chapter.
• Protein Folding (Part I): smFRET is a primary tool for studying folding energy landscapes and conformational dynamics.
• Membrane Biophysics (Part II): FRAP, FCS, and super-resolution are central to studying membrane organisation, lipid rafts, and receptor dynamics.