Protein Folding & Misfolding

From Anfinsen's thermodynamic hypothesis through Levinthal's paradox to modern funnel theory — with full derivations of folding kinetics, the Zimm-Bragg helix-coil transition, and computational simulations.

Learning Objectives

●Understand Anfinsen's thermodynamic hypothesis and the concept of the native state
●Derive the numbers behind Levinthal's paradox and explain why it demands a directed search
●Analyze the free energy landscape and funnel theory of protein folding
●Derive two-state folding kinetics, chevron plots, and $\phi$-value analysis
●Apply the Zimm-Bragg model to helix-coil transitions
●Connect folding theory to amyloid diseases and modern structure prediction

1. Introduction — Anfinsen's Thermodynamic Hypothesis

In 1961, Christian Anfinsen demonstrated that the enzyme ribonuclease A, once fully denatured and reduced, could refold spontaneously into its catalytically active form when the denaturant was removed and disulfide bonds were allowed to re-form. This landmark experiment established the thermodynamic hypothesis:

The native structure of a protein is the thermodynamically most stable state — the unique conformation that minimizes the Gibbs free energy under physiological conditions.

Mathematically, the native state N satisfies:

$G(N) = \min_{\{\text{all conformations } C\}} G(C)$

The Gibbs free energy of the native state relative to the unfolded ensemble is:

$\Delta G_{\text{fold}} = G_N - G_U = \Delta H_{\text{fold}} - T\Delta S_{\text{fold}}$

For a typical small globular protein, $\Delta G_{\text{fold}} \approx -5$ to$-15 \; \text{kcal/mol}$ — remarkably small compared to the total enthalpic and entropic contributions, which can each be hundreds of kcal/mol. The native state is thus only marginally stable, a delicate balance between large opposing forces.

Anfinsen's insight implies that the amino acid sequence alone encodes all the information needed to determine the three-dimensional structure. This is the foundation of the entire field of protein structure prediction — from homology modeling to the revolutionary AlphaFold.

The key insight of Ken Dill, Jose Onuchic, Peter Wolynes, and others is that the energy landscape for a foldable protein is funnel-shaped. As the chain forms increasing numbers of native contacts, the energy decreases on average, creating a downhill bias toward the native state.

Quantifying the Funnel

Define a reaction coordinate $Q$ as the fraction of native contacts formed ($0 \leq Q \leq 1$). The free energy as a function of $Q$ is:

$G(Q) = E(Q) - T S(Q)$

The energy decreases roughly linearly with $Q$:

$E(Q) \approx -Q \cdot n \cdot \epsilon_0$

where $\epsilon_0$ is the average energy per native contact. The conformational entropy decreases as native contacts constrain the chain:

$S(Q) \approx (1 - Q) \cdot n \cdot k_B \ln(\Omega_0)$

The competition between these two terms creates a free energy profile with a barrier at intermediate $Q$, the height of which determines the folding rate. When $\epsilon_0 > k_B T \ln(\Omega_0)$, the funnel slope is sufficient to guide folding.

The principle of minimal frustration (Bryngelson and Wolynes, 1987) states that natural proteins have evolved sequences where the energy landscape is minimally frustrated — meaning that most local energy minima still point downhill toward the native state, avoiding deep kinetic traps.

4. Derivation 3: Two-State Folding Kinetics

Many small single-domain proteins fold in a cooperative, two-state manner with no detectable intermediates. The folding reaction is:

$N \rightleftharpoons U$

Equilibrium Thermodynamics

The equilibrium constant for unfolding is:

$K_U = \frac{[U]}{[N]} = \exp\left(\frac{-\Delta G_{N \to U}}{RT}\right) = \exp\left(\frac{\Delta G_{\text{fold}}}{RT}\right)$

Since $\Delta G_{\text{fold}} = G_N - G_U < 0$, we have $K_U < 1$, and the native state is favored. The fraction of unfolded protein is:

$f_U = \frac{K_U}{1 + K_U} = \frac{1}{1 + \exp(-\Delta G_{\text{fold}}/RT)}$

Linear Free Energy Relationships and Denaturant Dependence

The stability of a protein varies linearly with denaturant concentration [D] (urea or guanidinium chloride):

$\Delta G_U([\text{D}]) = \Delta G_U^{H_2O} - m_{\text{eq}} \cdot [\text{D}]$

where $\Delta G_U^{H_2O}$ is the stability in the absence of denaturant and$m_{\text{eq}}$ (the m-value) reflects the change in solvent-accessible surface area upon unfolding, typically$1$–$5$ kcal/(mol$\cdot$M).

Folding and Unfolding Rate Constants

Using transition state theory, the microscopic rate constants for folding ($k_f$) and unfolding ($k_u$) also depend linearly on denaturant:

$\ln k_f([\text{D}]) = \ln k_f^{H_2O} - \frac{m_f}{RT} \cdot [\text{D}]$

$\ln k_u([\text{D}]) = \ln k_u^{H_2O} + \frac{m_u}{RT} \cdot [\text{D}]$

where $m_f$ and $m_u$ are the kinetic m-values, with$m_{\text{eq}} = m_f + m_u$.

Derivation of the Chevron Plot

In a kinetic experiment (e.g., stopped-flow fluorescence), the observed relaxation rate is the sum of folding and unfolding rates:

$k_{\text{obs}} = k_f + k_u$

Substituting the denaturant dependencies:

$k_{\text{obs}}([\text{D}]) = k_f^{H_2O} \exp\left(-\frac{m_f}{RT}[\text{D}]\right) + k_u^{H_2O} \exp\left(\frac{m_u}{RT}[\text{D}]\right)$

Taking the logarithm of $k_{\text{obs}}$:

$\ln k_{\text{obs}}([\text{D}]) = \ln\left[k_f^{H_2O} e^{-m_f[\text{D}]/RT} + k_u^{H_2O} e^{+m_u[\text{D}]/RT}\right]$

Shape of the Chevron Plot

Plotting $\ln k_{\text{obs}}$ vs [D] yields the characteristic V-shaped chevron plot:

●Left arm (low [D]): $k_f \gg k_u$, so $\ln k_{\text{obs}} \approx \ln k_f^{H_2O} - (m_f/RT)[\text{D}]$ — decreasing slope
●Minimum (midpoint): At $[\text{D}]_{1/2}$ where $k_f = k_u$ and $\Delta G = 0$
●Right arm (high [D]): $k_u \gg k_f$, so $\ln k_{\text{obs}} \approx \ln k_u^{H_2O} + (m_u/RT)[\text{D}]$ — increasing slope

$\phi$-Value Analysis

Alan Fersht developed $\phi$-value analysis to map the structure of the transition state ensemble. For a mutation that changes the stability by $\Delta\Delta G_{N-U}$and the folding activation energy by $\Delta\Delta G_{\ddagger-U}$:

$\phi = \frac{\Delta\Delta G_{\ddagger - U}}{\Delta\Delta G_{N - U}} = \frac{RT \ln(k_f^{\text{wt}}/k_f^{\text{mut}})}{\Delta\Delta G_{N-U}}$

Interpretation of $\phi$-values:

●$\phi = 1$: The mutated residue is fully structured in the transition state (native-like interactions fully formed)
●$\phi = 0$: The mutated residue is fully unstructured in the transition state (no native contacts formed)
●$0 < \phi < 1$: Partial structure formation, indicating the residue is involved in partially formed interactions at the transition state

5. Derivation 4: Helix-Coil Transition (Zimm-Bragg Model)

The helix-coil transition is one of the simplest and most thoroughly understood conformational transitions in biophysics. The Zimm-Bragg model (1959) treats a polypeptide chain as a one-dimensional Ising-like system where each residue is either in a helical (h) or coil (c) state.

Model Parameters

Propagation parameter $s$: The equilibrium constant for adding a helical residue to an existing helix.$s = \exp(-\Delta G_{\text{prop}}/k_BT)$. When $s > 1$, helix extension is favorable. The transition occurs near $s = 1$.
Nucleation parameter $\sigma$: The statistical weight penalty for initiating a new helical segment.$\sigma \ll 1$ (typically $10^{-3}$ to $10^{-4}$for $\alpha$-helices) because forming the first turn of a helix requires constraining $\sim 3$ residues without gaining a hydrogen bond.

Transfer Matrix Method

The statistical weight of each residue depends on its own state and that of its predecessor. We define a $2 \times 2$ transfer matrix $\mathbf{M}$:

$\mathbf{M} = \begin{pmatrix} 1 & \sigma s \\ 1 & s \end{pmatrix}$

Rows: predecessor state (c, h). Columns: current state (c, h).

The element $M_{ij}$ gives the statistical weight for residue $k$being in state $j$ given that residue $k-1$ is in state $i$:

$c \to c$: weight = 1 (reference state)
$c \to h$: weight = $\sigma s$ (nucleation penalty $\times$ propagation)
$h \to c$: weight = 1 (helix termination, no penalty)
$h \to h$: weight = $s$ (helix propagation)

Partition Function

For a chain of $N$ residues, the partition function is obtained by multiplying transfer matrices:

$Z = \mathbf{v}_0^T \cdot \mathbf{M}^N \cdot \mathbf{v}_f$

where $\mathbf{v}_0 = (1, 0)^T$ (chain starts in coil) and $\mathbf{v}_f = (1, 1)^T$ (sum over final states).

Eigenvalue Solution

The eigenvalues of $\mathbf{M}$ are found from $\det(\mathbf{M} - \lambda \mathbf{I}) = 0$:

$(1 - \lambda)(s - \lambda) - \sigma s = 0$

$\lambda^2 - (1 + s)\lambda + s(1 - \sigma) = 0$

Using the quadratic formula:

$\lambda_{\pm} = \frac{(1 + s) \pm \sqrt{(1 - s)^2 + 4\sigma s}}{2}$

For large $N$, the partition function is dominated by the larger eigenvalue $\lambda_+$:

$Z \approx c_+ \lambda_+^N$

Fraction Helix

The average fraction of residues in the helical state is:

$\theta = \frac{s}{N} \frac{\partial \ln Z}{\partial s} \approx \frac{s}{\lambda_+}\frac{\partial \lambda_+}{\partial s}$

Evaluating the derivative:

$\frac{\partial \lambda_+}{\partial s} = \frac{1}{2}\left(1 + \frac{-(1-s) + 2\sigma}{\sqrt{(1-s)^2 + 4\sigma s}}\right)$

At the transition midpoint $s = 1$:

$\theta(s=1) = \frac{1}{2}$

Sharpness of the Transition

The sharpness of the helix-coil transition is controlled by $\sigma$. Smaller $\sigma$ (stronger nucleation penalty) gives a sharper, more cooperative transition. In the limit $\sigma \to 0$, the transition becomes an all-or-nothing phase transition. For real $\alpha$-helices with$\sigma \approx 10^{-3}$–$10^{-4}$, the transition is fairly sharp and occurs over a narrow temperature range of $\sim 10$–$20$ K.

6. Applications

AlphaFold & Structure Prediction

DeepMind's AlphaFold (2020) achieved near-experimental accuracy in protein structure prediction at CASP14, validating Anfinsen's hypothesis computationally. It uses multiple sequence alignments and attention-based neural networks to predict 3D coordinates directly from sequence. AlphaFold2 has predicted structures for over 200 million proteins, transforming structural biology.

Amyloid Diseases

Protein misfolding leads to aggregation into amyloid fibrils — ordered, cross-$\beta$ sheet structures that are thermodynamically stable but kinetically trapped. These are implicated in:

●Alzheimer's disease: A$\beta$ peptide and tau protein
●Parkinson's disease: $\alpha$-synuclein
●Prion diseases: PrP$^{\text{Sc}}$
●Type II diabetes: IAPP (amylin)

Molecular Chaperones

Chaperones (GroEL/GroES, Hsp70, Hsp90) do not provide folding information but rather prevent aggregation by sequestering unfolded or partially folded intermediates. GroEL provides an isolated cavity where a single protein can fold without intermolecular contacts. The iterative annealing mechanism (Thirumalai and Lorimer) suggests chaperones unfold kinetically trapped intermediates, giving them another chance to reach the native state.

Drug Design Targeting Misfolding

Therapeutic strategies include:

●Kinetic stabilizers: Tafamidis binds transthyretin (TTR) and prevents amyloid formation
●Pharmacological chaperones: Small molecules that stabilize the native fold
●Aggregation inhibitors: Compounds that block fibril elongation
●Immunotherapy: Antibodies targeting amyloid plaques (e.g., lecanemab for Alzheimer's)

7. Historical Context

1961

Christian Anfinsen demonstrates spontaneous refolding of ribonuclease A, establishing the thermodynamic hypothesis. He receives the Nobel Prize in Chemistry in 1972 "for his work on ribonuclease, especially concerning the connection between the amino acid sequence and the biologically active conformation."

1968

Cyrus Levinthal articulates the paradox bearing his name at a conference, published in 1969. The paradox motivates the search for folding pathways and intermediates.

1987

Bryngelson & Wolynes introduce the energy landscape theory and the principle of minimal frustration, laying the groundwork for the funnel picture.

1995

Ken Dill, Jose Onuchic, and Peter Wolynes formalize the folding funnel concept and develop the "new view" of protein folding based on statistical mechanics of heteropolymers.

1998

David Baker and colleagues design the first computationally designed protein (Top7) and develop the Rosetta software suite, which becomes a cornerstone of computational protein design.

2020

DeepMind's AlphaFold2 achieves near-experimental accuracy at CASP14, solving the protein structure prediction problem for single-domain proteins. Demis Hassabis and John Jumper share the 2024 Nobel Prize in Chemistry with David Baker for computational protein design and structure prediction.

8. Python Simulation

Below we simulate three key aspects of protein folding theory using only NumPy: (1) the free energy profile as a function of the reaction coordinate at different temperatures, (2) the chevron plot for two-state folding kinetics, and (3) the Zimm-Bragg helix-coil transition showing how the nucleation parameter $\sigma$ controls cooperativity.

Protein Folding: Free Energy Landscape & Kinetics

Python

script.py233 lines

#!/usr/bin/env python3
"""
protein_folding_simulations.py
Simulate key aspects of protein folding theory:
  1. Free energy landscape vs reaction coordinate at various temperatures
  2. Two-state chevron plot: ln(k_obs) vs [denaturant]
  3. Zimm-Bragg helix-coil transition
Uses numpy only (no scipy).
"""
import numpy as np
import matplotlib
matplotlib.use('Agg')
import matplotlib.pyplot as plt

print("=" * 65)
print("  Protein Folding & Misfolding: Computational Simulations")
print("=" * 65)

# ================================================================
# SIMULATION 1: Free Energy Landscape G(Q) at Various Temperatures
# ================================================================
print("\n--- Simulation 1: Free Energy Landscape G(Q, T) ---")

# Model: G(Q) = E(Q) - T * S(Q)
# E(Q) = -Q * n * epsilon_0      (energy from native contacts)
# S(Q) = (1-Q) * n * kB * ln(omega)  (conformational entropy)
# Plus a quadratic roughness: delta_E * Q * (1-Q)

n_residues = 100
epsilon_0 = 0.06       # energy per native contact (kcal/mol)
kB = 0.001987          # Boltzmann constant (kcal/mol/K)
omega_0 = 3.0          # conformational states per residue
roughness = 2.0        # roughness amplitude (kcal/mol)

Q = np.linspace(0.0, 1.0, 500)

# Energy: linear decrease + barrier from incomplete contacts
E_Q = -Q * n_residues * epsilon_0 + roughness * Q * (1 - Q) * 4

# Entropy: decreases as chain folds
S_Q = (1 - Q) * n_residues * kB * np.log(omega_0)

temperatures = [280, 300, 320, 340, 360]
colors_T = ['#2563eb', '#16a34a', '#f59e0b', '#ef4444', '#8b5cf6']

fig, axes = plt.subplots(1, 3, figsize=(18, 5.5))

ax1 = axes[0]
for T, col in zip(temperatures, colors_T):
    G_Q = E_Q - T * S_Q
    G_Q = G_Q - G_Q[0]  # reference to unfolded state
    ax1.plot(Q, G_Q, color=col, linewidth=2, label=f'T = {T} K')

ax1.set_xlabel('Reaction Coordinate Q (fraction native contacts)', fontsize=11)
ax1.set_ylabel('G(Q) - G(0)  [kcal/mol]', fontsize=11)
ax1.set_title('Free Energy Landscape', fontsize=13, fontweight='bold')
ax1.legend(fontsize=9, loc='upper right')
ax1.axhline(y=0, color='white', linestyle='--', alpha=0.3)
ax1.set_xlim(0, 1)
ax1.grid(True, alpha=0.3)

# Print key values
print(f"  Residues: {n_residues}")
print(f"  epsilon_0 = {epsilon_0} kcal/mol per contact")
print(f"  Conformational states/residue: {omega_0}")
for T in temperatures:
    G_Q = E_Q - T * S_Q
    G_Q = G_Q - G_Q[0]
    dG_fold = G_Q[-1]
    barrier_idx = np.argmax(G_Q)
    barrier_height = G_Q[barrier_idx]
    print(f"  T={T}K: dG_fold = {dG_fold:.2f} kcal/mol, "
          f"barrier = {barrier_height:.2f} kcal/mol at Q = {Q[barrier_idx]:.2f}")

# ================================================================
# SIMULATION 2: Two-State Chevron Plot
# ================================================================
print("\n--- Simulation 2: Two-State Chevron Plot ---")

# Parameters for a typical two-state folder
# ln(kf) = ln(kf_H2O) - mf * [D] / RT
# ln(ku) = ln(ku_H2O) + mu * [D] / RT
# k_obs = kf + ku

R = 0.001987  # kcal/(mol*K)
T_chevron = 298.15  # K
RT = R * T_chevron

# Folding parameters
ln_kf_H2O = 6.0    # ln(kf) in water ~ kf ~ 400 s^-1
ln_ku_H2O = -4.0   # ln(ku) in water ~ ku ~ 0.018 s^-1
mf = 1.2            # kcal/(mol*M)
mu = 0.6            # kcal/(mol*M)

denaturant = np.linspace(0, 10, 500)

ln_kf = ln_kf_H2O - (mf / RT) * denaturant
ln_ku = ln_ku_H2O + (mu / RT) * denaturant

kf = np.exp(ln_kf)
ku = np.exp(ln_ku)
k_obs = kf + ku
ln_k_obs = np.log(k_obs)

# Find midpoint
mid_idx = np.argmin(np.abs(ln_kf - ln_ku))
D_mid = denaturant[mid_idx]

ax2 = axes[1]
ax2.plot(denaturant, ln_kf, '--', color='#16a34a', linewidth=1.5, alpha=0.7, label='ln(k_f)')
ax2.plot(denaturant, ln_ku, '--', color='#ef4444', linewidth=1.5, alpha=0.7, label='ln(k_u)')
ax2.plot(denaturant, ln_k_obs, '-', color='#f59e0b', linewidth=2.5, label='ln(k_obs)')
ax2.axvline(x=D_mid, color='white', linestyle=':', alpha=0.4)
ax2.set_xlabel('[Denaturant] (M)', fontsize=11)
ax2.set_ylabel('ln(k)  [s$^{-1}$]', fontsize=11)
ax2.set_title('Chevron Plot: Two-State Folding', fontsize=13, fontweight='bold')
ax2.legend(fontsize=9)
ax2.grid(True, alpha=0.3)

# Compute thermodynamic parameters
dG_H2O = RT * (ln_kf_H2O - ln_ku_H2O)
m_eq = mf + mu
D_half = dG_H2O / m_eq

print(f"  T = {T_chevron:.1f} K, RT = {RT:.4f} kcal/mol")
print(f"  ln(kf_H2O) = {ln_kf_H2O:.1f}, ln(ku_H2O) = {ln_ku_H2O:.1f}")
print(f"  mf = {mf:.1f} kcal/(mol*M), mu = {mu:.1f} kcal/(mol*M)")
print(f"  dG(H2O) = RT * [ln(kf) - ln(ku)] = {dG_H2O:.2f} kcal/mol")
print(f"  m_eq = mf + mu = {m_eq:.1f} kcal/(mol*M)")
print(f"  Midpoint [D]_1/2 = dG/m_eq = {D_half:.2f} M")
print(f"  Chevron minimum at [D] ~ {D_mid:.2f} M")

# ================================================================
# SIMULATION 3: Zimm-Bragg Helix-Coil Transition
# ================================================================
print("\n--- Simulation 3: Zimm-Bragg Helix-Coil Transition ---")

def zimm_bragg_helix_fraction(s_values, sigma, N_chain):
    """
    Compute helix fraction theta for given s values using
    the Zimm-Bragg transfer matrix method.

Transfer matrix M = [[1, sigma*s], [1, s]]
    Z = v0^T . M^N . vf
    theta = (s/N) * d(ln Z)/ds  (computed numerically)
    """
    theta = np.zeros_like(s_values)
    ds = 1e-6  # for numerical derivative

for i, s in enumerate(s_values):
        # Compute Z at s and s+ds by matrix power
        def compute_lnZ(s_val):
            M = np.array([[1.0, sigma * s_val],
                          [1.0, s_val]])
            # Matrix power by repeated squaring
            result = np.eye(2)
            base = M.copy()
            n = N_chain
            while n > 0:
                if n % 2 == 1:
                    result = result @ base
                base = base @ base
                n //= 2
            # Z = v0^T . M^N . vf
            v0 = np.array([1.0, 0.0])  # start in coil
            vf = np.array([1.0, 1.0])  # sum over end states
            Z = v0 @ result @ vf
            if Z > 0:
                return np.log(Z)
            return -1e10

lnZ = compute_lnZ(s)
        lnZ_ds = compute_lnZ(s + ds)

# theta = (s / N) * d(lnZ)/ds
        theta[i] = (s / N_chain) * (lnZ_ds - lnZ) / ds
        theta[i] = max(0.0, min(1.0, theta[i]))

return theta

# Range of s values (s ~ exp(-dG_prop/kBT), transition at s=1)
s_values = np.linspace(0.5, 1.5, 500)
N_chain = 100

# Different nucleation parameters
sigma_values = [1e-2, 1e-3, 1e-4]
sigma_colors = ['#f59e0b', '#16a34a', '#2563eb']
sigma_labels = ['\u03c3 = 10\u207b\u00b2', '\u03c3 = 10\u207b\u00b3', '\u03c3 = 10\u207b\u2074']

ax3 = axes[2]
for sigma, col, lab in zip(sigma_values, sigma_colors, sigma_labels):
    theta = zimm_bragg_helix_fraction(s_values, sigma, N_chain)
    ax3.plot(s_values, theta, color=col, linewidth=2.5, label=lab)

ax3.axvline(x=1.0, color='white', linestyle=':', alpha=0.4)
ax3.axhline(y=0.5, color='white', linestyle=':', alpha=0.4)
ax3.set_xlabel('Propagation parameter s', fontsize=11)
ax3.set_ylabel('Helix fraction \u03b8', fontsize=11)
ax3.set_title('Zimm-Bragg Helix-Coil Transition', fontsize=13, fontweight='bold')
ax3.legend(fontsize=9)
ax3.set_xlim(0.5, 1.5)
ax3.set_ylim(-0.05, 1.05)
ax3.grid(True, alpha=0.3)

for sigma in sigma_values:
    # Width of transition: approximate as 1/sqrt(sigma*N)
    width = 1.0 / np.sqrt(sigma * N_chain)
    print(f"  sigma = {sigma:.0e}, N = {N_chain}: "
          f"transition width ~ {width:.3f} in s-units")

# Style all axes
for ax in axes:
    ax.set_facecolor('#0a0a0a')
    ax.tick_params(colors='#cccccc')
    for spine in ax.spines.values():
        spine.set_color('#444444')
    ax.xaxis.label.set_color('#cccccc')
    ax.yaxis.label.set_color('#cccccc')
    ax.title.set_color('#4ade80')

fig.patch.set_facecolor('#111111')
plt.tight_layout(pad=2.0)
plt.savefig('output.png', dpi=150, bbox_inches='tight',
            facecolor='#111111', edgecolor='none')
plt.close()

print("\n" + "=" * 65)
print("  Simulation complete. Plots saved to output.png")
print("  Left:   Free energy G(Q) at five temperatures")
print("  Center: Chevron plot for two-state folding")
print("  Right:  Zimm-Bragg helix-coil transition")
print("=" * 65)

Click Run to execute the Python code

Code will be executed with Python 3 on the server

Summary of Key Equations

Levinthal's Paradox

$\Omega = 3^{100} \approx 5 \times 10^{47}, \quad t_{\text{search}} = \frac{\Omega}{10^{13}\,\text{s}^{-1}} \approx 10^{27}\,\text{years}$

Folding Free Energy

$\Delta G_{\text{fold}} = \Delta H_{\text{fold}} - T\Delta S_{\text{fold}}$

Two-State Equilibrium

$K_U = \frac{[U]}{[N]} = \exp\!\left(\frac{\Delta G_{\text{fold}}}{RT}\right), \quad \Delta G_U([\text{D}]) = \Delta G_U^{H_2O} - m_{\text{eq}}[\text{D}]$

Chevron Plot

$k_{\text{obs}} = k_f^{H_2O}\,e^{-m_f[\text{D}]/RT} + k_u^{H_2O}\,e^{+m_u[\text{D}]/RT}$

$\phi$-Value Analysis

$\phi = \frac{\Delta\Delta G_{\ddagger - U}}{\Delta\Delta G_{N - U}}$

Zimm-Bragg Eigenvalues

$\lambda_{\pm} = \frac{(1 + s) \pm \sqrt{(1 - s)^2 + 4\sigma s}}{2}$

Practice Problems

Problem 1:Levinthal's paradox: A 100-residue protein has 3 conformational states per residue. If each conformation is sampled in 1 ps, how long would a random search take? Compare to the age of the universe.

Solution:

1. Total number of conformations: $N = 3^{100} \approx 5.15 \times 10^{47}$.

2. Time per conformation: $\tau = 10^{-12}$ s.

3. Total search time: $t = N\tau = 5.15 \times 10^{47} \times 10^{-12} = 5.15 \times 10^{35}$ s.

4. Age of the universe: $t_U \approx 4.3 \times 10^{17}$ s.

5. Ratio: $t/t_U \approx 1.2 \times 10^{18}$. A random search would take $\sim 10^{18}$ times the age of the universe.

6. This demonstrates that protein folding must follow directed pathways on a funneled energy landscape, not a random search.

Problem 2:A protein unfolds with a two-state equilibrium. At 25°C, the fraction unfolded is 0.01. Calculate the equilibrium constant K_U and the free energy of unfolding ΔG_U.

Solution:

1. Two-state model: $N \rightleftharpoons U$. Fraction unfolded $f_U = 0.01$, fraction native $f_N = 0.99$.

2. Equilibrium constant: $K_U = \frac{[U]}{[N]} = \frac{f_U}{f_N} = \frac{0.01}{0.99} = 0.0101$.

3. Free energy: $\Delta G_U = -RT\ln K_U$.

4. At $T = 298$ K: $\Delta G_U = -(8.314)(298)\ln(0.0101) = -(2479)(-4.595)$.

5. $\Delta G_U = 11.4$ kJ/mol $\approx 2.7$ kcal/mol. This is typical for a small protein's marginal stability.

Problem 3:From a chevron plot, the folding rate in water is k_f^{H₂O} = 500 s⁻¹ and the unfolding rate is k_u^{H₂O} = 0.05 s⁻¹. The m-values are m_f = 4 kJ/(mol·M) and m_u = 1 kJ/(mol·M). Find the midpoint denaturant concentration and ΔG_U in water.

Solution:

1. In water: $\Delta G_U^{H_2O} = -RT\ln(k_u^{H_2O}/k_f^{H_2O}) = -RT\ln(0.05/500)$.

2. $\Delta G_U^{H_2O} = -(2.479)\ln(10^{-4}) = -(2.479)(-9.21) = 22.8$ kJ/mol.

3. The total m-value: $m_{\text{eq}} = m_f + m_u = 4 + 1 = 5$ kJ/(mol$\cdot$M).

4. At the midpoint, $\Delta G_U = 0$, so $\Delta G_U^{H_2O} = m_{\text{eq}}[\text{D}]_m$.

5. Midpoint: $[\text{D}]_m = \Delta G_U^{H_2O}/m_{\text{eq}} = 22.8/5 = 4.56$ M.

6. The Tanford $\beta_T = m_f/m_{\text{eq}} = 4/5 = 0.8$, indicating the transition state is 80% native-like in solvent exposure.

Problem 4:In the Zimm-Bragg model, a polypeptide has nucleation parameter σ = 10⁻⁴ and propagation parameter s = 1.0. Calculate the eigenvalues λ± and the helix-coil transition width.

Solution:

1. The Zimm-Bragg transfer matrix eigenvalues: $\lambda_{\pm} = \frac{(1+s) \pm \sqrt{(1-s)^2 + 4\sigma s}}{2}$.

2. At $s = 1$: $(1-s)^2 = 0$, so $\lambda_{\pm} = \frac{2 \pm \sqrt{4\sigma}}{2} = 1 \pm \sqrt{\sigma}$.

3. $\lambda_+ = 1 + \sqrt{10^{-4}} = 1.01$, $\lambda_- = 1 - 0.01 = 0.99$.

4. The transition width is governed by $\Delta s \sim 4\sqrt{\sigma} = 4 \times 0.01 = 0.04$.

5. Since $\sigma$ is small ($10^{-4}$), the transition is highly cooperative: the helix fraction changes from ~0 to ~1 over a narrow range of $s$ (or equivalently, temperature).

6. For a chain of $N$ residues, the transition sharpens further as $N$ increases, approaching a true phase transition as $N \to \infty$.

Problem 5:A mutation destabilizes a protein by ΔΔG_{N-U} = 5 kJ/mol and increases the unfolding activation energy by ΔΔG_{‡-U} = 4 kJ/mol. Calculate the φ-value and interpret the result.

Solution:

1. The $\phi$-value is defined as: $\phi = \frac{\Delta\Delta G_{\ddagger - U}}{\Delta\Delta G_{N - U}}$.

2. Substituting: $\phi = \frac{4}{5} = 0.8$.

3. Interpretation: $\phi = 0.8$ means the transition state has 80% of the native-state interaction energy at the mutation site.

4. This residue is largely structured in the transition state, suggesting it is part of the folding nucleus.

5. If $\phi = 0$, the site would be unstructured in the transition state (unfolded-like). If $\phi = 1$, it would be fully native-like.

6. Fractional $\phi$-values can indicate partial structure formation or reflect an ensemble of transition states with varying degrees of structure at that site.

Share:X Reddit LinkedIn

← Part 1 Overview Biophysics Home

Protein Folding & Misfolding

Learning Objectives

1. Introduction — Anfinsen's Thermodynamic Hypothesis

Quantifying the Funnel

4. Derivation 3: Two-State Folding Kinetics

Equilibrium Thermodynamics

Linear Free Energy Relationships and Denaturant Dependence

Folding and Unfolding Rate Constants

Derivation of the Chevron Plot

Shape of the Chevron Plot

$\phi$-Value Analysis

5. Derivation 4: Helix-Coil Transition (Zimm-Bragg Model)

Model Parameters

Transfer Matrix Method

Partition Function

Eigenvalue Solution

Fraction Helix

Sharpness of the Transition

6. Applications

AlphaFold & Structure Prediction

Amyloid Diseases

Molecular Chaperones

Drug Design Targeting Misfolding

7. Historical Context

Related Video Lectures

8. Python Simulation

Protein Folding: Free Energy Landscape & Kinetics

Summary of Key Equations

Practice Problems

Solution:

Solution:

Solution:

Solution:

Solution: