Ranomics
Scientific research and computational biology
deep mutational scanningantibody engineeringaffinity maturationDMS

Deep Mutational Scanning for Antibody Affinity Maturation

Affinity maturation moves an antibody from a discovery hit (typically 10–100 nM) to a clinical lead (typically sub-nanomolar). The historical methods — error-prone PCR, CDR walking, parsimonious mutagenesis — are productive but inefficient. They search sequence space by random sampling and selection, which means most cycles spend most of their budget on neutral or deleterious mutations. The yield is measured in rounds, not in mutations per round.

Deep mutational scanning replaces the random walk with a systematic map. Every single-amino-acid substitution across the CDRs is measured for its effect on binding, expressed, displayed, or any other phenotype that can be linked to a sequencing readout. The output is a fitness landscape that tells you, position by position, which substitutions improve affinity, which are neutral, and which break the antibody. The next-round library is built from validated information, not random sampling.

This article is a practical guide to running DMS for affinity maturation: how to design the library, how to read it out, how to use the landscape, and how to respect the developability constraint that random affinity maturation tends to ignore.

The affinity-maturation problem

A discovery campaign delivers a hit. Maybe it came from yeast display, maybe from a transgenic mouse, maybe from AI design followed by experimental validation. The hit binds in the high-nanomolar range with acceptable specificity, and the program needs sub-nanomolar affinity for clinical relevance.

Traditional affinity maturation has three problems.

Most mutations don’t help. Across a typical CDR, ~80% of substitutions are neutral or deleterious. Random libraries spend ~80% of their diversity on unproductive mutants and ~20% on the candidates worth screening. Throughput screening can absorb this inefficiency, but it taxes the budget linearly.

Epistatic combinations are hard to find. Two mutations that improve affinity by 2× each may improve it by 3× or by 10× together. Or one may erase the other. Random sampling at fixed mutation rates rarely covers the combinatorial space densely enough to discover positive epistasis. The “ceiling” hit rate for traditional methods is set by what single mutations alone can deliver.

Developability isn’t selected. Affinity is one fitness dimension. Aggregation propensity, stability, solubility, hydrophobic patch area, charge distribution are others. Pure affinity selection routinely drives the lead toward higher-affinity-but-less-developable space. The downstream cost (reformatting, failed expression, polyspecificity) is invisible at the selection step.

DMS principles for antibodies

A DMS experiment for an antibody works by:

  1. Constructing a library that systematically introduces every single-amino-acid substitution across the residues of interest (CDR positions, framework hotspots, or whole-domain coverage).
  2. Coupling each variant to a measurable phenotype — for antibodies, this is almost always display level + antigen binding on yeast or mammalian display.
  3. Reading out the population by NGS before and after a selection or sort, then computing the enrichment factor for each variant.
  4. Mapping enrichment back to position-and-substitution to produce a per-residue, per-amino-acid fitness score.

We covered the general DMS workflow in deep mutational scanning fitness landscapes. For antibody affinity maturation specifically, three design choices matter.

Single mutants vs combinatorial. Single-mutant DMS (200 variants per 10-residue CDR) is the minimum viable design. It reveals which positions are tolerant and which are forbidden, and it identifies the top single substitutions per position. Combinatorial DMS (pairs or triples of mutations) reveals epistatic interactions but costs an order of magnitude more in library size. Run single mutants first; build a focused combinatorial library from the top single-mutant hits.

Yeast vs mammalian display readout. Yeast display is the default — fast, quantitative, normalized per-cell. Mammalian display is the right choice when developability is the bottleneck and the affinity-maturation library must respect glycosylation and complex disulfides. The two-platform approach — yeast for affinity, mammalian for developability — generalizes to DMS-driven maturation.

Sort strategy. Affinity selection at decreasing antigen concentrations (10 nM → 1 nM → 100 pM across three sort rounds) generates a per-variant enrichment trace. Variants that enrich at low concentration are higher-affinity; variants that drop out are lower. The slope of the enrichment trace, not the round-3 endpoint alone, gives the most reliable affinity rank.

Library design — saturation per CDR vs combinatorial

For most antibody affinity maturation campaigns, we run the following ladder:

Phase 1 — Single-mutant scan, CDR-H3. 200 variants for a 10-residue CDR-H3, plus 200 for CDR-H2, plus 100 for CDR-H1 (typically shorter). Sequenced before and after a single-round sort at the discovery-hit Kd. Output: position-by-position fitness map for the heavy chain.

Phase 2 — Single-mutant scan, light chain. If the heavy-chain scan didn’t deliver enough affinity gain, scan CDR-L1 and CDR-L3. Light-chain contributions are smaller on average but non-zero, and the position-by-position map reveals whether the light chain has unexploited room.

Phase 3 — Combinatorial top hits. Take the top 5–10 single mutants per CDR. Build a combinatorial library of all pairwise combinations (~25–100 doubles) plus triples of the best singles (~50–200 triples). Sort under tightened stringency. The output is the candidate lead.

Each phase costs ~$5–15K in synthesis (Twist oligo pools) plus 2–4 weeks of yeast display and NGS. Total campaign cost is dominated by sorting and sequencing reagents, not by synthesis.

Readout — yeast/mammalian display plus NGS

The NGS readout for antibody DMS is delicate because the variant counts are small (200–1,000 per library) and the read depth needs to cover the rarest variant with statistical confidence. The standard:

  • Pre-sort sequencing: 500K–1M reads. Establishes the library composition. Confirms each variant is present at ≥100 reads.
  • Post-sort sequencing: 1–5M reads. Higher depth because the population has been compressed and the variants of interest are now over-represented.
  • Enrichment calculation: log2(post-sort frequency / pre-sort frequency). Variants with log2 enrichment > +1 are candidate gain-of-function mutants. Variants with log2 enrichment < −1 are candidate loss-of-function mutants. Variants between are neutral.

Software: enrich2, DiMSum, or a custom pipeline. For most campaigns, enrich2 is fine; the bookkeeping (read pairing, variant calling, statistical filtering) is what matters more than the algorithm choice.

For depth math and the relationship between library diversity, read count, and statistical confidence, see calculating library diversity with NGS.

From fitness landscape to optimized clones

The fitness landscape — a heatmap of position × substitution colored by enrichment score — is the deliverable that informs the next-round library. Three patterns to look for:

Conserved positions. Residues where only the wild-type tolerates substitution. These are typically structural — they form the CDR backbone or pack against framework residues. Don’t waste combinatorial budget here.

Tolerant positions. Residues where most substitutions are accepted. These positions have engineering room: pick substitutions that improve affinity (the top of the heatmap column) and that also satisfy other constraints (developability, immunogenicity).

Beneficial substitutions. Specific (position, substitution) pairs with enrichment >+2. These are the candidate single-mutant gain-of-function hits. The top 5–10 per CDR feed the combinatorial library.

The combinatorial library is built by combining beneficial single mutants. Positive epistasis (combined effect greater than the sum of singles) is more common between mutations at positions that are spatially proximal in the CDR loop; negative epistasis is more common between distant mutations. Both happen; both are useful information.

After the combinatorial sort, the top 10–50 clones are reformatted to Fab or IgG, expressed, and characterized by SPR or BLI for absolute Kd. The conversion rate from “top NGS hit” to “validated high-affinity lead” in a well-designed DMS campaign is typically 60–85% — substantially higher than from a random-walk library, where conversion runs 10–30%.

Developability constraint

Affinity selection alone optimizes for affinity. The DMS framework lets you add other selection axes without redesigning the experiment.

Co-selection for stability. Sort the post-affinity-selection library a second time on display level alone (without antigen). Variants that display well after antigen selection are stable AND high-affinity. Variants that display poorly are high-affinity but biophysically compromised.

Co-selection for low polyspecificity. Sort against a polyspecificity reagent (PSR) as a counter-screen. Variants that bind PSR drop out; variants that don’t survive.

Co-selection for thermal stability. Heat-treat the displayed library before antigen binding. Variants that retain display after heat are thermostable; the rest enrich poorly even if their wild-type affinity was higher.

These co-selections double the experimental burden but deliver a candidate lead that’s already past the developability triage step. For programs heading to therapeutic development, the upstream cost is repaid 5–10× in saved downstream re-engineering.

When DMS isn’t the right tool

DMS is a powerful affinity-maturation method, not a universal one. It struggles when:

  • The starting affinity is too weak. If the discovery hit binds at Kd > 1 µM, the single-mutant landscape is dominated by noise — too few variants register signal above background. Improve affinity to below 500 nM by other methods first, then run DMS.
  • The library can’t be displayed. Membrane proteins, very large constructs, post-translationally modified targets that yeast can’t make — these block display-based readouts. Alternative readouts (mammalian display, mRNA display) work but cost more.
  • Combinatorial optimization needs more than 3–4 simultaneous mutations. DMS scales to pairs and triples; quadruples and higher need different methods (directed evolution, ML-guided sequence sampling).

For the cases where DMS does fit — most therapeutic antibody affinity maturation programs — it is the most efficient way we know to move from a discovery hit to a clinical-grade lead.

Decision summary

If you have a discovery hit at 10–500 nM and need to reach sub-nanomolar: DMS-guided maturation is the right starting point. Single-mutant scan on CDR-H3 first, then expand based on what the landscape shows.

If your hit is at sub-nanomolar already and you need to improve developability: skip the affinity scan and run a stability-selected DMS instead. Same library design, different selection axis.

If your starting material is a library of candidates from AI design: DMS is the natural complement — AI generates the diversity, DMS measures it.


Ranomics designs and runs DMS campaigns for antibody affinity maturation end-to-end. If you’re scoping a program and want help designing the library and selection strategy, see our affinity maturation services, deep mutational scanning services, or reach out via the contact page.

Share

Ready to design your binder?

Single target on a fixed scope, start a Binder Pilot. Multi-target program or pharma-grade timeline, discuss a Custom Campaign.

Not sure which fits? Answer five quick questions for a recommendation.