Ranomics
Scientific research and computational biology
developabilityantibody engineeringprotein designRFdiffusionBindCraftCMC

5 Developability Red Flags That Kill mAb Programs

A failure in developability late in CMC is the most expensive kind of failure a monoclonal antibody program can suffer. Discovery has spent two years reaching a candidate with sub-nanomolar affinity, cell-based potency, and clean animal PK. The formulation team then reports viscosity at 150 mg/mL above pumpability, or a monomer fraction that collapses after four weeks at 40 C, or a mass shift in stability studies that traces back to a single Asn in the heavy chain CDR2. The program stalls, the scientists go back to re-engineer, and a year of clinical timing quietly disappears.

The inconvenient truth surfaced by the industry-wide Jain et al. 2017 panel of 137 clinical-stage antibodies (Proc. Natl. Acad. Sci. USA, 114:944 to 949) is that poor developability correlates visibly with the stage at which an antibody stops advancing. Many of those liabilities would have been visible from sequence before any wet-lab work. This post covers the five specific red flags that most reliably kill programs, how each one actually manifests, and which of them are detectable before synthesis.

Red Flag 1: Deamidation Hotspots in CDRs (NG, NS, NT, NH)

Asparagine deamidates to aspartate or isoaspartate via a succinimide intermediate. The reaction is strongly dependent on the residue that follows the Asn. Asn followed by Gly is the textbook hotspot, with a half-life measured in days at physiological pH and temperature. Asn followed by Ser, Thr, or His is slower but still clinically relevant over months in storage. Once a CDR Asn has deamidated, the charge, local backbone geometry, and sometimes the potency of the antibody shift. You end up selling a heterogeneous product whose potency drifts over shelf life.

The clinical picture is well documented. The Vlasak et al. 2009 work on deamidation in Herceptin identified a CDR Asn that accumulated iso-Asp during storage and required careful formulation control. Several IgG4 programs have pulled back on CDR Asn residues after observing charge heterogeneity in capillary isoelectric focusing at release. Deamidation is one of the most detectable liabilities. Any sequence-scanning tool can find NG, NS, NT, and NH motifs in the six CDRs in seconds.

Detectability before synthesis: very high. Pure sequence motif search.

Red Flag 2: Asp Isomerization Sites (DG, DS, DH in CDRs)

Aspartate isomerization runs through the same succinimide mechanism as Asn deamidation, converting Asp to iso-Asp in the backbone. The canonical motif is DG, with DS and DH also observed. The consequence is a backbone rearrangement that often sits directly inside the paratope. For an antibody that depends on a specific CDR conformation to contact its antigen, isomerization of a single Asp can drop affinity by an order of magnitude.

The Cacia et al. 1996 study on a recombinant antibody (Biochemistry, 35:1897 to 1903) showed that isomerization of a single CDR Asp produced a clinically relevant loss of potency during storage. This is one of the liabilities that converts a nanomolar binder into a double-digit nanomolar binder over six months on a pharmacy shelf, which is a regulatory problem and a commercial one.

Detectability before synthesis: very high. Same scan as Red Flag 1, different motif.

Red Flag 3: N-Linked Glycosylation Sequons in Variable Regions (N-X-S/T)

The N-X-S/T sequon (where X is any residue except Pro) is the site of N-linked glycosylation in eukaryotic expression systems. In the Fc, this sequon is expected and functional. In the variable region, it is almost always unwanted. A glycan on the V region introduces size and charge heterogeneity, often reduces affinity (the glycan physically sits in or near the paratope), complicates CMC analytics (you now have a distribution of glycoforms where you wanted a single species), and can create immunogenicity concerns if the glycan is non-human.

In mammalian expression, V-region sequons are filled at variable occupancy, which means the released product is a mixture of glycosylated and unglycosylated species with different activity profiles. This is a known failure mode in antibodies derived from yeast display campaigns run without a post-campaign sequon scrub, and in de novo designed binders, where the sequon can appear purely by chance during ProteinMPNN sequence design.

Detectability before synthesis: very high. Simple regex for N-X-S/T in the variable region.

Red Flag 4: Exposed Hydrophobic Patches

Hydrophobic surface patches drive two developability failures at once: non-specific binding to polyspecific reagents like cardiolipin, baculovirus particles, or membrane preparations, and aggregation in concentrated formulation. Polyspecificity is measured experimentally by the polyspecificity reagent (PSR) assay described by Xu et al. 2013 (Protein Eng. Des. Sel., 26:663 to 670) and by hydrophobic interaction chromatography (HIC) retention time. Both correlate with exposed hydrophobic surface area computed from a homology or predicted structure.

The Jain 2017 panel showed that clinical-stage antibodies with late-stage developability issues tended to cluster at the high end of HIC retention and PSR signal. Molecules that progressed cleanly tended to sit in the middle of the distribution. The mechanism is not surprising. Every time you concentrate to 150 mg/mL for subcutaneous delivery, hydrophobic patches on adjacent molecules find each other, and the result is reversible self-association, elevated viscosity, and elevated aggregation on long-term storage.

This is a structural property, not a sequence one. Catching it before synthesis requires a model of the antibody (a reasonable-quality homology model from ABodyBuilder, IgFold, or AlphaFold 3 is usually sufficient) and a solvent-exposed hydrophobic surface calculation. You can get a rough first pass from sequence by counting the fraction of CDR residues that are hydrophobic, but the reliable answer is the structural one.

Detectability before synthesis: moderate. Needs a structure model, not just sequence.

Red Flag 5: Aggregation-Prone Regions (High APR Score)

Aggregation-prone regions are stretches of sequence that are thermodynamically predisposed to assemble into cross-beta amyloid-like fibers or amorphous aggregates. The scoring approaches go back to TANGO and Aggrescan (Conchillo-Sole et al. 2007, BMC Bioinformatics, 8:65) and have been refined in SAP, CamSol, and SolubiS. All of them essentially ask: is this stretch of hydrophobic residues sitting in a context where it is solvent-exposed and flanked by the right charge and secondary structure to nucleate aggregation.

High APR scores correlate with the antibodies that quietly fail at phase 1 for reasons the clinical report describes as “manufacturing issues.” The candidate expresses at reasonable titers, binds its target, works in animals, and then fails a stability study at the 12-month mark because the aggregate fraction crossed the specification. These failures are under-reported in the literature because programs do not publish failures at that stage, but they are common in industry experience and they have essentially the same signature every time: a CDR or framework loop with a run of hydrophobic residues that an APR calculator flagged early, and that was dismissed because it did not affect initial potency.

Detectability before synthesis: high for sequence-based APR (TANGO, Aggrescan, CamSol profile), moderate for structure-aware APR (SAP, spatial clustering of hydrophobicity on the modeled structure).

How Much of This Is Catchable From Sequence Alone

Three of the five red flags above (deamidation, isomerization, V-region sequons) are pure sequence-level scans. They take milliseconds per antibody and they have no false negatives for the motif itself, though real reaction rates depend on local solvent exposure and flexibility. The remaining two (exposed hydrophobic patches, aggregation-prone regions) benefit from a structural model but can be approximated from sequence.

This means that for the cost of a model-building step you do not have to run (IgFold and ABodyBuilder2 take seconds per antibody, and AlphaFold 3 has made single-chain structure prediction close to free), the majority of the most expensive late-stage developability failures are visible before you order a single gene. The actual cost of this triage is not compute time. It is building the pipeline, picking defensible thresholds, and making the output readable to a scientist who is going to make a go or no-go call.

This is why we built Developability Scout. It is a free tool that takes an antibody sequence and runs the five-red-flag scan plus the structure-aware hydrophobic and APR analysis in a single pass, with a scored report. It is designed for the stage after discovery has a hit list and before the synthesis order goes in.

Where AI-Designed Binders Typically Land on These Dimensions

One practical note for teams running de novo or semi-de-novo antibody and nanobody campaigns with RFdiffusion, BindCraft, or ProteinMPNN. Those algorithms are optimized for different objectives than developability. RFdiffusion optimizes a diffusion objective that produces plausible backbones, BindCraft optimizes for AlphaFold-predicted interface metrics, and ProteinMPNN optimizes sequence recovery on training structures. None of these three loss functions penalize deamidation motifs, variable-region sequons, exposed hydrophobic paratopes, or APR hotspots.

Empirically, raw generative outputs tend to ship a meaningful fraction of sequences with at least one of the five red flags. Paratope-exposed hydrophobics are especially common because the training signal pushes toward interfaces with strong VDW contacts, which means burying a Phe, Trp, or Leu into the target surface with part of it facing solvent. Running the same five-red-flag scan on generative outputs before synthesis is the cheapest developability filter you can apply to a design campaign.

A Practical Workflow for the Triage

The minimal workflow that catches these five red flags before wet-lab commitment, for either discovery-stage hits or de novo designs:

  1. Run a sequence-level scan for NG/NS/NT/NH (deamidation), DG/DS/DH (isomerization), and N-X-S/T (V-region sequons). Flag everything in a CDR.
  2. Fold the antibody (IgFold, ABodyBuilder2, or AlphaFold 3). Compute solvent-exposed hydrophobic surface area across the V region.
  3. Run an APR score (Aggrescan, CamSol profile, or SAP) on the folded model.
  4. Threshold and triage. For discovery-stage hits, decide which liabilities you can fix with point mutations and which require re-discovery. For de novo designs, filter out the worst offenders and regenerate if the pool is too depleted.
  5. Advance the survivors to synthesis.

For teams without an in-house developability pipeline, Developability Scout runs steps 1 through 3 automatically and produces a per-antibody scorecard. It is free, browser-based, and does not require installing anything.

Where This Fits in a Campaign

Catching these red flags before synthesis is the highest-leverage developability investment a discovery or design team can make. It does not substitute for a real CMC team, and it does not replace experimental confirmation by DSF, DLS, HIC, SEC, or stability studies. What it does is prevent the most common preventable failures from consuming six months of clinical timing at the far more expensive end of the pipeline.

The teams that run this kind of triage at every stage (discovery hits, affinity-matured leads, de novo designs, humanized candidates) rarely ship liabilities downstream. The teams that skip it rediscover the same five red flags in CMC at ten times the cost.

If you want the full de novo design service with developability filtering baked into the pipeline by default, that is what the AI Binder Sprint is scoped to deliver. If you have an existing hit list or a set of de novo designs and want to do the triage yourself, Developability Scout is the free tool for that. Either path is better than finding the problems in a phase 1 stability study.

  • Developability Scout: Free sequence and structural scan for the five common antibody developability red flags.
  • AI Binder Sprint: Full de novo design program with developability filtering integrated into the pipeline.
Share

Ready to start a project?

Tell us about your protein engineering challenge. We will scope a program and get back to you within 24 hours.

Start a project →