Therapeutic Antibody Developability Analysis (TA-DA) is a computational model designed to predict the developability of therapeutic antibodies during early-stage drug discovery. It evaluates sequence- and structure-based features to distinguish clinical-stage antibodies from non-clinical repertoire antibodies, streamlining the selection of candidates with optimal drug-like properties .
TA-DA was developed by analyzing 144 descriptors across 213 clinical antibodies and 1,000 repertoire antibodies. Five orthogonal descriptors were integrated into a logistic regression model :
Key Descriptors in TA-DA:
Structural Aggregation Propensity (SAP): Predicts aggregation-prone regions.
Hydrophobic Patch Analysis: Identifies surface-exposed hydrophobic residues.
Charge Symmetry: Evaluates charge distribution across variable domains.
Complementarity-Determining Region (CDR) Length: Ensures CDR lengths align with clinical norms.
Framework Stability: Assesses thermodynamic stability of antibody frameworks.
These descriptors collectively achieved an AUC = 0.8 in separating clinical from non-clinical antibodies on a hold-out test set .
| Metric | TA-DA Score | Competing Models (e.g., TAP) |
|---|---|---|
| AUC (Clinical vs. Repertoire) | 0.8 | 0.6–0.7 |
| Bispecific Antibody Accuracy | 80% | Not reported |
| Computational Speed | Minutes | Hours |
TA-DA outperformed existing tools like the Therapeutic Antibody Profiler (TAP) in identifying developability risks, particularly in bispecific antibodies .
Mavrilimumab, a clinical antibody for rheumatoid arthritis, scored 0.26 on TA-DA due to aggregation-prone hotspots in its framework. This outlier case highlighted TA-DA’s ability to flag antibodies requiring formulation optimization .
| Feature | TA-DA | TAP |
|---|---|---|
| Descriptor Scope | Sequence + Structure | CDR-focused |
| Clinical Validation | 213 antibodies | Limited dataset |
| Bispecific Evaluation | Supported | Not supported |
| Off-Target Prediction | Yes (aggregation, charge) | No |
TA-DA’s holistic approach addresses limitations of earlier models by incorporating framework stability and global charge symmetry .
Lead Candidate Selection: Prioritizes antibodies with low SAP scores and optimal charge symmetry .
Bispecific Optimization: Validated on 25 bispecific antibodies, with 80% receiving high TA-DA scores (>0.5) .
Formulation Guidance: Flags antibodies requiring excipient optimization (e.g., mavrilimumab) .
Scope: TA-DA focuses on variable regions; constant domains and manufacturing factors are not modeled .
Dynamic Range: Scores >0.5 indicate clinical potential, but thresholds may vary by target .
Integration: Future iterations could incorporate machine learning for RNA- or DNA-level off-target predictions .
TA-DA stands for Therapeutic Antibody Developability Analysis, a computational model designed to simplify antibody developability assessment and enable accelerated early-stage screening. This tool quantifies the ability of sequence- and structure-based descriptors to differentiate clinical antibodies from antibodies in the human repertoire. Through rigorous analysis, researchers identified 144 descriptors capable of distinguishing clinical from repertoire antibodies. Five key descriptors were ultimately selected and combined based on performance and orthogonality to create the final TA-DA model .
Despite substantial improvements in antibody discovery approaches, advancing antibodies into the clinic remains challenging because therapeutic developability concerns are difficult to predict. TA-DA was developed specifically to address this challenge by providing a computational approach to predict antibody developability characteristics early in the discovery process. This allows researchers to identify potential issues before investing significant resources in candidates with poor developability profiles .
The TA-DA model employs five carefully selected descriptors that were chosen based on their performance and orthogonality. Four of these descriptors depend on predicted antibody structure, while one descriptor, All_Atomic_Contact_Energy, evaluates the entire variable domain. These descriptors focus on different aspects of antibody structure including the framework region and light chain CDRs, providing a comprehensive assessment of developability potential .
TA-DA builds upon other tools like the Therapeutic Antibody Profiler (TAP). While TAP primarily evaluates complementary-determining regions (CDRs) through analysis of length, hydrophobic patches, and charge characteristics, TA-DA incorporates a broader range of descriptors that evaluate the entire variable domain. In comparative analyses, TA-DA demonstrated an AUC of 0.8 on a hold-out test set, indicating strong performance in distinguishing clinical from repertoire antibodies .
The TA-DA model employs a logistic-regression approach trained on a dataset of clinical antibodies and repertoire antibodies. For validation, researchers created a test set of 20 clinical antibodies and 20 repertoire antibodies withheld from the training data. Performance was evaluated using Area Under the Curve (AUC) metrics, with the model achieving an AUC of 0.8 on this hold-out test set. The five descriptors in the final model were selected from a larger set of 144 descriptors based on their performance and orthogonality .
Researchers can implement TA-DA as an early-stage computational screening tool to prioritize antibody candidates before investing in extensive experimental characterization. The tool requires only variable region sequences of candidate antibodies and provides results rapidly, requiring only minutes on a single CPU. Antibodies scoring above 0.5 display characteristics similar to clinical antibodies and may be prioritized for further development, while those with lower scores may warrant additional scrutiny or engineering before advancing .
Beyond traditional monoclonal antibodies, TA-DA was tested on bispecific antibodies not included in the original training data. Over 80% of these bispecific antibodies received high TA-DA scores (values > 0.5), providing strong validation for the algorithm's ability to identify developability characteristics across different antibody formats. This success with bispecific antibodies suggests TA-DA can be applied to diverse antibody-based therapeutic modalities .
While TA-DA provides computational predictions, experimental validation remains essential. Based on the literature, recommended experimental approaches include:
| Developability Concern | Recommended Assay | Key Parameter Measured |
|---|---|---|
| Aggregation propensity | Size-Exclusion Chromatography (SEC) | Monomer percentage |
| Self-interaction tendency | Affinity-Capture Self-Interaction Nanoparticle Spectroscopy (AC-SINS) | Self-association propensity |
| Structural stability | Differential Scanning Calorimetry | Thermal transition midpoint |
| Charge distribution | Isoelectric focusing | Isoelectric point and charge heterogeneity |
These experimental data should be compared with TA-DA predictions to build confidence in the computational assessments .
Despite its utility, TA-DA has several acknowledged limitations. The model does not consider chemical modifications that can occur during production, storage, or in vivo circulation, which can affect antigen binding, immunogenicity, and product homogeneity. Additionally, perfect separation of clinical antibodies from repertoire antibodies based solely on the variable region is unlikely due to differences in constant domains, formulation buffer, and manufacturing protocols. Therefore, TA-DA should be used as a guideline for evaluating lead candidates and not as a strict rule or replacement for experimental data .
When analyzing TA-DA results, outliers warrant special attention. For example, mavrilimumab, a clinical antibody in the test set, received an unusually low TA-DA score of 0.26. This low score was driven by predicted aggregation-prone hotspots in the framework region of the antibody. Such outliers may indicate specific developability concerns that could require targeted engineering approaches or alternative formulation strategies. Researchers should examine the individual descriptor contributions to understand the specific factors driving low scores .
As our understanding of antibody chemical liabilities improves, future versions of TA-DA may incorporate predictions related to antibody chemical modifications. Additionally, advancements in structural prediction algorithms, particularly for challenging regions like the H3 loop, could further enhance TA-DA accuracy. The approach demonstrated with TA-DA could also potentially expand to include additional descriptors that capture other aspects of antibody developability not currently addressed in the model .
When contradictions arise between TA-DA predictions and experimental results, researchers should consider:
Limitations in the structural prediction model used for TA-DA's structure-based descriptors
Potential novel features in the candidate antibody not well-represented in TA-DA's training data
The influence of experimental conditions that may not reflect the predictive model's assumptions
The need for additional experimental characterization focusing on specific developability concerns
Generally, experimental data should take precedence, but TA-DA predictions might highlight subtle issues that could emerge during scale-up or long-term storage .