ConTact: Contact-First Antibody CDR Design via Explicit Interface Reasoning
Pith reviewed 2026-05-22 09:26 UTC · model grok-4.3
The pith
ConTact separates contact prediction from amino acid choice to improve antibody CDR designs.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
ConTact decomposes CDR design into learning surface complementarity fingerprints, predicting CDR-antigen contacts, and injecting contact-gated antigen features into the sequence head, using distance-biased cross-attention and a contact-weighted loss to concentrate learning on binding positions.
What carries the argument
Contact-then-act architecture that first predicts CDR-antigen contacts and then gates antigen features by those contacts before sequence generation.
If this is right
- Designed CDRs achieve lower RMSD to native structures than prior baselines.
- Models show stronger identification of true epitope residues on the antigen.
- Amino acid recovery rates remain competitive while structural and interface metrics rise.
Where Pith is reading between the lines
- The staged approach may transfer to other protein-protein interface design tasks where contact positions are unknown in advance.
- Explicit contact maps could support downstream steps such as affinity maturation or specificity tuning.
- Errors in early contact prediction could be diagnosed and corrected independently of the final sequence output.
Load-bearing premise
Accurate contact predictions must feed forward without introducing errors that degrade sequence design, and the training data distribution must match real antigen-CDR interfaces.
What would settle it
A controlled test that replaces the learned contact predictor with random or oracle contacts and measures whether sequence recovery or structural quality improves, stays the same, or drops.
Figures
read the original abstract
Computational antibody CDR design methods condition on antigen structure to generate binding loops, yet existing architectures conflate two fundamentally distinct sub-problems: identifying which CDR positions will contact the antigen, and selecting amino acids at those positions. This conflation forces models to learn contact reasoning implicitly through uniform message passing, diluting antigen signal across all positions equally. We introduce ConTact, a contact-then-act architecture that explicitly decomposes CDR design into three cascaded stages: learning surface complementarity fingerprints, predicting CDR-antigen contacts, and injecting contact-gated antigen features into the sequence head. A distance-biased cross-attention module encodes geometric priors favoring spatial neighbors, while a contact-weighted cross-entropy loss concentrates gradient signal on binding-critical positions. On CHIMERA-Bench dataset, ConTact achieves the best structural quality (7% RMSD improvement over the next-best baseline), best epitope awareness (10% F1 score over GNN baselines), and competitive sequence recovery (AAR 0.38) among several CDR-H3 design baselines.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces ConTact, a contact-then-act architecture for antibody CDR design that explicitly decomposes the task into learning surface complementarity fingerprints, predicting CDR-antigen contacts, and injecting contact-gated antigen features into a sequence design head. It incorporates a distance-biased cross-attention module to encode geometric priors and a contact-weighted cross-entropy loss to focus gradients on binding positions. On the CHIMERA-Bench dataset, the method reports the best structural quality (7% RMSD improvement over the next-best baseline), best epitope awareness (10% F1 improvement over GNN baselines), and competitive sequence recovery (AAR of 0.38) among CDR-H3 design baselines.
Significance. If the performance gains are shown to arise from the explicit contact decomposition rather than other factors, the work could meaningfully advance computational antibody design by improving interpretability and epitope awareness. The explicit separation of contact reasoning from sequence selection, combined with geometric priors in attention, represents a clear methodological contribution. The introduction of a dedicated benchmark and the reported empirical results provide a useful reference point for the field.
major comments (3)
- [§4.2] §4.2 (CHIMERA-Bench results): The reported 7% RMSD improvement and 10% F1 gain are presented without error bars, standard deviations across runs, dataset split details, or statistical significance tests. This information is load-bearing for the central claim of superiority, as it is needed to rule out variance or implementation-specific effects.
- [Methods] Methods, contact prediction stage: No standalone metrics (precision, recall, or per-residue accuracy) are reported for the binary contact map predictions. Because the central claim attributes downstream gains to accurate contacts feeding the gated cross-attention, the absence of these metrics leaves open the possibility that noisy contacts degrade rather than improve sequence design quality.
- [§4.3] §4.3 (Ablation studies): No ablation is described that severs or randomizes the contact gate while retaining the distance-biased attention and complementarity fingerprints. Without this isolation, it cannot be confirmed that the explicit decomposition, rather than other architectural components, drives the observed RMSD and F1 improvements.
minor comments (2)
- [Abstract] The abstract refers to 'several CDR-H3 design baselines' without naming them; explicitly listing the compared methods would aid immediate understanding of the competitive context.
- [Methods] The exact formulation of the complementarity fingerprints and how they are computed from surface features could be formalized with an equation in the methods for improved reproducibility.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed comments. We have carefully reviewed each major point and provide point-by-point responses below. We agree that the suggested additions will strengthen the empirical support for our claims and plan to incorporate them in the revised manuscript.
read point-by-point responses
-
Referee: [§4.2] §4.2 (CHIMERA-Bench results): The reported 7% RMSD improvement and 10% F1 gain are presented without error bars, standard deviations across runs, dataset split details, or statistical significance tests. This information is load-bearing for the central claim of superiority, as it is needed to rule out variance or implementation-specific effects.
Authors: We agree that variability measures and statistical tests are necessary to substantiate the reported gains. In the revision we will rerun all experiments with five independent random seeds, report mean and standard deviation for RMSD and F1 scores, provide explicit details on the CHIMERA-Bench train/validation/test splits, and include paired statistical significance tests (e.g., Wilcoxon signed-rank) against the strongest baselines. These results will be added to §4.2 and the associated tables. revision: yes
-
Referee: [Methods] Methods, contact prediction stage: No standalone metrics (precision, recall, or per-residue accuracy) are reported for the binary contact map predictions. Because the central claim attributes downstream gains to accurate contacts feeding the gated cross-attention, the absence of these metrics leaves open the possibility that noisy contacts degrade rather than improve sequence design quality.
Authors: We acknowledge the value of reporting standalone contact-prediction performance to support the claim that the contact stage contributes positively. In the revised manuscript we will add a dedicated paragraph and table in the Methods section (or as supplementary material) that reports precision, recall, F1, and per-residue accuracy of the contact prediction module on the validation set. revision: yes
-
Referee: [§4.3] §4.3 (Ablation studies): No ablation is described that severs or randomizes the contact gate while retaining the distance-biased attention and complementarity fingerprints. Without this isolation, it cannot be confirmed that the explicit decomposition, rather than other architectural components, drives the observed RMSD and F1 improvements.
Authors: We agree that an ablation isolating the contact gate is required to confirm its contribution. We have performed an additional experiment that replaces the predicted contact map with random contacts while retaining the distance-biased attention and surface-fingerprint modules; the resulting drop in RMSD and F1 will be reported in the revised §4.3 together with a brief discussion of the findings. revision: yes
Circularity Check
No circularity: empirical benchmark results on external dataset
full rationale
The paper introduces a neural architecture that decomposes CDR design into contact prediction followed by gated sequence generation, using standard components such as distance-biased cross-attention and a contact-weighted loss. All performance claims (RMSD improvement, F1 score, AAR) are presented strictly as measured outcomes on the held-out CHIMERA-Bench dataset against external baselines. No equations, self-definitional reductions, fitted-input-as-prediction steps, or load-bearing self-citations appear in the abstract or described method; the central claims remain independent of the inputs by construction and rest on falsifiable external evaluation.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Mansoor Ahmed and Nadeem Taj and Imdad Ullah Khan and Hemanth Venkateswara and Murray Patterson , booktitle=. 2026 , url=
work page 2026
-
[2]
Chen, Xingyao and Dougherty, Thomas and Hong, Chan and Schibler, Rachel and Zhao, Yi and Sadeghi, Reza and Matasci, Naim and Wu, Yi-Chieh and Kerman, Ian , year =. biorxiv , title =
-
[3]
Ye, Chao and Hu, Wenxing and Gaeta, Bruno. Prediction of Antibody-Antigen Binding via Machine Learning: Development of Data Sets and Evaluation of Methods. JMIR Bioinform Biotech. 2022. doi:10.2196/29404
-
[4]
and Friedensohn, Simon and Weber, C
Mason, Derek M. and Friedensohn, Simon and Weber, C. Optimization of therapeutic antibodies by predicting antigen specificity from antibody sequence via deep learning , journal=. 2021 , month=. doi:10.1038/s41551-021-00699-9 , url=
-
[5]
Discovery-stage identification of drug-like antibodies using emerging experimental and computational methods , author=. MAbs , volume=. 2021 , organization=
work page 2021
-
[6]
Antibody stability: A key to performance - Analysis, influences and improvement , journal =
Hui Ma and Ciarán Ó’Fágáin and Richard O’Kennedy , keywords =. Antibody stability: A key to performance - Analysis, influences and improvement , journal =. 2020 , issn =. doi:https://doi.org/10.1016/j.biochi.2020.08.019 , url =
-
[7]
doi:https://doi.org/10.1111/cbdd.13388 , url =
Muhammed, Muhammed Tilahun and Aki-Yalcin, Esin , title =. doi:https://doi.org/10.1111/cbdd.13388 , url =. https://onlinelibrary.wiley.com/doi/pdf/10.1111/cbdd.13388 , year =
-
[8]
Journal of Biomedical Science , year=
Lu, Ruei-Min and Hwang, Yu-Chyi and Liu, I-Ju and Lee, Chi-Chiu and Tsai, Han-Zen and Li, Hsin-Jung and Wu, Han-Chung , title=. Journal of Biomedical Science , year=. doi:10.1186/s12929-019-0592-z , url=
-
[9]
Grange, R. D. and Thompson, J. P. and Lambert, D. G. , title = ". BJA: British Journal of Anaesthesia , volume =. 2014 , month =. doi:10.1093/bja/aet293 , url =
-
[10]
Frontiers in Immunology , VOLUME=
Huang, Yan and Zhang, Ziding and Zhou, Yuan , TITLE=. Frontiers in Immunology , VOLUME=. 2022 , URL=. doi:10.3389/fimmu.2022.1053617 , ISSN=
-
[11]
Assisted Design of Antibody and Protein Therapeutics (ADAPT) , year =. PLOS ONE , publisher =. doi:10.1371/journal.pone.0181490 , author =
-
[12]
Pires, Douglas E.V. and Ascher, David B. , title = ". Nucleic Acids Research , volume =. 2016 , month =. doi:10.1093/nar/gkw458 , url =
-
[13]
Khamis and Walid Gomaa and Walaa F
Mohamed A. Khamis and Walid Gomaa and Walaa F. Ahmed , keywords =. Machine learning in computational docking , journal =. 2015 , issn =. doi:https://doi.org/10.1016/j.artmed.2015.02.002 , url =
-
[14]
and Andersen, Jan Terje and Greiff, Victor , title=
Akbar, Rahmad and Bashour, Habib and Rawat, Puneet and Robert, Philippe A. and Andersen, Jan Terje and Greiff, Victor , title=. mAbs , year=. doi:10.1080/19420862.2021.2008790 , url=
-
[15]
Ras-Carmona, Alvaro and Lehmann, Alexander A. and Lehmann, Paul V. and Reche, Pedro A. , title=. Scientific Reports , year=. doi:10.1038/s41598-022-18021-1 , url=
-
[16]
Ren, Jing and Song, Jiangning and Ellis, John and Li, Jinyan , title=. BMC Genomics , year=. doi:10.1186/s12864-017-3493-0 , url=
-
[17]
La Marca, Anthony F and Lopes, Robson da S and Lotufo, Anna Diva P and Bartholomeu, Daniella C and Minussi, Carlos R. BepFAMN : A Method for Linear B-Cell Epitope Predictions Based on Fuzzy-ARTMAP Artificial Neural Network. Sensors (Basel)
-
[18]
Liberis, Edgar and Veličković, Petar and Sormanni, Pietro and Vendruscolo, Michele and Liò, Pietro , title = ". Bioinformatics , volume =. 2018 , month =. doi:10.1093/bioinformatics/bty305 , url =
-
[19]
Paragraph—antibody paratope prediction using graph neural networks with minimal feature vectors , author=. Bioinformatics , volume=. 2023 , publisher=
work page 2023
-
[20]
Ahmed, Mansoor and Ali, Sarwan and Jan, Avais and Khan, Imdad Ullah and Patterson, Murray , title =. 2025 , doi =
work page 2025
-
[21]
In silico methods in antibody design , author=. Antibodies , volume=. 2018 , publisher=
work page 2018
-
[22]
Artificial intelligence-driven computational methods for antibody design and optimization , author=. Mabs , volume=. 2025 , organization=
work page 2025
-
[23]
arXiv preprint arXiv:2506.04235 , year=
AbBiBench: A Benchmark for Antibody Binding Affinity Maturation and Design , author=. arXiv preprint arXiv:2506.04235 , year=
-
[24]
OptMAVEn--a new framework for the de novo design of antibody variable region models targeting specific antigen epitopes , author=. PloS one , volume=. 2014 , publisher=
work page 2014
-
[25]
Attentive Cross-Modal Paratope Prediction , journal =
Deac, Andreea and Veli. Attentive Cross-Modal Paratope Prediction , journal =. 2019 , doi =
work page 2019
-
[26]
A trimodal protein language model enables advanced protein searches , author=. Nature Biotechnology , year=. doi:10.1038/s41587-025-02836-0 , url=
-
[27]
Nucleic acids research , year=
Chailyan, Anna and Tramontano, Anna and Marcatili, Paolo , title=. Nucleic acids research , year=. doi:10.1093/nar/gkr806 , url=
-
[28]
Lim, Yoong Wearn and Adler, Adam S. and Johnson, David S. , title=. mAbs , year=. doi:10.1080/19420862.2022.2069075 , url=
-
[29]
Briefings in bioinformatics , volume=
AntiFormer: graph enhanced large language model for binding affinity prediction , author=. Briefings in bioinformatics , volume=. 2024 , publisher=
work page 2024
-
[30]
CSM-AB: Graph-based antibody--antigen binding affinity prediction and docking scoring function , author=. Bioinformatics , volume=. 2022 , publisher=
work page 2022
-
[31]
ANTIPASTI: interpretable prediction of antibody binding affinity exploiting Normal Modes and Deep Learning , author=. Structure , volume=. 2024 , publisher=
work page 2024
-
[32]
Nature Machine Intelligence , volume=
A topology-based network tree for the prediction of protein--protein binding affinity changes following mutation , author=. Nature Machine Intelligence , volume=. 2020 , publisher=
work page 2020
-
[33]
Journal of Computational Biology , volume=
Reads2vec: Efficient embedding of raw high-throughput sequencing reads data , author=. Journal of Computational Biology , volume=. 2023 , publisher=
work page 2023
-
[34]
Robust representation and efficient feature selection allows for effective clustering of sars-cov-2 variants , author=. Algorithms , volume=. 2021 , publisher=
work page 2021
-
[35]
A k-mer based approach for sars-cov-2 variant identification , author=. Bioinformatics Research and Applications: 17th International Symposium, ISBRA 2021, Shenzhen, China, November 26--28, 2021, Proceedings 17 , pages=. 2021 , organization=
work page 2021
-
[36]
Exploring the Potential of GANs in Biological Sequence Analysis , author=. Biology , volume=. 2023 , publisher=
work page 2023
-
[37]
DLAB: deep learning methods for structure-based virtual screening of antibodies , author=. Bioinformatics , volume=. 2022 , publisher=
work page 2022
-
[38]
Myung, Yoochan and Pires, Douglas E V and Ascher, David B , title = ". Bioinformatics , volume =. 2021 , month =. doi:10.1093/bioinformatics/btab762 , url =
-
[39]
Machine learning prediction of Antibody-Antigen binding: dataset, method and testing , journal=
Ye, Chao and Hu, Wenxing and Gaëta, Bruno , year =. Machine learning prediction of Antibody-Antigen binding: dataset, method and testing , journal=
-
[40]
Jain, Tushar and Boland, Todd and Lilov, Asparouh and Burnina, Irina and Brown, Michael and Xu, Yingda and Vásquez, Maximiliano , title = ". Bioinformatics , volume =. 2017 , month =. doi:10.1093/bioinformatics/btx519 , url =
-
[41]
Frontiers in Microbiology , VOLUME=
Kang, Tae Hyun and Seong, Baik Lin , TITLE=. Frontiers in Microbiology , VOLUME=. 2020 , URL=. doi:10.3389/fmicb.2020.01927 , ISSN=
-
[42]
Antibody apparent solubility prediction from sequence by transfer learning , journal =
Jiangyan Feng and Min Jiang and James Shih and Qing Chai , keywords =. Antibody apparent solubility prediction from sequence by transfer learning , journal =. 2022 , issn =. doi:https://doi.org/10.1016/j.isci.2022.105173 , url =
-
[43]
Alejandro and Charonis, Spyros and Curtis, Robin and Warwicker, Jim , title =
Hebditch, Max and Carballo-Amador, M. Alejandro and Charonis, Spyros and Curtis, Robin and Warwicker, Jim , title =. 2017 , journal =. doi:10.1093/bioinformatics/btx345 , url =
-
[44]
Computational and artificial intelligence-based methods for antibody development
Kim, Jisun and McFee, Matthew and Fang, Qiao and Abdin, Osama and Kim, Philip M. Computational and artificial intelligence-based methods for antibody development. Trends Pharmacol Sci
-
[45]
Frontiers in immunology , volume=
DiscoTope-3.0: improved B-cell epitope prediction using inverse folding latent representations , author=. Frontiers in immunology , volume=. 2024 , publisher=
work page 2024
-
[46]
Antibody Therapeutics , volume =
Zhang, Weijie and Wang, Hao and Feng, Nan and Li, Yifeng and Gu, Jijie and Wang, Zhuozhi , title = ". Antibody Therapeutics , volume =. 2022 , month =. doi:10.1093/abt/tbac029 , url =
-
[47]
Computational methods for biomolecular docking , journal =. 1996 , issn =. doi:https://doi.org/10.1016/S0959-440X(96)80061-3 , url =
-
[48]
Deng, Haiyou and Jia, Ya and Zhang, Yang. Protein structure prediction. Int. J. Mod. Phys. B
-
[49]
Antibodies to watch in 2019 , author=. MAbs , volume=. 2019 , organization=
work page 2019
-
[50]
Phage display and hybridoma generation of antibodies to human CXCR2 yields antibodies with distinct mechanisms and epitopes , author=. MAbs , volume=. 2014 , organization=
work page 2014
-
[51]
Antibodies in diagnostics--from immunoassays to protein chips , author=. Immunology today , volume=. 2000 , publisher=
work page 2000
-
[52]
Highly accurate protein structure prediction with AlphaFold , author=. Nature , volume=. 2021 , publisher=
work page 2021
-
[53]
Open AI in education, the responsible and ethical use of ChatGPT towards lifelong learning , author=. Education, the Responsible and Ethical Use of ChatGPT Towards Lifelong Learning (February 11, 2023) , year=
work page 2023
-
[54]
Sormanni, Pietro and Aprile, Francesco A. and Vendruscolo, Michele. Third generation antibody discovery methods: in silico rational design. Chem. Soc. Rev. 2018. doi:10.1039/C8CS00523K
-
[55]
Nucleic acids research , volume=
PyIgClassify: a database of antibody CDR structural classifications , author=. Nucleic acids research , volume=. 2015 , publisher=
work page 2015
-
[56]
Current Opinion in Structural Biology , volume=
Advances in computational structure-based antibody design , author=. Current Opinion in Structural Biology , volume=. 2022 , publisher=
work page 2022
-
[57]
Dai, Bowen and Bailey-Kellogg, Chris , title = ". Bioinformatics , volume =. 2021 , month =
work page 2021
-
[58]
Automation of absolute protein-ligand binding free energy calculations for docking refinement and compound evaluation , author=. Scientific reports , volume=. 2021 , publisher=
work page 2021
-
[59]
Advancing Protein-DNA Binding Site Prediction: Integrating Sequence Models and Machine Learning Classifiers , author=. bioRxiv , pages=. 2023 , publisher=
work page 2023
-
[60]
Current opinion in virology , volume=
Antibody specific epitope prediction—emergence of a new paradigm , author=. Current opinion in virology , volume=. 2015 , publisher=
work page 2015
-
[61]
Wang, Chuan and Wang, Jiangyuan and Song, Wenjun and Luo, Guanzheng and Jiang, Taijiao , journal=. 2024 , publisher=
work page 2024
-
[62]
epitope1D: accurate taxonomy-aware
Silva, Bruna and Ascher, David and Pires, Douglas , journal=. epitope1D: accurate taxonomy-aware. 2023 , publisher=
work page 2023
-
[63]
Learning context-aware structural representations to predict antigen and antibody binding interfaces , author=. Bioinformatics , volume=. 2020 , publisher=
work page 2020
-
[64]
Liu, Chunan and Denzler, Lilian and Chen, Yihong and Martin, Andrew and Paige, Brooks , journal=
-
[65]
Lu, Shuai and Li, Yuguang and Ma, Qiang and Nan, Xiaofei and Zhang, Shoutao , journal=. A structure-based. 2022 , publisher=
work page 2022
-
[66]
Prediction of protein--protein interaction using graph neural networks , author=. Scientific Reports , volume=. 2022 , publisher=
work page 2022
-
[67]
Frontiers in immunology , volume=
SEMA: Antigen B-cell conformational epitope prediction using deep transfer learning , author=. Frontiers in immunology , volume=. 2022 , publisher=
work page 2022
-
[68]
Nucleic Acids Research , volume=
SEPPA-mAb: spatial epitope prediction of protein antigens for mAbs , author=. Nucleic Acids Research , volume=. 2023 , publisher=
work page 2023
-
[69]
Briefings in bioinformatics , volume=
Critical review of conformational B-cell epitope prediction methods , author=. Briefings in bioinformatics , volume=. 2023 , publisher=
work page 2023
-
[70]
The ClusPro AbEMap web server for the prediction of antibody epitopes , author=. Nature protocols , volume=. 2023 , publisher=
work page 2023
-
[71]
Journal of Biological Chemistry , volume=
Cloning and characterization of deoxymugineic acid synthase genes from graminaceous plants , author=. Journal of Biological Chemistry , volume=. 2006 , publisher=
work page 2006
-
[72]
Small ubiquitin-like modifier protein 3 enhances the solubilization of human bone morphogenetic protein 2 in E. coli , author=. Applied biochemistry and biotechnology , volume=. 2018 , publisher=
work page 2018
-
[73]
Journal of Experimental Botany , volume=
Paralogs and mutants show that one DMA synthase functions in iron homeostasis in rice , author=. Journal of Experimental Botany , volume=. 2017 , publisher=
work page 2017
- [74]
-
[75]
Nature communications , volume=
Histone H4 lysine 20 mono-methylation directly facilitates chromatin openness and promotes transcription of housekeeping genes , author=. Nature communications , volume=. 2021 , publisher=
work page 2021
-
[76]
Nature communications , volume=
Histone H4K20 methylation mediated chromatin compaction threshold ensures genome integrity by limiting DNA replication licensing , author=. Nature communications , volume=. 2018 , publisher=
work page 2018
-
[77]
Plant molecular biology , volume=
Iron deficiency regulated OsOPT7 is essential for iron homeostasis in rice , author=. Plant molecular biology , volume=. 2015 , publisher=
work page 2015
-
[78]
Nucleic Acids Research , volume=
Topokaryotyping demonstrates single cell variability and stress dependent variations in nuclear envelope associated domains , author=. Nucleic Acids Research , volume=. 2018 , publisher=
work page 2018
-
[79]
PUB-NChIP—“in vivo biotinylation” approach to study chromatin in proximity to a protein of interest , author=. Genome research , volume=. 2013 , publisher=
work page 2013
-
[80]
Journal of proteome research , volume=
PUB-MS: a mass spectrometry-based method to monitor protein--protein proximity in vivo , author=. Journal of proteome research , volume=. 2011 , publisher=
work page 2011
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.