Published December 8, 2023 | Version Submitted
Discussion Paper Open

Machine-learning-based Structural Analysis of Interactions between Antibodies and Antigens

  • 1. ROR icon Vanderbilt University
  • 2. ROR icon California Institute of Technology
  • 3. ROR icon Albert Einstein College of Medicine

Abstract

Computational analysis of paratope-epitope interactions between antibodies and their corresponding antigens can facilitate our understanding of the molecular mechanism underlying humoral immunity and boost the design of new therapeutics for many diseases. The recent breakthrough in artificial intelligence has made it possible to predict protein-protein interactions and model their structures. Unfortunately, detecting antigen-binding sites associated with a specific antibody is still a challenging problem. To tackle this challenge, we implemented a deep learning model to characterize interaction patterns between antibodies and their corresponding antigens. With high accuracy, our model can distinguish between antibody-antigen complexes and other types of protein-protein complexes. More intriguingly, we can identify antigens from other common protein binding regions with an accuracy of higher than 70% even if we only have the epitope information. This indicates that antigens have distinct features on their surface that antibodies can recognize. Additionally, our model was unable to predict the partnerships between antibodies and their particular antigens. This result suggests that one antigen may be targeted by more than one antibody and that antibodies may bind to previously unidentified proteins. Taken together, our results support the precision of antibody-antigen interactions while also suggesting positive future progress in the prediction of specific pairing.

Copyright and License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.

Acknowledgement

This work was supported by the National Institutes of Health under Grant Numbers R01GM120238 and R01GM122804. The work is also partially supported by a start-up grant from Albert Einstein College of Medicine. Computational support was provided by Albert Einstein College of Medicine High Performance Computing Center.

Data Availability

All relevant source codes of the CNN models can be found in the GitHub repository: https://github.com/EngiTom/2023AntibodyDataAnalysis.

Conflict of Interest

The authors declare no competing financial interests.

Files

nihpp-2023.12.06.570397v1.pdf

Files (317.5 kB)

Name Size Download all
md5:54bdd37ba770c32d0d0ca2ec346d42aa
317.5 kB Preview Download

Additional details

Identifiers

Funding

National Institutes of Health
R01GM120238
National Institutes of Health
R01GM122804
Albert Einstein College of Medicine