Abstract
Forecasting the interaction between compounds and proteins is crucial for discovering new drugs. However, previous sequence-based studies have not utilized three-dimensional (3D) information on compounds and proteins, such as atom coordinates and distance matrices, to predict binding affinity. Furthermore, numerous widely adopted computational techniques have relied on sequences of amino acid characters for protein representations. This approach may constrain the model’s ability to capture meaningful biochemical features, impeding a more comprehensive understanding of the underlying proteins. Here, we propose a two-step deep learning strategy named MulinforCPI that incorporates transfer learning techniques with multi-level resolution features to overcome these limitations. Our approach leverages 3D information from both proteins and compounds and acquires a profound understanding of the atomic-level features of proteins. Besides, our research highlights the divide between first-principle and data-driven methods, offering new research prospects for compound–protein interaction tasks. We applied the proposed method to six datasets: Davis, Metz, KIBA, CASF-2016, DUDE and BindingDB, to evaluate the effectiveness of our approach.
Original language | English |
---|---|
Article number | bbad484 |
Journal | Briefings in Bioinformatics |
Volume | 25 |
Issue number | 1 |
DOIs | |
Publication status | Published - 2024 Jan 1 |
Bibliographical note
Publisher Copyright:© 2024 Oxford University Press. All rights reserved.
Keywords
- 3D geometric information
- compound–protein interaction
- cross-attention
- multi-level information
- transfer learning
ASJC Scopus subject areas
- Information Systems
- Molecular Biology