Prediction of drug-drug interaction events using graph neural networks based feature extraction

Al-Rabeah, Mohammad Hussain; Lakizadeh, Amir

doi:10.1038/s41598-022-19999-4

Prediction of drug-drug interaction events using graph neural networks based feature extraction

Article
Open access
Published: 16 September 2022

Volume 12, article number 15590, (2022)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Prediction of drug-drug interaction events using graph neural networks based feature extraction

Download PDF

Mohammad Hussain Al-Rabeah¹ &
Amir Lakizadeh¹

7750 Accesses
16 Citations
1 Altmetric
Explore all metrics

Abstract

The prevalence of multi_drug therapies has been increasing in recent years, particularly among the elderly who are suffering from several diseases. However, unexpected Drug_Drug interaction (DDI) can cause adverse reactions or critical toxicity, which puts patients in danger. As the need for multi_drug treatment increases, it's becoming increasingly necessary to discover DDIs. Nevertheless, DDIs detection in an extensive number of drug pairs, both in-vitro and in-vivo, is costly and laborious. Therefore, DDI identification is one of the most concerns in drug-related researches. In this paper, we propose GNN-DDI, a deep learning-based method for predicting DDI-associated events in two stages. In the first stage, we collect the drugs information from different sources and then integrate them through the formation of an attributed heterogeneous network and generate a drug embedding vector based on different drug interaction types and drug attributes. In the second stage, we aggregate the representation vectors then predictions of the DDIs and their events are performed through a deep multi-model framework. Various evaluation results show that the proposed method can outperform state-of-the methods in the prediction of drug-drug interaction-associated events. The experimental results indicate that producing the drug's representations based on different drug interaction types and attributes is efficient and effective and can better show the intrinsic characteristics of a drug.

DPDDI: a deep predictor for drug-drug interactions

Article Open access 24 September 2020

Multimodal CNN-DDI: using multimodal CNN for drug to drug interaction associated events

Article Open access 19 February 2024

SSF-DDI: a deep learning method utilizing drug sequence and substructure features for drug–drug interaction prediction

Article Open access 23 January 2024

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Introduction

Recently, it became so popular to cure difficult diseases such as cancer using drug mixes or so-called Polypharmacy. It is a good approach, especially among the elderly who suffer from several diseases, using the synergistic effects of drug interactions. However, unplanned DDIs could risk a patient's life because they may cause side effects or perhaps dangerous toxicity. As the need for multidrug therapy increases, the detection of DDI becomes much more necessary^1,2. However, the diagnosis of DDI on a large number of drug pairs, both in vitro and in vivo, is costly and time-consuming³. Therefore, detecting DDIs is one of the main concerns in pharmaceutical research⁴. Detecting possible DDIs decreases the incidence of unexpected drug interactions and reduces drug production costs. It also can optimize the drug creation process. Therefore, the research of DDIs and adverse drug reactions (ADRs) is necessary for drug production and clinical applications, specifically for concomitant drugs⁵.

The explosive growth of large-scale and high-precision biological data has led to the formation of a research field called computational pharmacology. This data creates the opportunity for systematic analysis of various data. Analyzing this data can be useful to improve drug development and reduce the risk. The interactions are very popular in biological processes as bonds within a chemical compound. Therefore, networks are usually used to represent biological data. The emergence of this biological network requires new computational tools for analysis⁶. Thus, new studies have tried to address this shortcoming.

Recently, a large number of researchers in the graphical data structure field has led to a high level of promotion of graphical data structure analysis techniques. Indeed, there is a lot of attention on deep learning and its applications in this field. However, many researchers have presented a method for computing the weighted average for node neighbor information based on neural network processing methods. These graphic data structure processing models, using neural networks, are known as Graph Neural Networks (GNNs)⁷. This method extended the current neural network for processing graphical data structures.

In general, there are four popular approaches in the DDI prediction field: Similarity-based methods, Matrix Factorization-based methods, network analysis-based methods and Deep Learning-based methods. Similarity-based methods are based on the similarities between drugs and proteins, or drugs and diseases, and vice versa. They employ a classical classification model, such as SVMs, regular least squares, logistic regression, and random-forest to complete the prediction task⁸. Gottlieb et al.⁹ calculated feature vectors based on seven types of drug-drug similarities to represent drug-drug pairs and then used a weighted logistic regression model to predict DDI. Cheng et al.¹⁰ combined a variety of drug-drug similarities to represent drug-drug pairs and utilized five classifiers to construct the prediction models. Dang et al.¹¹ adopt a machine learning model to predict DDI types for histamine antagonist drugs using two similarity matrices as inputs. Then employ various classification algorithms such as Naive Bayes, Decision Tree, Random Forest, Logistic Regression, and XGBoost for DDIs prediction. Song et al.¹²developed a machine learning model using support vector machines (SVMs) based on several similarity matrices and then employed them as the input vector of the SVM.

Matrix Factorization is extensively used for data analysis. It factorizes the data matrix into a matrix with a smaller dimension. Then rebuild the adjacency matrix to determine DDIs. Yet, it maintains the complex structure and latent topological properties. Common Matrix Factorization has many forms, like (SVD) and graph factorization. Zhang et al.¹³ propose a matrix factorization method called (MRMF) which uses known DDIs and drug feature-based manifold to predict possible drug-drug interactions. Shi et al.¹⁴ develop a matrix factorization method named (BRSNMF) to divide drugs into communities and identify enhancive and degressive DDIs in the cold-start scenario. Rohani et al.¹⁵ collects several drug similarities and then utilized Integrated Similarity-constrained matrix factorization to identify DDIs. However, modern studies concentrate on developing different high-order data proximity matrices to maintain graph structure. For example, GraRep¹⁶ adopts the network high-order proximity and makes factorization by building k-step transition probability matrices.

Network-based methods employ network structure to construct relationships among biological and biomedical entities for predicting potential interactions⁸. Random walk-based methods are a famous approach in this field. These methods employ random walks in the networks to construct a node sequence. Then the method learns node embeddings using the word2vec model. One of the earliest models is DeepWalk¹⁷, which executes trimmed random walks on a network. Next,¹⁸ struc2vec is proposed for more acceptable modeling of the network structure. Especially, struct2vec can describe multi-layer weighted graph constructs. Lee et al.¹⁹ build a heterogeneous biological network using a combination of several databases and interaction data. Then adopted a method to calculate the relation strength between two drugs and discover paths of drug-drug pairs. Huang et al.²⁰ suggested a metric that calculates the relations strength of the network called 'S-score' to find possible PD DDIs. Lee et al.²¹ produce a global graph by employing a random walk with a restart algorithm and using the global information for prediction.

Deep learning is a recently popular branch of artificial neural networks that learns a sequential representation layer of features during the learning process. This approach is used in many fields effectively such as computer vision, NLP, and bioinformatics⁸. Many Types of neural networks were established in the graph embeddings area, such as autoencoder²², MLP²³, and GAN²⁴. Embedding network-based data is modeling a set of entities (nodes) and their links (edges). DeepWalk is the first model of processing graphical data using a deep learning approach. Many algorithms proposed motived by DeepWalk like node2vec and Metapath2vec. Also, there is recent progress in deep learning-based drug repositioning such as HINGRL²⁵, MGRL²⁶ and²⁷. Lately, a lot of studies in the field of graphical data structure have been extremely advanced for processing network data structure²⁸. Several researchers have developed a neural network method for computing a weighted average of information for each node's neighbor. These methods that employ neural networks for computing graphical data structure are generally known as Graph Neural Networks (GNNs). The GNN concept was initially introduced in 2009⁷ which expanded the current neural network for computing graphical structure data. Several GNN methods for graphical data structure were proposed, containing Graph Auto-encoders (GAEs)^29,30, Graph Convolutional Neural Networks (GCNs)^17,18, and Graph Recurrent Neural Networks (Graph RNNs)^23,31. Moreover, deep learning-based methods have been commonly used in the biomedical area^32,33 and have earned very good results. Karim et al.⁵ built a knowledge graph from several databases and used knowledge graph embedding to generate a drugs feature vector. Then employs a CNN-LSTM model for DDI prediction. Feng et al.³ proposed a technique called DPDDI to predict DDIs by collecting the drug's features from the DDI network with GCNs. Then uses the deep neural network model for prediction. Liu et al.³⁴ present a framework named DDI-MDAE supported by multi-modal deep auto-encoders using shared latent representation to identify DDIs. Lin et al.³⁵ present an end-to-end framework, called KGNN. This framework can effectively extract the drugs and their potential neighbors.

Normally, current methods are developed to predict whether drugs interact or not. However, DDIs may show different biological effects or events. Predicting DDI-associated events is an important and difficult task, and has acquired some attention³⁶. Ryu et al.³⁷ presented a deep learning approach based on drug chemical substructures to predict 86 crucial DDI types. Feng et al.³⁸ proposed a novel end-to-end deep learning-based predictive method called MTDDI to predict DDIs as well as their types. Deng et al.⁴ presented a multimodal deep learning framework that employed multiple drug features to predict 65 categories of DDI events. Even though the above methods have created strong efforts in event prediction but there is a space for advancement.

The DDI network can provide vital information about drugs interactions. Furthermore, using an attributed heterogeneous DDIs network that presents the drug's interaction types along with the drug features can better demonstrate the intrinsic characteristics of a drug. However, it is challenging to integrate various features effectively because the drug features might be correlated and contain redundant information. Directly combining various feature vectors is a common strategy, but we need a more effective framework for aggregating the features.

Here, we proposed a method for predicting DDI and their type (event) based on attributed heterogeneous graph embedding and a deep learning approach. The method consists of two stages. In the first stage, the data is collected and used to make four feature matrices (Chemical structure, Target, Enzyme, and Pathway) and one drug-drug matrix. Then the drug-drug matrix is used to build a heterogeneous network of drugs as nodes. The feature matrices after preprocessing are used in the network as node attributes. In this approach, we use the attributed heterogeneous network representation technique to integrate different drug properties in each type of drugs interactions and creates drug embedding vectors. The second step begins with the preparation of the embedding vector for each drug obtained from the previous step. Using one of the concatenation methods the feature victor of the drug pairs is obtained. Finally, a fully connected neural network uses these embedding vectors as input to predict the drug interaction types.

The proposed method is summarized in the following steps:

Step 1: Integrating data sources and extracting embedding vectors (final feature vectors):

Gathering drug data and calculating similarity matrices for each drug feature.
Building an attributed heterogeneous graph as an Integration graph.
Calculating drug embedding matrices by embedding an attributed heterogeneous graph using a new GNN model.

Step 2: Predicting Drug–Drug Interactions (DDI) types:

Reducing the dimensions of the matrix obtained from the previous step in the embedding process by merging the drug embedding vectors for each interaction type.
Creating matrices of drug pairs by Integrating the embedding vectors of each drug pairs.
Finally, the above vectors are given as input to a deep learning network to predict the type of drugs interaction.

Experiments and results

Evaluation metrics

There are two main tasks in DDI prediction, first is identifying the interactions among the drugs. The second is to determine what kind of interaction is between drugs. In this article, we employ k-fold cross-validation to evaluate the DDI prediction task. We randomly split the known drug-drug interactions into K subsets of equal size. Here we use fivefold (5-CV). In each fold, we use one subset as the testing set and keep the rest for training.

Here, we utilize different evaluation metrics to measure the prediction model performances. Our task is the multi-class classification work. We use accuracy (ACC), Area Under the Precision-Recall-Curve (AUPR), area under the ROC curve (AUC), F1 score and Precision and Recall as the evaluation metrics. We use micro metrics for AUPR and AUC and macro metrics for the others. The micro-scale studies the classes individually, but the macro-scale interacts with the sum or the whole, so the calculation is general. The difference between macro and micro scales is that the macro scale weighs all classes equally, while the micro-scale weighs each sample equally. If the number of samples is equal for each class, the micro and macro scales will have the same score. Here in this multi-class problem, micro-Precision, micro-Recall and micro-F1 are equal to accuracy.

Parameter setting

In this section, we discuss the effect of using different values for hyperparameters that influences the performance of the proposed model. The model consists of two stages. Therefore, we discuss embedding dimensions in the first stage and vector integration methods in the second stage.

Effect of embedding dimension size

Here, we evaluate the model performance using different sizes for embedding dimensions of the drugs. Figure 1 shows the performance of using different values for embedding dimensions. We found that the model with a vector size of 32 led to the best accuracy, which is probably due to the better representation of drugs. The embedding dimension with size 16 also shows good performance and is less time-consuming. Nevertheless, it achieves lower accuracy.

The effect of different integrating schema in terms of the model's accuracy

In this section, we discuss the effect of using different integration schemas of drug vectors. Integrating various features effectively is a difficult task because the drug features might be correlated and redundant. However, directly merging diverse feature vectors is a familiar strategy, but we need a more effective framework for aggregating the features. We test several aggregation schemas and choose the best one for the task. According to Table 1. the integration schema (a) shows better performance and achieves the best accuracy. This schema combines each drug embedding vector in all event types (interaction types) using (np.concatenate) as explained in the Eq. (1).

$$ F_{i} = \left[ {v_{i,1} , \ldots , v_{i,t} } \right] $$

(1)

$$ F_{i,j} = F_{i} \odot F_{j} $$

(2)

$$ F_{i,j} = \left[ {F_{i} ,F_{j} } \right] $$

(3)

where the embedding vector $v$ in certain edge type $t$ for certain drug $i$ is $v_{i,t}$ and the one-dimensional feature vector for drug $i$ is $F_{i}$. Then multiplies two vectors of drugs pair ($F_{i}$ and $F_{j}$ ) using (np.multiply) as shown in the Eq. (2). Where the feature vectors of the drugs pair are $F_{i,j}$. This schema led to the best accuracy. However, the integration schema (b) performs well but it consumes a lot of storage space and time and achieves less accuracy than (a). The integration method (b) uses the same way of combining each drug embedding vector in all event types using (np.concatenate). But then it uses (np.concatenate) again to merge the vectors of drugs pair as shown in the Eq. (3). The integration method (d) achieves the second-best accuracy after (a) but it is also time-consuming. Finally, Table 1. and Fig. 2 shows the effect of using different integrating approaches in term of the model's accuracy. Figure 3 shows an overview of integration methods.

Table 1 The effect of different integrating schema in term of model's accuracy.

Full size table

The effect of using different drug features

We examined the proposed method in several cases based on using different drug features as input to make a more accurate evaluation. First, we implemented the proposed method on each feature matrix (similarity‌ matrices) separately. Then, we implemented the proposed method on a combination of feature matrices. Table 2 shows the results for using the feature matrices in different ways, as well as the results for all feature matrices combinations. We refer to the embedding of drugs interactions network with GD, and to the Enzyme, Target, Chemical structures and Pathways with E, T, S, and P respectively.

Table 2 Effect of using different data sources in terms of evaluation measures.

Full size table

The combination of different feature matrices has led to better results. The model performance using GNN models shows the efficiency in extracting and summarizing the drug's features from the network structure. Furthermore, using the embedding of drug interactions network alongside the features matrices show better performance for the model and the best result in AUC and AUPR values. The model using the enzyme matrix shows the lowest accuracy. While the Chemical Structure matrix has the highest accuracy in the individual evaluation of each matrix, which appears to be a more informative and good effect on explaining the interaction. In general, using drug network embedding alone shows significant improvement in accuracy. It is probably due to the presence of topological information in this network structure that led to better modeling of the drug's interaction. However, using the feature matrices as node attribute improves the model performance because of the information that these features add to the model. Figure 4 shows the integration of different feature matrices improved the accuracy. Also, using drugs network embedding along with the feature matrices as node attributes achieved the best result.

Comparison with the other methods

We compared the proposed model with several state-of-the-art event prediction methods: DDIMDL, CNN-DDI, DANN-DDI and MDNN. We also consider many popular classification methods, i.e. random forest (RF), K-Nearest Neighbor (KNN) and Logistic Regression (LR). We compare the proposed model with these models to explain the advantages of utilizing the attributed heterogeneous network embedding method using drug features and the DDI edge list. Further, to show the impact of efficient aggregation schema. The DDIMDL model uses four similarity matrices of drug features as input. This model adopts four sub-network to learn cross-modality representations of drug-drug pairs. The DANN-DDI model after constructing multiple drug feature networks adopts an attention neural network to aggregate the learned drug representations and predict drug-drug interactions. We implement DANN-DDI according to the descriptions in³⁹. CNN-DDI model first gathers the feature vectors from interaction matrices and calculates drug similarity. Then, it uses the features representation as input for the convolution neural network model to identify DDIs. We perform the CNN-DDI model based on the descriptions in⁴⁰. The MDNN model develops a two-pathway framework. The framework includes a drug knowledge graph (DKG) based pathway and a heterogenous feature (HF) based pathway to produce drug multimodal representations. Next, the model employs a multimodal fusion neural layer to predict DDI events. We implement MDNN according to the descriptions in⁴¹.

We use Table 3 to list all the prediction model's results. Figure 5 shows the performance of different models. The results show that the proposed model outperforms all of the comparison models in all metrics. The proposed model can overcome the imbalance challenge in the dataset and achieve the best AUPR score for the DDI event prediction task. Due to the imbalance in the dataset, the other models easily overfit. This shows the advantage of using the attributed heterogeneous network embedding method because the model extracts the drug representation in all different interaction types which can better describe the intrinsic characteristics of a drug. Furthermore, the proposed model tests several aggregation schemas and applies the best one for aggregating the features. Then, the model uses the joint sub-networks framework to combine the feature vectors of the drugs and predict the DDI events. The proposed model improved the prediction process based on AUC, AUPR and F1_score metrics and achieved 0.9992, 0.9717 and 0.8579, respectively. The model results during five folds show minimum accuracy of 0.9196 and average accuracy of 0.9211 and maximum accuracy of 0.9220. The results of the model in five folds are shown in Table 4.

Table 3 Results of comparison of the proposed method with the previous methods.

Full size table

Table 4 The results of the proposed model in five folds (5 CV).

Full size table

Figure 6 shows the efficiency of the proposed method in predicting different interactions type between drugs independently. Here we use the word event to address the interaction type between drugs. The model achieved good AUC and F1 scores in predicting most drug events. Except the event 39, which is wrongly classified as event 1. It may be because both events are related to drug metabolism. Also, the model for drug events from 51 to 65 achieved low metric scores in AUC and F1 scores, and it is due to the lack of samples. It has already been pointed out that the data is unbalanced, which is a big challenge and a regular problem in biological data. But the proposed method was able to deal with this imbalance in data.

Conclusion

In this study, we construct a drug-drug heterogeneous network and several similarity matrices, such as drug-target, drug–chemical structure, etc. We use this network and the matrices in form of an attributed heterogeneous network to extract the drug feature vector using a GNN embedding method. The proposed model uses the drug network structure with the nodes attribute to generate drug embedding. Then, the proposed model integrates the drug feature vectors and finally adopts a fully connected sub-networks framework to predict the Drug-Drug Interaction type. We explain the dataset and evaluation metrics and discuss the results and evaluations of the proposed model. We apply five-fold cross-validation to the proposed model for evaluation. The model achieved 0.9220 as max accuracy and 0.9211 as average accuracy. The proposed model outperforms the existing DDI event prediction method. Also, we implement the model on each similarity matrix separately. Then we implement it on a combination of similarity matrices and report the results of predicting drug events. Further, we discuss the influence of using different hyperparameters in the model performance. We discuss utilizing various drug embedding dimensions and methods of integrating drug embedding vectors.

In conclusion, employing the attributed heterogeneous network embedding method can provide better drug representation in different drug interaction types and lead to better model performance. Also, using an effective aggregation schema and implementing a fully connected sub-networks framework can provide a powerful method to integrate various drug features. Furthermore, the experimental results indicate that this model outperforms the existing approaches. We can use the PU Learning strategy for future work to enhance the network positive samples by classifying the unlabeled data. Also, we can use a new approach to consider new drugs in the DDI event prediction process.

Materials and methods

In this work, we propose a framework of two stages that combines several drug features to predict DDI-associated events, using attributed heterogeneous networks representation and aggregation schema with multiple deep neural networks. Firstly, it generates drug embedding from attributed heterogeneous networks using a GNN model. Next, it aggregates the feature vectors and uses multiple deep neural networks for DDI event prediction.

Data collection

The data used in this research is derived from the study of Deng et al.⁴. Researchers in this study obtained and cleaned the required data from reputable databases such as DrugBank⁴² and KEGG⁴³. This dataset includes four types of property or feature matrices for drugs: Chemical structure, Target, Enzyme, and Pathway. We obtained the pathway matrix from DrugBank and KEGG databases. But the rest of the matrices were collected from the DrugBank database. Each column in the features matrices represents the drugs. The rows represent specific drug properties (For example, the number of Enzyme types). The values of one and zero for each entity indicate the presence/absence of a specific property, respectively (for example, a certain enzyme for a particular drug).

The dataset provides a drug-drug edge list that includes 65 types of drug-drug relationships. The drug relationship refers to drug interactions. This database displayed drug interaction events as a quadruple structure: (drug A, drug B, mechanism, action). "Mechanism" means the effect of drugs in terms of metabolism, therapeutic effect, etc. "Action" indicates an increase or decrease in the effects. We employ the first two sections as an edge list of drug interactions and the second two sections as an interaction type or so-called event. The distribution of these events is not even, so the data is unbalanced. Figure 7 shows the distribution of the samples between events in the dataset. Therefore, the model under-fits simply in training, which is one of the main difficulties in this dataset. We use the edge list of DDI and one of the feature matrices to construct an Attributed Heterogeneous Graph. In this graph, the nodes refer to drugs and the links between them indicate 65 types of interactions. Table 5 shows a summary of these matrices.

Table 5 Types of properties in the dataset.

Full size table

The first step of the proposed method

In the first stage of this approach, after preparing the attributed heterogeneous graph of drugs and feature matrices, we start the embedding process for each drug in each event type. At the end of this process, we will have an embedding matrix. In this matrix, each vector represents the embedding of that drug in a particular event type. Figure 8 shows a view of the first step of the method. Next, we discussed the details of each step in this phase.

Collect data and construct similarity matrices

Firstly, we collect five adjacency matrices from the information sources. A drug-drug matrix shows the interaction between two drugs and their event type. The feature matrices (drug-enzyme, drug-target, Drug–Chemical structure and drug-pathway) indicate the relationship between the drug and a particular feature. Then, after obtaining the matrices, we start constructing similarity matrices from the adjacency matrices of the properties.

We use the Jacquard similarity function to calculate the similarity matrix for each adjacency matrix. The Jacquard function is expressed in the following equation:

$$ J\left( {A,B} \right) = \frac{{M_{11} }}{{M_{01} + M_{10} + M_{11} }} $$

(4)

Considering two-feature vectors A and B, where each one contains n elements with values 0 or 1.

$M_{11}$ means the number of entities that is 1 in both vectors A and B.
$M_{01}$ means the number of entities that is 0 in vector A and 1 in B.
$M_{10}$ means the number of entities that is 1 in A and 0 in B.

The Jaccard function for each pair of drugs receives the binary vector of the features of the drugs. Then calculate their similarity using the above formula. For example, to calculate the similarity of two drugs, $d_{i}$ and $d_{j}$, based on the feature of the target proteins. The row $i$ and the row $j$ of the feature matrix are given as input to the Jaccard function.

Constructing a heterogeneous network for integrating the data resources

In this stage, we construct an attributed heterogeneous network from the drug_drug edge list. The drug_drug edge list shows interactions between the drugs in the list and specifies the type of relationship. This network has many different edge types. In this network, the nodes refer to drugs and the attributes of the nodes refer to the drug's features. For example, if we consider the drug_pathway similarity matrix as a node attribute, then each vector in this matrix is considered as a node attribute for the corresponding drug. To generate drug representation, we use the attributed heterogeneous network with one of the similarity matrixes as a node attribute in each step. As a result, we will have four networks. The representation vectors are made using network structure and node feature vectors in the embedding process. The embedding process will generate four embeddings’ matrices for each drug for all interaction types. Each matrix has three dimensions indicating the nodes number, the embedding size and the number of edge types.

Extracting drugs embedding vectors

In the concept of the neural network, extracting a low-dimensional vector for input entities based on their initial features is called embedding. There are several ways to generate graph embedding. In the proposed approach, we introduce a GNN-based model for learning Attributed Heterogeneous networks to extract the low-dimensional representation of nodes in the network.

In this approach, we adopt an algorithm based on the recent research⁴⁴ to learn the embedding of the attributed heterogeneous network. The proposed algorithm can derive the latent topological properties of the network structure along with the node's attributes. It generates the embedding of every node $v_{i}$ on each edge type $r$ in two parts: base embedding and edge embedding. The model uses the node's attributes and network structure in the transformation function to generate base embedding and edge embedding.

The model takes the heterogeneous network and the node's attributes as input. Then the process starts by generating training samples in each edge type using the Random Walk diffusion method. The model creates node sequences using Random Walk and then performs Skip-gram over node sequences to learn embeddings. The model updates to achieve the overall embedding for each node in each edge type. Figure 9 shows an overview of this stage.

Suppose that we have $n$ drugs and $r$ edge type; the drugs embedding using Enzyme similarity matrix as node attributes is $\{ \{ E_{i}^{e} \}^{r} \}^{n}$. The other matrices are $\{ \{ E_{i}^{t} \}^{r} \}^{n} , \{ \{ E_{i}^{p} \}^{r} \}^{n} , \{ \{ E_{i}^{s} \}^{r} \}^{n}$ using Target, Pathway and Chemical structure similarity matrices as node attributes respectively. Generally, the embedding process generates four embedding matrices made from the drug-drug interactions network and four similarity matrices of drug features. Each matrix has three dimensions indicating the nodes number, the embedding size and the number of edge types.

The second step of the proposed method

After creating the embedding matrices for drugs, we use a concatenation (aggregation) method to reduce the embedding matrices' dimensions into a one-dimensional feature vector. In a multi fully connected deep learning model, this feature vector is used as an input to predict the DDI types. Figure 10 shows an overview of this process.

Dimensions reduction of the embedding matrix

After generating the network embedding matrix, each drug is represented by a two-dimensional matrix. This matrix contains the node (Drug) embedding vectors in each edge type (Interaction type). We use a concatenation method for each drug matrix to merge the drug embedding vectors together. The generated one-dimensional vector represents the embedding vector of the drug $i$ in all edge types. Then, we obtain a feature vector for each drug pair in the DDI list by multiplying the feature vectors of drug $i$ and drug $j$ of the drug pair.

If the embedding matrix of drug $i$ is $M_{i}$ and the vector $v$ in certain edge type $t$ is $v_{i,t}$ then the one-dimensional feature vector $F_{i}$ for drug $i$ is $F_{i} = \left[ {v_{i,1} , \ldots , v_{i,t} } \right]$, and the feature vectors of the drugs pair $k$ is $F_{k} = F_{i} \odot F_{j}$, where $\odot$ is the element-wise product.

DDI prediction by a fully connected deep learning network

After producing the four matrices of feature(embedding) vectors in the first step, the fully connected deep learning network is used to perform the prediction task. As shown in Fig. 10, the designed model for the second step consists of four sub-networks. Motivated by the bottleneck-like neural network idea⁴⁵, each sub-network uses one of four matrices of the drug's feature vector as input. The result of these sub-networks is aggregated to achieve the final result. We use several hidden layers in the networks and batch normalization layers⁴⁶ between them. Then a softmax layer is employed for prediction in these sub-networks. Finally, to enhance generalization ability and avoid over-fitting, we add dropout layers to the networks⁴⁷. We adopt (ReLU)⁴⁸ as an activation function in the networks. Here, the outputs of the sub-networks are merged by calculating the average and producing the final prediction.

We choose the cross-entropy loss function and utilize the Adam optimizer with the default parameters for the optimization algorithm. To control over-fitting while speeding up the training process, we use the early-stopping approach⁴⁹. With this approach, if no improvement is observed in 10 epochs, the training automatically stops.

Data availability

The datasets and codes using in this study are available in https://github.com/Mohammad-Hussain95/GNN_DDI.

References

Han, K. et al. Synergistic drug combinations for cancer identified in a CRISPR screen for pairwise genetic interactions. Nat. Biotechnol. 35, 463–474 (2017).
Article CAS Google Scholar
Takeda, T., Hao, M., Cheng, T., Bryant, S. H. & Wang, Y. Predicting drug-drug interactions through drug structural similarities and interaction networks incorporating pharmacokinetics and pharmacodynamics knowledge. J. Cheminform. 9, 16 (2017).
Article Google Scholar
Feng, Y. H., Zhang, S. W. & Shi, J. Y. DPDDI: A deep predictor for drug-drug interactions. BMC Bioinform. 21, 419 (2020).
Article Google Scholar
Deng, Y. et al. A multimodal deep learning framework for predicting drug-drug interaction events. Bioinformatics 36, 4316–4322 (2020).
Article CAS Google Scholar
Karim, M. R. et al. Drug-drug interaction prediction based on knowledge graph embeddings and convolutional-LSTM network. In Proceedings of the 10th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics 113–123 (Association for Computing Machinery, 2019).
Muzio, G., O’Bray, L. & Borgwardt, K. Biological network analysis with deep learning. Brief. Bioinform. 22, 1515–1530 (2021).
Article Google Scholar
Wu, Z. et al. A comprehensive survey on graph neural networks. IEEE Trans. Neural Netw. Learn. Syst. 32, 4–24 (2021).
Article MathSciNet Google Scholar
Luo, H. et al. Biomedical data and computational models for drug repositioning: A comprehensive review. Brief. Bioinform. 22, 1604–1619 (2021).
Article CAS Google Scholar
Gottlieb, A., Stein, G. Y., Oron, Y., Ruppin, E. & Sharan, R. INDI: A computational framework for inferring drug interactions and their associated recommendations. Mol. Syst. Biol. 8, 592 (2012).
Article Google Scholar
Cheng, F. & Zhao, Z. Machine learning-based prediction of drug-drug interactions by integrating drug phenotypic, therapeutic, chemical, and genomic properties. J. Am. Med. Inform. Assoc. 21, e278-286 (2014).
Article Google Scholar
Dang, L. H. et al. Machine learning-based prediction of drug-drug interactions for histamine antagonist using hybrid chemical features. Cells 10, 3092 (2021).
Article CAS Google Scholar
Song, D. et al. Similarity-based machine learning support vector machine predictor of drug-drug interactions with improved accuracies. J. Clin. Pharm. Ther. 44, 268–275 (2019).
Article CAS Google Scholar
Zhang, W., Chen, Y., Li, D. & Yue, X. Manifold regularized matrix factorization for drug-drug interaction prediction. J. Biomed. Inform. 88, 90–97 (2018).
Article Google Scholar
Shi, J. Y., Mao, K. T., Yu, H. & Yiu, S. M. Detecting drug communities and predicting comprehensive drug-drug interactions via balance regularized semi-nonnegative matrix factorization. J. Cheminform. 11, 28 (2019).
Article Google Scholar
Rohani, N., Eslahchi, C. & Katanforoush, A. ISCMF: Integrated similarity-constrained matrix factorization for drug–drug interaction prediction. Netw. Model. Anal. Health Inform. Bioinform. 9, 11 (2020).
Article Google Scholar
Cao, S., Lu, W. & Xu, Q. GraRep: Learning Graph Representations with Global Structural Information. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management 891–900 (Association for Computing Machinery, 2015).
Perozzi, B., Al-Rfou, R. & Skiena, S. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 701–710 (2014).
Ribeiro, L. F., Saverese, P. H. & Figueiredo, D. R. struc2vec: Learning node representations from structural identity. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 385–394.
Lee, K., Lee, S., Jeon, M., Choi, J. & Kang, J. Drug-drug interaction analysis using heterogeneous biological information network. In 2012 IEEE International Conference on Bioinformatics and Biomedicine 1–5.
Huang, J. et al. Systematic prediction of pharmacodynamic drug-drug interactions through protein-protein-interaction network. PLoS Comput. Biol. 9, e1002998 (2013).
Article CAS Google Scholar
Lee, I. & Nam, H. Identification of drug-target interaction by a random walk with restart method on an interactome network. BMC Bioinform. 19, 208 (2018).
Article Google Scholar
Wang, D., Cui, P. & Zhu, W. Structural deep network embedding. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 1225–1234 (Association for Computing Machinery, 2016).
Tang, J. et al. LINE: Large-scale information network embedding. In Proceedings of the 24th International Conference on World Wide Web 1067–1077 (International World Wide Web Conferences Steering Committee, 2015).
Wang, H. et al. Learning graph representation with generative adversarial nets. IEEE Trans. Knowl. Data Eng. 33, 3090–3103 (2019).
Article Google Scholar
Zhao, B.-W., Hu, L., You, Z.-H., Wang, L. & Su, X.-R. HINGRL: Predicting drug–disease associations with graph representation learning on heterogeneous information networks. Brief. Bioinform. 23, bba515 (2021).
Article Google Scholar
Zhao, B.-W. et al. MGRL: Predicting drug-disease associations based on multi-graph representation learning. Front. Genet. 12, 657182 (2021).
Article CAS Google Scholar
Su, X. et al. Biomedical knowledge graph embedding with capsule network for multi-label drug-drug interaction prediction. IEEE Trans. Knowl. Data Eng. https://doi.org/10.1109/TKDE.2022.3154792 (2022).
Article Google Scholar
Zhou, J. et al. Graph neural networks: A review of methods and applications. AI Open 1, 57–81 (2020).
Article Google Scholar
Kipf, T. N. & Welling, M. Variational graph auto-encoders. arXiv preprint arXiv:1611.07308 (2016).
Wang, Y., Yao, H. & Zhao, S. Auto-encoder based dimensionality reduction. Neurocomputing 184, 232–242 (2016).
Article Google Scholar
Cao, S., Lu, W. & Xu, Q. GraRep: Learning graph representations with global structural information. Proc. AAAI Conf. Artif. Intell. https://doi.org/10.1145/2806416.2806512 (2015).
Article Google Scholar
Jin, W., Yang, K., Barzilay, R. & Jaakkola, T. Learning multimodal graph-to-graph translation for molecular optimization. arXiv preprint arXiv:1812.01070 (2018).
Ozturk, H., Ozgur, A. & Ozkirimli, E. DeepDTA: Deep drug-target binding affinity prediction. Bioinformatics 34, i821–i829 (2018).
Article CAS Google Scholar
Zhang, Y., Qiu, Y., Cui, Y., Liu, S. & Zhang, W. Predicting drug-drug interactions using multi-modal deep auto-encoders based network embedding and positive-unlabeled learning. Methods 179, 37–46 (2020).
Article CAS Google Scholar
Lin, X., Quan, Z., Wang, Z.-J., Ma, T. & Zeng, X. KGNN: Knowledge Graph Neural Network for Drug-Drug Interaction Prediction (International Joint Conferences on Artificial Intelligence Organization, 2020).
Google Scholar
Zitnik, M., Agrawal, M. & Leskovec, J. Modeling polypharmacy side effects with graph convolutional networks. Bioinformatics 34, i457–i466 (2018).
Article CAS Google Scholar
Ryu, J. Y., Kim, H. U. & Lee, S. Y. Deep learning improves prediction of drug-drug and drug-food interactions. Proc. Natl. Acad. Sci. USA 115, E4304–E4311 (2018).
Article CAS Google Scholar
Feng, Y., Zhang, S.-W., Zhang, Q.-Q., Zhang, C.-H. & Shi, J.-Y. MTDDI: A Graph Convolutional Network Framework for Predicting Multi-Type Drug-Drug Interactions (Research Square, 2021).
Google Scholar
Liu, S. et al. Enhancing Drug-Drug Interaction Prediction Using Deep Attention Neural Networks (Cold Spring Harbor Laboratory, 2021).
Book Google Scholar
Zhang, C., Lu, Y. & Zang, T. CNN-DDI: A learning-based method for predicting drug-drug interactions using convolution neural networks. BMC Bioinform. 23, 88 (2022).
Article CAS Google Scholar
Lyu, T., Gao, J., Tian, L., Li, Z., Zhang, P. & Zhang, J. MDNN: A multimodal deep neural network for predicting drug-drug interaction events. In IJCAI 3536–3542 (2021).
Wishart, D. S. et al. DrugBank 5.0: A major update to the DrugBank database for 2018. Nucleic Acids Res. 46, D1074–D1082 (2018).
Article CAS Google Scholar
Kanehisa, M., Furumichi, M., Tanabe, M., Sato, Y. & Morishima, K. KEGG: New perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 45, D353–D361 (2017).
Article CAS Google Scholar
Cen, Y. et al. Representation learning for attributed multiplex heterogeneous network. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining 1358–1368 (Association for Computing Machinery, 2019).
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
Ioffe, S. & Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International Conference on Machine Learning 448–456 (PMLR) (2015).
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I. & Salakhutdinov, R. Dropout: A simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014).
MathSciNet MATH Google Scholar
Nair, V. & Hinton, G. E. Rectified linear units improve restricted Boltzmann machines. In Proceedings of the 27th International Conference on International Conference on Machine Learning 807–814 (2010).
Prechelt, L. Early stopping: But when? In Neural Networks: Tricks of the Trade 2nd edn (eds Montavon, G. et al.) 53–67 (Springer, 2012).
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Computer Engineering Department, University of Qom, Qom, Iran
Mohammad Hussain Al-Rabeah & Amir Lakizadeh

Authors

Mohammad Hussain Al-Rabeah
View author publications
You can also search for this author in PubMed Google Scholar
Amir Lakizadeh
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.A. and A.L. conceived of the study. M.A. implemented the study. Both authors have written and read and approved the final manuscript.

Corresponding author

Correspondence to Amir Lakizadeh.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Al-Rabeah, M.H., Lakizadeh, A. Prediction of drug-drug interaction events using graph neural networks based feature extraction. Sci Rep 12, 15590 (2022). https://doi.org/10.1038/s41598-022-19999-4

Download citation

Received: 28 June 2022
Accepted: 07 September 2022
Published: 16 September 2022
DOI: https://doi.org/10.1038/s41598-022-19999-4
Springer Nature Limited

This article is cited by

Learning self-supervised molecular representations for drug–drug interaction prediction
- Rogia Kpanou
- Patrick Dallaire
- Jacques Corbeil
BMC Bioinformatics (2024)
Bridging the Worlds of Pharmacometrics and Machine Learning
- Kamilė Stankevičiūtė
- Jean-Baptiste Woillard
- Mihaela van der Schaar
Clinical Pharmacokinetics (2023)

Prediction of drug-drug interaction events using graph neural networks based feature extraction

Abstract

Similar content being viewed by others

DPDDI: a deep predictor for drug-drug interactions

Multimodal CNN-DDI: using multimodal CNN for drug to drug interaction associated events

SSF-DDI: a deep learning method utilizing drug sequence and substructure features for drug–drug interaction prediction

Explore related subjects

Introduction

Experiments and results

Evaluation metrics

Parameter setting

Effect of embedding dimension size

The effect of different integrating schema in terms of the model's accuracy

The effect of using different drug features

Comparison with the other methods

Conclusion

Materials and methods

Data collection

The first step of the proposed method

Collect data and construct similarity matrices

Constructing a heterogeneous network for integrating the data resources

Extracting drugs embedding vectors

The second step of the proposed method

Dimensions reduction of the embedding matrix

DDI prediction by a fully connected deep learning network

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Learning self-supervised molecular representations for drug–drug interaction prediction

Bridging the Worlds of Pharmacometrics and Machine Learning

Search

Navigation