Fundamental Toxicological Sciences

Paper Details

Fundamental Toxicological Sciences
Vol. 11 No. 6 November 22, 2024 p.279-288
Original Article
Constructing a graph neural network-based artificial intelligence model to predict drug-induced phospholipidosis potential
  • Hiroshi Yamada (Toxicogenomics Informatics Project, National Institutes of Biomedical Innovation, Health and Nutrition / h-yamada@nibiohn.go.jp)
Yoshinobu Igarashi 1) , Aki Hasegawa 2) , Shigeyuki Matsumoto 2) , Hiroaki Iwata 2) 3) , Ryosuke Kojima 2) , Yasushi Okuno 2) , Hiroshi Yamada 1)
1) Toxicogenomics Informatics Project, National Institutes of Biomedical Innovation, Health and Nutrition , 2) Department of Biomedical Data Intelligence, Graduate School of Medicine, Kyoto University , 3) Present address: Department of Biological Regulation, Faculty of Medicine, Tottori University
Keywords: Bagging method, Deep learning, Drug-induced phospholipidosis, Explainable artificial intelligence, Graph neural networks
Abstracts

Drug-induced phospholipidosis (DIPL) is linked to various toxicities, including hepatotoxicity, making it a critical screening factor in the early stages of drug discovery. Several models based on chemical structures have been constructed to predict compounds with DIPL potential. However, most of these models only classify results as inducers or non-inducers, without identifying the specific substructures responsible for positive outcomes. To address this limitation, we constructed an artificial intelligence (AI) model to predict compounds with DIPL potential and visualize structural alerts. The proposed model was constructed using kMoL, an open-source software library that employs a graph neural network approach to learn from chemical structural data. We employed the bagging method, resulting in a model with a high predictive performance. The model attained an F1 score of 0.796 on the external test set. In addition, we used the integrated gradient method to visualize the substructures that contributed to positive predictions. When applying the method to compounds that experimentally conducted structure-activity relationship investigations, the proposed AI model accurately predicted DIPL potential, demonstrating its practical utility in early-stage drug discovery. By predicting DIPL based on the chemical structure of compounds, the proposed model can aid in the screening for DIPL, potentially improving the safety profile of new drug candidates.