Explainable multimodal machine learning model for classifying pregnancy drug safety

Guy Shtar, Lior Rokach, Bracha Shapira, Elkana Kohn, Matitiahu Berkovitch, Maya Berlin

Research output: Contribution to journal, Article, peer-review

Abstract

Motivation: Teratogenic drugs can cause severe fetal malformation and therefore have a critical impact on the health of the fetus, yet the teratogenic risks are unknown for most approved drugs. This article proposes an explainable machine learning model for classifying pregnancy drug safety based on multimodal data and suggests an orthogonal ensemble for modeling multimodal data. To train the proposed model, we created a set of labeled drugs by processing over 100 000 textual responses collected by a large teratology information service. Structured textual information is incorporated into the model by applying clustering analysis to textual features.

Results: We report an area under the receiver operating characteristic curve (AUC) of 0.891 using cross-validation and an AUC of 0.904 for cross-expert validation. Our findings suggest that two drugs, Varenicline and Mebeverine, are safe during pregnancy and that Meloxicam, an NSAID, carries a higher risk; according to existing data, the safety of these three drugs during pregnancy is unknown. We also present a web-based application that enables physicians to examine a specific drug and its risk factors.
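For readers unfamiliar with the metric reported above, the following is a minimal sketch of how an AUC value can be computed from labels and model scores. It is not the authors' code: the function name `roc_auc`, the label convention (1 = higher risk, 0 = safe), and the toy data are all assumptions made for illustration. It uses the pairwise definition of AUC, the probability that a randomly chosen positive example receives a higher score than a randomly chosen negative one.

```python
from itertools import product


def roc_auc(labels, scores):
    """Pairwise AUC: share of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p, n in product(pos, neg)
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data (not from the paper): 1 = higher-risk drug, 0 = safe drug.
toy_labels = [1, 1, 1, 0, 0, 0, 0, 1]
toy_scores = [0.9, 0.8, 0.35, 0.4, 0.3, 0.2, 0.1, 0.7]

toy_auc = roc_auc(toy_labels, toy_scores)
print(toy_auc)  # 0.9375 (15 of 16 positive/negative pairs are ranked correctly)
```

In a cross-validation setting, a function like this would be applied to the held-out predictions of each fold. The pairwise form is quadratic in the number of examples, which is fine for illustration; library implementations typically use a rank-based computation instead.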

Original language: English
Pages (from-to): 1102-1109
Number of pages: 8
Journal: Bioinformatics
Volume: 38
Issue number: 4
State: Published - 15 Feb 2022

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics
