Background: Opioid use disorder (OUD) affects an estimated 16 million people worldwide. The diagnosis of OUD is commonly delayed or missed altogether. We aimed to test the utility of machine learning in creating a prediction model and algorithm for early diagnosis of OUD. Subjects and methods: We analyzed data gathered in a commercial claim database from January 1, 2006, to December 31, 2018 of 10 million medical insurance claims from 550 000 patient records. We compiled 436 predictor candidates, divided to six feature groups - demographics, chronic conditions, diagnosis and procedures features, medication features, medical costs, and episode counts. We employed the Word2Vec algorithm and the Gradient Boosting trees algorithm for the analysis. Results: The c-statistic for the model was 0.959, with a sensitivity of 0.85 and specificity of 0.882. Positive Predictive Value (PPV) was 0.362 and Negative Predictive Value (NPV) was 0.998. Significant differences between positive OUD- and negative OUD- controls were in the mean annual amount of opioid use days, number of overlaps in opioid prescriptions per year, mean annual opioid prescriptions, and annual benzodiazepine and muscle relaxant prescriptions. Notable differences were the count of intervertebral disc disorder-related complaints per year, post laminectomy syndrome diagnosed per year, and pain disorders diagnosis per year. Significant differences were also found in the episodes and costs categories. Conclusions: The new algorithm offers a mean 14.4 months reduction in time to diagnosis of OUD, at potential saving in further morbidity, medical cost, addictions and mortality.
- big data analytics
- machine learning
- Opioid use disorder
ASJC Scopus subject areas
- Pharmacology, Toxicology and Pharmaceutics (all)