A machine learning-based prediction model for gout in hyperuricemics: a nationwide cohort study

  • Shay Brikman
  • , Liel Serfaty
  • , Ran Abuhasira
  • , Naomi Schlesinger
  • , Amir Bieber
  • , Nadav Rappoport

    Research output: Contribution to journalArticlepeer-review

    7 Scopus citations

    Abstract

    Objective: To develop a machine learning-based prediction model for identifying hyperuricemic participants at risk of developing gout. Methods: A retrospective nationwide Israeli cohort study used the Clalit Health Insurance database of 473 124 individuals to identify adults 18 years or older with at least two serum urate measurements exceeding 6.8 mg/dl between January 2007 and December 2022. Patients with a prior gout diagnosis or on gout medications were excluded. Patients' demographic characteristics, community and hospital diagnoses, routine medication prescriptions and laboratory results were used to train a risk prediction model. A machine learning model, XGBoost, was developed to predict the risk of gout. Feature selection methods were used to identify relevant variables. The model's performance was evaluated using the receiver operating characteristic area under the curve (ROC AUC) and precision-recall AUC. The primary outcome was the diagnosis of gout among hyperuricemic patients. Results: Among the 301 385 participants with hyperuricemia included in the analysis, 15 055 (5%) were diagnosed with gout. The XGBoost model had a ROC-AUC of 0.781 (95% CI 0.78-0.784) and precision-recall AUC of 0.208 (95% CI 0.195-0.22). The most significant variables associated with gout diagnosis were serum uric acid levels, age, hyperlipidemia, non-steroidal anti-inflammatory drugs and diuretic purchases. A compact model using only these five variables yielded a ROC-AUC of 0.714 (95% CI 0.706-0.723) and a negative predictive value (NPV) of 95%. Conclusions: The findings of this cohort study suggest that a machine learning-based prediction model had relatively good performance and high NPV for identifying hyperuricemic participants at risk of developing gout.

    Original languageEnglish
    Pages (from-to)2411-2417
    Number of pages7
    JournalRheumatology
    Volume63
    Issue number9
    DOIs
    StatePublished - 1 Sep 2024

    Keywords

    • big data
    • electronic health records
    • gout
    • hyperuricemia
    • machine learning

    ASJC Scopus subject areas

    • Rheumatology
    • Pharmacology (medical)

    Fingerprint

    Dive into the research topics of 'A machine learning-based prediction model for gout in hyperuricemics: a nationwide cohort study'. Together they form a unique fingerprint.

    Cite this