TY - JOUR
T1 - Using topological data analysis and pseudo time series to infer temporal phenotypes from electronic health records
AU - Dagliati, Arianna
AU - Geifman, Nophar
AU - Peek, Niels
AU - Holmes, John H.
AU - Sacchi, Lucia
AU - Bellazzi, Riccardo
AU - Sajjadi, Seyed Erfan
AU - Tucker, Allan
N1 - Publisher Copyright:
© 2020 The Authors
PY - 2020/8/1
Y1 - 2020/8/1
N2 - Temporal phenotyping enables clinicians to better understand observable characteristics of a disease as it progresses. Modelling disease progression that captures interactions between phenotypes is inherently challenging. Temporal models that capture change in disease over time can identify the key features that characterize disease subtypes that underpin these trajectories. These models will enable clinicians to identify early warning signs of progression in specific sub-types and therefore to make informed decisions tailored to individual patients. In this paper, we explore two approaches to building temporal phenotypes based on the topology of data: topological data analysis and pseudo time-series. Using type 2 diabetes data, we show that the topological data analysis approach is able to identify disease trajectories and that pseudo time-series can infer a state space model characterized by transitions between hidden states that represent distinct temporal phenotypes. Both approaches highlight lipid profiles as key factors in distinguishing the phenotypes.
AB - Temporal phenotyping enables clinicians to better understand observable characteristics of a disease as it progresses. Modelling disease progression that captures interactions between phenotypes is inherently challenging. Temporal models that capture change in disease over time can identify the key features that characterize disease subtypes that underpin these trajectories. These models will enable clinicians to identify early warning signs of progression in specific sub-types and therefore to make informed decisions tailored to individual patients. In this paper, we explore two approaches to building temporal phenotypes based on the topology of data: topological data analysis and pseudo time-series. Using type 2 diabetes data, we show that the topological data analysis approach is able to identify disease trajectories and that pseudo time-series can infer a state space model characterized by transitions between hidden states that represent distinct temporal phenotypes. Both approaches highlight lipid profiles as key factors in distinguishing the phenotypes.
KW - Electronic phenotyping
KW - Longitudinal studies
KW - Type 2 diabetes
KW - Unsupervised machine learning
UR - http://www.scopus.com/inward/record.url?scp=85088372036&partnerID=8YFLogxK
U2 - 10.1016/j.artmed.2020.101930
DO - 10.1016/j.artmed.2020.101930
M3 - Article
C2 - 32972659
AN - SCOPUS:85088372036
SN - 0933-3657
VL - 108
JO - Artificial Intelligence in Medicine
JF - Artificial Intelligence in Medicine
M1 - 101930
ER -