The use of missing values in proteomic data-independent acquisition mass spectrometry to enable disease activity discrimination

Kathryn A. McGurk, Arianna Dagliati, Davide Chiasserini, Dave Lee, Darren Plant, Ivona Baricevic-Jones, Janet Kelsall, Rachael Eineman, Rachel Reed, Bethany Geary, Richard D. Unwin, Anna Nicolaou, Bernard D. Keavney, Anne Barton, Anthony D. Whetton, Nophar Geifman

Research output: Contribution to journalArticlepeer-review

16 Scopus citations

Abstract

Motivation: Data-independent acquisition mass spectrometry allows for comprehensive peptide detection and relative quantification than standard data-dependent approaches. While less prone to missing values, these still exist. Current approaches for handling the so-called missingness have challenges. We hypothesized that non-random missingness is a useful biological measure and demonstrate the importance of analysing missingness for proteomic discovery within a longitudinal study of disease activity. Results: The magnitude of missingness did not correlate with mean peptide concentration. The magnitude of missingness for each protein strongly correlated between collection time points (baseline, 3 months, 6 months; R = 0.95-0.97, confidence interval = 0.94-0.97) indicating little time-dependent effect. This allowed for the identification of proteins with outlier levels of missingness that differentiate between the patient groups characterized by different patterns of disease activity. The association of these proteins with disease activity was confirmed by machine learning techniques. Our novel approach complements analyses on complete observations and other missing value strategies in biomarker prediction of disease activity.

Original languageEnglish
Pages (from-to)2217-2223
Number of pages7
JournalBioinformatics
Volume36
Issue number7
DOIs
StatePublished - 1 Apr 2020
Externally publishedYes

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Fingerprint

Dive into the research topics of 'The use of missing values in proteomic data-independent acquisition mass spectrometry to enable disease activity discrimination'. Together they form a unique fingerprint.

Cite this