Optimal-design domain-adaptation for exposure prediction in two-stage epidemiological studies

Ron Sarafian, Itai Kloog, Jonathan D. Rosenblatt

Research output: Contribution to journalArticlepeer-review

Abstract

Background: In the first stage of a two-stage study, the researcher uses a statistical model to impute the unobserved exposures. In the second stage, imputed exposures serve as covariates in epidemiological models. Imputation error in the first stage operate as measurement errors in the second stage, and thus bias exposure effect estimates. Objective: This study aims to improve the estimation of exposure effects by sharing information between the first and second stages. Methods: At the heart of our estimator is the observation that not all second-stage observations are equally important to impute. We thus borrow ideas from the optimal-experimental-design theory, to identify individuals of higher importance. We then improve the imputation of these individuals using ideas from the machine-learning literature of domain adaptation. Results: Our simulations confirm that the exposure effect estimates are more accurate than the current best practice. An empirical demonstration yields smaller estimates of PM effect on hyperglycemia risk, with tighter confidence bands. Significance: Sharing information between environmental scientist and epidemiologist improves health effect estimates. Our estimator is a principled approach for harnessing this information exchange, and may be applied to any two stage study.

Original languageEnglish
Pages (from-to)963-970
Number of pages8
JournalJournal of Exposure Science and Environmental Epidemiology
Volume33
Issue number6
DOIs
StatePublished - 1 Nov 2023

Keywords

  • Domain-adaptation
  • Environmental epidemiology
  • Optimal-design
  • Two-stage studies

ASJC Scopus subject areas

  • Epidemiology
  • Toxicology
  • Pollution
  • Public Health, Environmental and Occupational Health

Fingerprint

Dive into the research topics of 'Optimal-design domain-adaptation for exposure prediction in two-stage epidemiological studies'. Together they form a unique fingerprint.

Cite this