Analyzing partially missing confounder information in comparative effectiveness and safety research of therapeutics

Pharmacoepidemiol Drug Saf. 2012 May;21 Suppl 2(0 2):13-20. doi: 10.1002/pds.3248.

Abstract

Purpose: Electronic healthcare databases are commonly used in comparative effectiveness and safety research of therapeutics. Many databases now include additional confounder information in a subset of the study population through data linkage or data collection. We described and compared existing methods for analyzing such datasets.

Methods: Using data from The Health Improvement Network and the relation between non-steroidal anti-inflammatory drugs and upper gastrointestinal bleeding as an example, we employed several methods to handle partially missing confounder information.

Results: The crude odds ratio (OR) of upper gastrointestinal bleeding was 1.50 (95% confidence interval: 0.98, 2.28) among selective cyclo-oxygenase-2 inhibitor initiators (n = 43 569) compared with traditional non-steroidal anti-inflammatory drug initiators (n = 411 616). The OR dropped to 0.81 (0.52, 1.27) upon adjustment for confounders recorded for all patients. When further considering three additional variables missing in 22% of the study population (smoking, alcohol consumption, body mass index), the OR was between 0.80 and 0.83 for the missing-category approach, the missing-indicator approach, single imputation by the most common category, multiple imputation by chained equations, and propensity score calibration. The OR was 0.65 (0.39, 1.09) and 0.67 (0.38, 1.16) for the unweighted and the inverse probability weighted complete-case analysis, respectively.

Conclusions: Existing methods for handling partially missing confounder data require different assumptions and may produce different results. The unweighted complete-case analysis, the missing-category/indicator approach, and single imputation require often unrealistic assumptions and should be avoided. In this study, differences across methods were not substantial, likely due to relatively low proportion of missingness and weak confounding effect by the three additional variables upon adjustment for other variables.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Anti-Inflammatory Agents, Non-Steroidal / adverse effects*
  • Anti-Inflammatory Agents, Non-Steroidal / therapeutic use
  • Confounding Factors, Epidemiologic*
  • Databases, Factual*
  • Electronic Health Records* / standards
  • Electronic Health Records* / statistics & numerical data
  • Gastrointestinal Hemorrhage / chemically induced
  • Gastrointestinal Hemorrhage / epidemiology*
  • Gastrointestinal Hemorrhage / etiology
  • Humans
  • Models, Statistical
  • Outcome and Process Assessment, Health Care
  • Pharmacoepidemiology / methods*
  • Pharmacoepidemiology / statistics & numerical data*
  • Research Design

Substances

  • Anti-Inflammatory Agents, Non-Steroidal