The use of multiple imputation for the analysis of missing data

S Sinharay; H S Stern; D Russell

The use of multiple imputation for the analysis of missing data

Psychol Methods. 2001 Dec;6(4):317-29.

Authors

S Sinharay¹, H S Stern, D Russell

Affiliation

¹ Department of Statistics, Iowa State University, Ames 50011-1210, USA.

PMID: 11778675

Abstract

This article provides a comprehensive review of multiple imputation (MI), a technique for analyzing data sets with missing values. Formally, MI is the process of replacing each missing data point with a set of m > 1 plausible values to generate m complete data sets. These complete data sets are then analyzed by standard statistical software, and the results combined, to give parameter estimates and standard errors that take into account the uncertainty due to the missing data values. This article introduces the idea behind MI, discusses the advantages of MI over existing techniques for addressing missing data, describes how to do MI for real problems, reviews the software available to implement MI, and discusses the results of a simulation study aimed at finding out how assumptions regarding the imputation model affect the parameter estimates provided by MI.

Publication types

Review

MeSH terms

Bias
Data Collection / statistics & numerical data*
Humans
Likelihood Functions
Mathematical Computing
Psychological Tests / statistics & numerical data*
Psychology, Experimental / statistics & numerical data*
Psychometrics*
Software