SPSS Syntax for Missing Value Imputation in Test and ...

4 downloads 351 Views 39KB Size Report
A well-known problem in the analysis of test and questionnaire data is that some item ... scores using simple methods, two SPSS subroutines were written.
SPSS Syntax for Missing Value Imputation in Test and Questionnaire Data Joost R. van Ginkel and L. Andries van der Ark, Department of Methodology and Statistics, Tilburg University

Description A well-known problem in the analysis of test and questionnaire data is that some item scores may be missing. Advanced methods for the imputation of missing data are available, such as multiple imputation under the multivariate normal model and imputation under the saturated logistic model (Schafer, 1997). Accompanying software was made available by, for example, Schafer (1998a, 1998b) and in SOLAS (2001) and S-Plus 6 for Windows (2001). However, these methods and software may be too complicated for a typical psychological researcher, and for the imputation of his or her missing data, he or she depends on the help of a trained statistician. If available, this statistician may not always have enough time or may not be an experienced software user, so the researcher may decide to simply delete all incomplete observations. To help researchers impute scores using simple methods, two SPSS subroutines were written. The aim of these subroutines is that researchers can apply them easily within SPSS and without experienced help. The subroutine “tw” performs two-way imputation, and the subroutine “rf” performs responsefunction imputation. Two-way imputation and response-function imputation are described by Sijtsma and Van der Ark (2003). Simulation studies by Van der Ark and Sijtsma (in press) indicate that these imputation methods work rather well when applied to an approximately unidimensional set of items (i.e., the items measure the same construct). The subroutines allow the researcher to transform an SPSS data file with missing values (an incomplete data file) into an SPSS data file without missing values (a completed data file). The researcher can use the completed data file for further analysis. To run the subroutines, one must select the variables containing the missing scores that need to be imputed, and some optional arguments also can be specified. For two-way imputation, the most important optional argument pertains to changing or removing the random error that is added to the imputed values by default. For response-function imputation, the most important optional argument pertains to changing the minimum group size used for estimating the response function (see Sijtsma & Van der Ark, 2003). Availability The SPSS syntax files, “tw.sps” and “rf.sps,” contain the subroutines that were written in SPSS 10 for Windows using the MATRIX-command (SPSS, 2000). They are available free of charge from http://www.uvt.nl/faculteiten/fsw/organisatie/departementen/mto/software2.html. A brief manual is also available from this Web site. The syntax files can be applied using SPSS 6 and later versions for Windows.

152

Applied Psychological Measurement, Vol. 29 No. 2, March 2005, 152–153 DOI: 10.1177/0146621603260688 © 2005 Sage Publications

J. R. VAN GINKEL and L. A. VAN DER ARK COMPUTER PROGRAM EXCHANGE

153

References Schafer, J. L. (1997). Analysis of incomplete multivariate data. London: Chapman & Hall. Schafer, J. L. (1998a). CAT: Software for S-PLUS Version 4.0 for Windows. Retrieved May 28, 2003, from www.stat.psu.edu/∼jls/sp40.html Schafer, J. L. (1998b). NORM: Software for S-PLUS Version 4.0 for Windows. Retrieved May 28, 2003, from www.stat.psu.edu/∼jls/sp40.html Sijtsma, K., & Van der Ark, L. A. (2003). Investigation and treatment of missing item scores in test and questionnaire data. Multivariate Behavioral Research, 38, 505-528. SOLAS. (2001). SOLAS for Missing Data Analysis 3.0 [Computer software]. Cork, Ireland: Statistical Solutions. S-Plus 6 for Windows [Computer software]. (2001). Seattle, WA: Insightful Corporation.

SPSS. (2000). SPSS Base 10.0 user’s guide [Software manual]. Chicago: Author. Van der Ark, L. A., & Sijtsma, K. (in press). The effect of missing data imputation on Mokken scale analysis. In L. A. van der Ark, M. A. Croon, & K. Sijtsma (Eds.), New developments in categorical data analysis for the social and behavioral sciences. Mahwah, NJ: Lawrence Erlbaum.

Author’s Address Joost R. van Ginkel, Department of Methodology and Statistics, Tilburg University, P.O. Box 90153, 5000 LE Tilburg, The Netherlands; phone: +31 13 466 8046; fax: +31 13 466 3002; e-mail: [email protected].

Suggest Documents