IRT-ZIP Modeling for Multivariate Zero-Inflated Count Data Author(s): Lijuan Wang Source: Journal of Educational and Behavioral Statistics, Vol. 35, No. 6 (December 2010), pp. 671-692 Published by: American Educational Research Association and American Statistical Association Stable URL: http://www.jstor.org/stable/40959474 Accessed: 11-04-2016 18:41 UTC REFERENCES Linked references are available on JSTOR for this article: http://www.jstor.org/stable/40959474?seq=1&cid=pdf-reference#references_tab_contents You may need to log in to JSTOR to access the linked references. Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at http://about.jstor.org/terms
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact
[email protected].
American Statistical Association, Sage Publications, Inc., American Educational Research Association are collaborating with JSTOR to digitize, preserve and extend access to Journal of Educational and Behavioral Statistics
This content downloaded from 129.74.250.206 on Mon, 11 Apr 2016 18:41:50 UTC All use subject to http://about.jstor.org/terms
Journal of Educational and Behavioral Statistics
December 2010, Vol. 35, No. 6, pp. 671-692 DOI: 10.3102/1076998610375838
© 2010 AERA, http://jebs.aera.net
IRT-ZIP Modeling for Multivariate Zero-Inflated Count Data Lijuan Wang University of Notre Dame
This study introduces an item response theory-zero-inflated Poisson (IRT-ZIP) model to investigate psychometric properties of multiple items and predict individuals' latent trait scores for multivariate zero-inflated count data. In the model, two link functions are used to capture two processes of the zero-inflated
count data. Item parameters are included to investigate item performance from
both propensity and level perspectives. The application of the model was illustrated by analyzing the substance use data from the National Longitudinal
Study of Youth. A simulation study based on the empirical data analysis scenario showed that the item parameters can be recovered accurately and precisely with adequate sample sizes. Limitations and future directions are discussed.
Keywords: multivariate zero-inflated count data; IRT-ZIP; item properties
In psychological and educational research, counts of a behavior or an event in a time interval of specified length often are collected through observations, surveys, or experiments. Cameron and Trivedi (1998) defined an event count as the number of times an event occurs, which is a realization of a nonnegative integer-
valued random variable. For example, in a classroom observational study, frequencies of specific behaviors such as effective interactions between students and their teacher are observed and recorded over a period of 10 minutes. Or, in a substance use study, the number of cigarettes smoked by teenagers in a day during the last 30 days can be collected via a survey questionnaire. The Poisson regression model (e.g., Cox & Lewis, 1966; Frome, Kutner, & Beauchamp, 1973; Gart, 1964; McCullagh & Neider, 1989) is a typical method for analyzing cross-
sectional count data. The mixed-effects Poisson factor model (Böckenholt, Kamakura, & Wedel, 2003) can be applied to analyze multivariate count data. Count data that contain excessive numbers of zeros are called zero-inflated
count data. Zero-inflated count data usually come from two distinct processes. One process is related to whether the event has a chance to happen, and the other process is related to the count of the event, given that the event does have a chance to happen. For example, in investigating the number of cigarettes smoked by a participant in a day during the last 30 days, some participants report zeros 671
This content downloaded from 129.74.250.206 on Mon, 11 Apr 2016 18:41:50 UTC All use subject to http://about.jstor.org/terms
Wang
because they are nonsmokers and some participants report zeros because they happen not to smoke during that period although they are smokers. We define the
former zeros (qualitative zeros) as perfect zero states and the latter zeros (quantitative zeros) as zero counts following a Poisson process. Zero-inflated count data can be analyzed using the zero-inflated Poisson model (ZIP model; Lambert, 1992). The ZIP model with random effects (ZIPR model; e.g., Hall, 2000; Lee,
Wang, Scott, Yau, & McLachlan, 2006; Liu & Powers, 2007; Min & Agresti, 2005; Rabe-Hesketh & Skrondal, 2007) can be used to analyze multilevel or longitudinal zero-inflated count data. What to do with measurement errors is always an issue for social and behavioral research. A typical method to reduce the effect of measurement errors is to measure a latent construct by collecting multivariate data using multiple man-
ifest variables. For continuous multivariate data, factor analyses are the typical
methods for examining the relationship between manifest variables and latent factors (e.g., Cattell, 1978; Fuller, 1987; McDonald, 1985). For categorical multivariate data, item response theory models (IRT models) are the typical methods to analyze the psychometric properties of items and to obtain latent trait scores for the measured constructs by modeling the relationship between items and trait scores (e.g., Embretson & Reise, 2000; Lord, 1980; Lord & Novick, 1968; Rasch,
1960). However, how to model multivariate zero-inflated count data to analyze psychometric properties of zero-inflated count items and how to obtain person trait scores from multiple zero-inflated count items were not discussed in previous lit-
erature. When analyzing multivariate zero-inflated count data, although one can form and model composite scores, a disadvantage of this approach is that the distribution of the composite scores is extremely skewed right due to excess zeros and the weights of items forming the composite scores are usually arbitrarily set to be 1. Therefore, the purposes of this study are to (a) introduce a model for
multivariate zero-inflated count data to analyze psychometric properties of zero-inflated count items and to predict person trait scores from multiple items; (b) explore an empirical data set to illustrate the application of the model; and (c) examine the performance of the proposed model via a simulation study with different sample sizes.
Review of Precursors of the Models ZIP Model
For handling univariate zero-inflated count data, Lambert (1992) introduced the ZIP model. The ZIP model uses two link functions, a logit link function and a log link function, to capture statistical features of the two processes: the perfect
zero state and the Poisson process, for zero-inflated count data. The model is expressed as follows: 672
This content downloaded from 129.74.250.206 on Mon, 11 Apr 2016 18:41:50 UTC All use subject to http://about.jstor.org/terms
IRT-ZIP Modeling
JO, with probability 1 - p¡
1 ~ 'Poisson(''), with probability p¡ log(A,) = Xfi iogit(pi)=z/ir,
where Yt is the observed datum for person i (i = 1,2, . . . , n). Therefore, we have
?v(Yi = 0) = (1 -Pi) +Pie-X> andPr (ÏJ = yt) = Pi6 ^ ¿ , yt = 1,2,... In this model, the logit link is used to predict the probability of subject i (pt) to be in the Poisson process. In addition, the log link is used to predict the expected mean (A,-) of event counts for subject /, given that the subject is not in the perfect
zero state, but rather in the Poisson process, x and z are the vectors of covariates
hypothesized to explain interindividual differences in the expected counts and interindividual differences in the types of states, x and z may or may not be the same sets of variables, ß and y are the associated regression parameters.
ZIP Model With Random Effects For handling nested zero-inflated count data, the ZIP models with random effects (the ZIPR models or multilevel ZIP models) were developed and applied
to analyze empirical data (e.g., Hall, 2000; Lee et al., 2006). For example, a modified version of the ZIPR model introduced by Lee, Wang, Scott, Yau, and
McLachlan (2006) can be expressed as follows:
Í 0, with probability l-py y ~ | Poisson (Xij) , with probability py l0g(A,y) = X^.ß + W;
logit(pij)=z^y + Ui, where Xy and ztJ represents theyth observations of the /th individual (i = 1, 2, . . . , n;
y=l,2, . . . , m). ß and y are the corresponding fixed-effects regression parameters. The vectors w,- and u,- are the subject random effects or random intercepts in two
equations that are assumed to be independently and normally distributed. Hall's (2000) model is a reduced model of the above model, which contains a random intercept parameter only in the log equation. By including random effects into the ZIP model, we can account for dependence of observations within clusters. How-
ever, a limitation of the ZIPR model is that it only analyzes univariate zeroinflated count variable and item information cannot be obtained because no item
parameters are included in the model. 673
This content downloaded from 129.74.250.206 on Mon, 11 Apr 2016 18:41:50 UTC All use subject to http://about.jstor.org/terms
Wang
Item Response Models For analyzing multivariate categorical data, item response modeling (e.g., Embretson & Reise, 2000; Lord, 1980) probably is the most widely used method, especially when individual trait scores and item psychometric features are of interest. There is a series of item response models for different kinds of catego-
rical data. Here, we focus only on the 2-parameter (2PL) item response model for binary data, because it is most pertinent to this study. The 2PL model is used to predict the probability (py) of a person endorsing an item while considering the item- varying parameters, ßj for item location (or difficulty) parameters and olj for item slope (or discrimination) parameters, which allow for different weights for different items, and the person- varying latent trait variable 0,. The 2PL model is
expressed as
lo&Hpv) = aJ(Oi-ßj). As ay increases, the item discriminates more precisely among trait levels. When OLj is fixed to be 1, the 2PL model is reduced to be a Rasch model (Rasch, 1960) or
a 1PL model. As ßj increases, the item is more and more difficult to endorse. This 2PL model has been shown to be mathematically equivalent to the confirmatory
factor analysis model for binary data (Takane & de Leeuw, 1987). The IRT models can be expressed as generalized mixed/multilevel models (e.g., Adams,
Wilson, & Wu, 1997; Rijmen, Tuerlinckx, De Boeck, & Kuppens, 2003). Raudenbush, Johnson, and Sampson (2003) also reexpressed the Rasch model and the 2PL model as a two-level logistic model by including dummy variables indicating item numbers.
Proposed Model In this section, we propose an IRT zero-inflated Poisson model (IRT-ZIP model) to handle multivariate zero-inflated count data. In the model, two sets of item parameters (item slope parameters and item location parameters) are used to explain item characteristics, and a common latent trait variable is included to obtain information on person trait scores. The model is described as follows. The observed zero-inflated count data indicate
f0, with probability 1 - ptj ij ~ 'Poisson('ij), with probability p¡j Therefore,
Pr(Yij = 0) = ('-pij)+pije-x«
Pt/e^M y (la) Pr{Yy=yv)=P9 Pt/e^M yvl y , ^ = 1,2,.... The parameters ptj and Xy can be modeled by 674
This content downloaded from 129.74.250.206 on Mon, 11 Apr 2016 18:41:50 UTC All use subject to http://about.jstor.org/terms
IRT-ZIP Modeling
MA,) = %j = yLi a"«^ (e> - TLi ft«7»») ' and 0°0 ö
^ I |i"!i1s-ÃŒ!lli.Ã-§||Ã-= u 1 CO
n
un co" -h" g Soocog^l «
S
'S
'S
'i
o g g'oS g^ü c^l £ £
N
S
S
^
680
This content downloaded from 129.74.250.206 on Mon, 11 Apr 2016 18:41:50 UTC All use subject to http://about.jstor.org/terms
IRT-ZIP Modeling increased over time for this NLSY97 sample. In other words, more and more participants became involved with substance use and the level of substance use was generally increasing with age. Results From the IRT-ZIP Models
To investigate psychometric properties of the 3 items and analyze how the item parameters change over time, the IRT-ZIP models were fitted to data of
each wave. Table 2 presents the item parameter estimates with fixed slope parameters (=1) and Table 3 displays the item parameter estimates without any parameter constraints. Key findings from Tables 2 and 3 are as follows. First, based on the likelihood ratio test, the models without any parameter constraints significantly fitted the data better than the models with parameter constraints for all eight waves. This indicates that the discrimination ability of different items in estimating trait levels differs by items. However, this conclusion is tempered by the fact that the differences in the fit between the two models
are not very large compared to the large sample size in the study. Second, the order of magnitude of the location parameter estimates was exam-
ined within each wave. The results showed that the orders of magnitude of the
location parameter estimates were similar across eight waves and between the two nested models. For example, for the location parameter estimates of the log
equations, the estimates of the alcohol use items were larger than those of the cigarette use items and the marijuana use items. This indicates that it is less likely
for a person with a fixed substance use level to reach the same amount of expected frequency in alcohol use as in cigarette use and marijuana use. This result was consistent with our descriptive statistics in the rows of "use frequencies" in Table 1 ? For the location parameters of the logit equations, the estimates
of the marijuana use items are all larger than those of the cigarette use items, while the estimates of the cigarette use items are larger than those of the alcohol
use items. This indicates that it is less likely for a person with a fixed substance use level to be in the Poisson process on marijuana use than on cigarette use and
on alcohol use. In other words, it is less likely on average for a person to be involved in marijuana use than cigarette use or alcohol use. And this result is also consistent with the descriptive statistics in the first six rows in Table 1 .
Third, after comparing the item location parameter estimates across eight waves, we find that the three estimates of the log equations are generally getting smaller and smaller, which means that overall the sample had increasing frequencies in substance use. This result is also consistent with the descriptive statistics
on "use frequencies" in Table 1. The location parameter estimates of the logit equations for the cigarette use item and alcohol use item are generally decreasing over time, which also means that the participants had increasing likelihoods of becoming involved with substance use or being substance users. The location parameter estimates of the logit equation for marijuana use fluctuated over time 681
This content downloaded from 129.74.250.206 on Mon, 11 Apr 2016 18:41:50 UTC All use subject to http://about.jstor.org/terms
« sss ¿¿s o g
cd (N00O VOf^Tj- - ~ «S £> cnooTf »ooTf Pc« ^ ^o-h' Ö ^ co S
III
ë
1 a
S
I
I"S
(N (N (N CO m *O b
t^ ooo pop ^ â
o^ o, o,t^o, o ¿ o^ ^ ^ « 'S ^*>*>r-ï n O' n; «ri (N p. S O -H* Ö oo O CO W3 III I J r? (NT r? c^T c? »rT ^
vo OOO OOO r-srë ^
o> OOO ¿¿S ri S H 00
c« »o »o >n ot^r^ ^r§
*> *>¡>^(NOOcn o ^h' VOVOCN ö ö m"f^co -g III
I
£
888 S88 |
s
1
I
« é¿¿ ¿¿¿ ^ •? >^ ^^o-h ^oo-h i>cn-: oom ^g „ III
1
1
„
«
&
M fîr? rööö ööen '5b §3g rt c5 TfOOl^ OOTj-Tt ^^^
::: ... si
SSS SS§ ^11
u ü ooö ööö ^_J.2wj
u ü www www ^ *5 a
I 'S
I
¿^ ^ooo incoio " - III 5 S « ^ M
9S
687
This content downloaded from 129.74.250.206 on Mon, 11 Apr 2016 18:41:50 UTC All use subject to http://about.jstor.org/terms
Wang
Pearson correlations were calculated between the person trait estimates and the true values when the sample size is 2,000. The average correlation was .81 (SD = 0.01). The average correlation between the composite scores and the true
values also was calculated and was .54 (SD = 0.05). The correlations indicated that the proposed method should be more appropriate than the composite score method for estimating the substance use of youths.
In terms of accuracies of predictions on the probabilities of being in the Poisson state (pi) and the expected counts (A,-), Pearson correlations between the
predictions from the model and the true values were calculated and averaged across different replications with the sample size of 2,000. The correlations for the probabilities were .85, .79, and .89, respectively, and the correlations for the expected counts are .94, .92, and .93 for the 3 items, respectively. The correlations from the simulation studies are higher than the corresponding correlations from the empirical data mainly because that the simulated data were simulated from the proposed model and thus followed the model assumptions more strictly. Discussion and Conclusion
This study introduced an IRT-ZIP model to analyze multivariate zero-inflated count data using two link functions: a log link function and a logit link function. By including two link functions, the probability of being in the Poisson process and the
expected frequency of an event given that participant is in the Poisson process can
be modeled simultaneously. A latent person trait variable is included in the model to measure latent trait scores. Both item slope and location parameters are included
in the model to investigate psychometric properties of the multiple items.
The model was applied to analyze the substance use data to illustrate the applications. The empirical results provided rich information regarding both the sub-
stance use of adolescence and the model performance. The order of magnitude of the item location parameter estimates in the logit equations indicates that it is less
likely for a person with a fixed substance use level to be involved with marijuana
use than cigarette use and alcohol use. This result is partly consistent with the "gateway hypothesis" in substance use research (Kandel, 2003; Morral, McCaffrey, & Paddock, 2002). Kandel (2003) described the three propositions in the "gateway hypothesis," which are sequencing (fixed relationship between two substances such that one substance is regularly initiated before the other), association (initiation of one substance increases the likelihood of initiation of
the second substance), and causation (a controversial proposition, use of the first substance causes the use of the second substance). The application of the IRT-ZIP model to the substance use data in this study is based on the association proposition and the results were consistent with the sequencing proposition from this naturalistic population example with a novel method. Overall, the associations and differences among the 3 items measuring latent substance use levels
were investigated successfully using the IRT-ZIP model. The consistency 688
This content downloaded from 129.74.250.206 on Mon, 11 Apr 2016 18:41:50 UTC All use subject to http://about.jstor.org/terms
IRT-ZIP Modeling between some of the current findings and previous findings supports the reason-
ableness and reliability of the results from the IRT-ZIP model. Simulation studies also were conducted to examine the performance of the model under the empirical scenario. The simulation results showed that, similar
to regular IRT models (Thissen & Wainer, 1982, 2001), fitting the IRT-ZIP model requires large sample sizes to obtain stable item parameter estimates. For a 2PL IRT model, rules of thumb for the minimum number of examinees required
for accurate parameter estimation range from 500 (Hulin, Lissak, & Drasgow, 1982) to 1,000 (Ree & Jensen, 1980). For the proposed IRT-ZIP model, in light of the tested simulation conditions, a sample size of 1,000 seems large enough to recover the parameter estimates. For large-scale psychological and educational observational or survey research, this might not be a big issue. However, in many practical psychological and educational experiments, only a small number of participants' data are collected and thus possibly preclude or at least limit the use of the proposed method. In terms of recovering the person trait scores, the performance of the IRT-ZIP model was better than the composite score method. This is because the IRT-ZIP model considers the natural features of the multivariate
zero-inflated count data. By including two link functions, we can model two pro-
cesses more accurately and validly. The model thus can be useful in large-scale psychological or educational settings where count data on the number of behaviors are available.
Limitations and Future Directions
There are several issues that should be investigated in the future. First, the
empirical data characteristics include a large sample size (N > 7,000) and repeated measures making them particularly apt for illustrating the application of the proposed model. However, the data were limited to only 3 items. With 3 items, the correlation between the person trait estimates and the true values was
only about .81 under the empirical data scenario, in accord with the simulation results. Further simulations should examine the performance of the model on estimating both item parameters and trait scores with larger numbers of items and
different proportions of zeros. Examples of applying the proposed models to problem behaviors or other psychological research with a larger number of items can also be included in future studies. Second, because the empirical data contain repeated measurements, change patterns of substance use levels from adoles-
cence to young adulthood could be investigated here. The proposed IRT-ZIP model can be extended to a three-level mixed-effects model, which involves estimating item parameters, latent person trait scores, and longitudinal change patterns simultaneously in future studies. Third, in the current proposed model, only one common latent variable is included. However, it is possible to include
two latent variables to investigate the propensity and level scores individually. Future studies can also be done to investigate the multidimensionality. 689
This content downloaded from 129.74.250.206 on Mon, 11 Apr 2016 18:41:50 UTC All use subject to http://about.jstor.org/terms
Wang In summary, both the simulation results and the empirical results supported
the conclusion that the proposed IRT-ZIP model is a valid and feasible method for analyzing multivariate zero-inflated count data more precisely and reasonably. The work so far should encourage researchers to apply the IRT-ZIP model to more substantive research areas and thereby answer more interesting empirical research questions.
Appendix
SAS Scripts for Fitting an IRT-ZIP Model proc nlmixed data=realdata qpoints=5; bounds al-a6>0;
parms al = l a2=l a3=l a4=l a5=l a6=l betall=0 betal2=0 betal3=0 beta21=0 beta22=0 beta23=0;
slopel=al*il+a2*i2+a3*i3; Slope2=a4*il+a5*i2+a6*i3; etal=slopel*(theta-betall*il-betal2*i2-betal3*i3); eta2=slope2*(theta-beta2 1 *i 1 -beta22*i2-beta23 *i3);
lambda=exp(eta 1 );
p=exp(eta2)/(l+exp(eta2)); IF (resp=0) then pai=(l-p)-h p*exp(-l*lambda);
ELSEpai=p*exp(-l*lambda)*(lambda**resp)/gamma(resp+l); II=LOG(PAI);
model resp-GENERAL(II); random theta^normal(0,l) subjected; predict theta out=personparm;
predict p out=pout; predict lambda out=lambdaout; run;
Acknowledgment The author is grateful for the helpful comments on this study from John R.
Nesselroade, John J. McArdle, Steven M. Boker, Karen M. Schmidt, Xiaohui Wang, and the reviewers. Notes
1 . If the answer from Question 1 is 0, then the value of this variable is set to be
0; if the answer from question 1 is not 0 but the answer from Question 2 is 0, then
the value is also set to be 0; otherwise, the value is set to be the answer from Question 3. 690
This content downloaded from 129.74.250.206 on Mon, 11 Apr 2016 18:41:50 UTC All use subject to http://about.jstor.org/terms
IRT-ZIP Modeling 2. It should be mentioned that for the marijuana use data, the observed frequency represents 30 days, and for the alcohol use and cigarette use, the frequency represents 1 day.
References
Adams, R. J., Wilson, M, & Wu, M. L. (1997). Multilevel item response models: An approach to errors in variables regression. Journal of Educational and Behavioral Statistics, 22, 47-76. Böckenholt, U., Kamakura, W. A., & Wedel, M. (2003). The structure of self-reported emotional experiences: A mixed-effects Poisson factor model. British Journal of Mathematical & Statistical Psychology, 56, 215-229. Cameron, C, & Trivedi, P. K. (1998). Regression analysis of count data, Econometric Society Monograph No. 30. Cambridge, England: Cambridge University Press. Cattell, R. B. (1978). The scientific use of factor analysis. New York, NY: Plenum.
Cox, D. R., & Lewis, P. A. W. (1966). The statistical analysis of series of events. (Methuen Monograph). London, England: Methuen. Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. Mahwah, NJ: Erlbaum.
Frome, E. L., Kutner, M. H., & Beauchamp, J. J. (1973). Regression analysis of Poisson-distributed data. Journal of the American Statistical Association, 68, 935-940.
Fuller, W. A. (1987). Measurement error models. New York, NY: Wiley. Gait, J. J. (1964). The analysis of Poisson regression with an application in virology. Biometrika, 51, 517-521. Hall, D. B. (2000). Zero-inflated Poisson and binomial regression with random effects: A case study. Biometrics, 56, 1030-1039.
Hulin, C. L., Lissak, R. I., & Drasgow, F. (1982). Recovery of two- and three- parameter
logistic item characteristic curves: A Monte Carlo study. Applied Psychological Measurement, 6, 249-260. Kandel, D. B. (2003). Does marijuana use cause the use of other drugs? Journal of the American Medical Association, 289, 482^483. Lambert, D. (1992). Zero-inflated Poisson regression, with an application to the defects in manufacturing. Technometrics, 34, 1-14.
Lee, A. H., Wang, K., Scott, J. A, Yau, K. K. W., & McLachlan, G. J. (2006). Multi-level zero-inflated Poisson regression modelling of correlated count data with excess zeros. Statistical Methods in Medical Research, 15, 47-61. Liu, H., & Powers, D. A. (2007). Growth curve models for zero-inflated count data: An
application to smoking behavior. Structural Equation Modeling, 14, 247-279. Lord, F. M. (1980). Applications of item response theory to practical testing problems. Hillsdale, NJ: Erlbaum. Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison- Wesley.
McCullagh, P., & Neider, J. A. (1989). Generalized linear models (2nd ed.). London, England: Chapman and Hall. McDonald, R. P. (1985). Factor analysis and related methods. Hillsdale, NJ: Erlbaum. 691
This content downloaded from 129.74.250.206 on Mon, 11 Apr 2016 18:41:50 UTC All use subject to http://about.jstor.org/terms
Wang Min, Y. Y., & Agresti, A. (2005). Random effect models for repeated measures of zeroinflated count data. Statistical Modeling, 5, 1-19.
Morral, A. R., McCaffery, D. F., & Paddock, S. M. (2002). Reassessing the marijuana gateway effect. Addiction, 97, 1493-1504. Pinheiro, J. C, & Bates, D. M. (1995). Approximation to the log-likelihood function in the
nonlinear mixed-effects model. Journal of Computational and Graphical Statistics, 4, 12-35.
Rabe-Hesketh, S., & Skrondal, A. (2007). Multilevel and latent variable modeling with composite links and exploded likelihoods. Psychometrika, 72, 123-140. Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Chicago, IL: University of Chicago Press. Raudenbush, S. W., Johnson, C, & Sampson, R. (2003). A multivariate, multilevel Rasch model with applications to self-reported criminal behavior. Sociological Methodology, 33, 169-211. Ree, M. J., & Jensen, H. E. (1980). Effects of sample size on linear equating of item characteristic curve parameters. In D. J. Weiss (Ed.), Proceedings of the 1979 computerized
adaptive testing conference. Minneapolis: University of Minnesota. Rijmen, F., Tuerlinckx, F., De Boeck, P., & Kuppens, P. (2003). A nonlinear mixed model framework for item response theory. Psychological Methods, 8, 185-205. Takane, Y., & de Leeuw, J. (1987). On the relationship between item response theory and
factor analysis of discretized variables. Psychometrica, 52, 393-408. Thissen, D., & Wainer, H. (1982). Some standard errors in item response theory. Psycho-
metrica, 47, 397^12. Thissen, D., & Wainer, H. (2001). Test scoring. Hillsdale, NJ: Erlbaum. Wolfinger, R. (1999). Fitting nonlinear mixed models with the new NLMIXED procedure.
SUGI 24 Conference Proceedings, Paper 287. Cary, NC: SAS Institute. Author
LIJUAN WANG, PhD, is an assistant professor at the University of Notre Dame, Notre Dame, IN, 46556; Email:
[email protected]. Her research interests are longitudinal data analysis and analysis of nonnormal data.
Manuscript received May 12, 2009 Revision received January 17, 2010 Accepted February 28, 2010
692
This content downloaded from 129.74.250.206 on Mon, 11 Apr 2016 18:41:50 UTC All use subject to http://about.jstor.org/terms