Multidimensional Models - Measurement Models in Assessment and Evaluation

Measurement Models in Assessment and Evaluation

7.4 Multidimensional Models

Note that δyk can be interpreted as a shift in ability that is proportional to the response level on item k. An LM statistic can be defined as a quadratic form in the first-order derivatives with respect to δ.

The relative importance of these ability dimensions in the responses to specific items is modeled by item-specific loadings akq and the relation between the ability dimensions in some population of respondents is modeled by the correlation between the ability dimensions.

Figure 7.6 Item response surface for a multidimensional IRT model (Reckase, 1977).

In the example of Figure 7.6, the probability of a correct response does not go to zero if the abilities go to minus infinity. In that case, the model must be extended to

by introducing a guessing parameter ck. A comparable model using a logistic rather than a normal-ogive representation has been proposed by Reckase (1985, 1997) and Ackerman (1996a and 1996b).

As in the unidimensional case, restrictions have to be imposed on the parameters to identify the model. One approach to identify the model is setting the mean and the covariance matrix equal to zero and the identity matrix, respectively, and introducing the constraints ajq=0, j=1,…, Q−1 and q=j+1,…,Q. So here the latent ability dimensions are independent and it is assumed that the first item loads on the first dimension only, the second item loads on the first two dimensions only, and so on, until item Q−1, which loads on the first Q−1 dimensions. All other items load on all dimensions. An alternative approach to identifying the model is setting the mean equal to the zero, considering the covariance parameters of proficiency distribution as unknown estimands. The model is then further identified by imposing the restrictions, αjq=1, if j=q, and αjq=0, if j≠q, for j=1,…, Q and q=1,…, Q. So here the first item defines the first dimension, the second item defines the second dimension, and so forth, until item Q which defines the Q-th

dimension. Further, the covariance matrix Σ describes the relation between the thus defined latent dimensions.

In general however, these identification restrictions will be of little help to provide an interpretation of the ability dimensions. Therefore, as in an exploratory factor analysis, the factor solution is usually visually or analytically rotated. Often, the rotation scheme is devised to approximate Thurstone’s simple-structure criterion (Thurstone, 1947), where the factor loadings are split into two groups, the elements of the one tending to zero and the elements of the other toward unity.

As an alternative, several authors (Glas, 1992; Adams & Wilson, 1996; Adams, Wilson & Wang, 1997; Béguin & Glas, 2001) suggest to identify the dimensions with subscales of items loading on one dimension only. The idea is to either identify these S<Q subscales a priori in an confirmatory mode, or to identify them using an iterative search. The search starts with fitting a unidimensional IRT model by discarding non- fitting items. Then, in the set of discarded items, items that form a second unidimensional IRT scale are identified, and this process is repeated until S subscales are formed. Finally, the covariance matrix Σ between the latent dimensions is estimated either by imputing the item parameters found in the search for subscales, or concurrently with the item parameters leaving the subscales intact.

Several methods have been proposed to estimate the model. The first approach is to use a two-step procedure where the first step consists of estimating the covariance matrix of the latent variables using tetrachoric correlations and the second step consists of factor analyzing this matrix using standard software (LISREL, Jöreskog & Sörbom, 1996; EQS, Bentler, 1992; LISCOMP, Muthén, 1987). A second approach, developed by McDonald (1967, 1982), is based on an expression for the association between pairs of items derived from a polynomial expansion of the normal ogive. The procedure is implemented in NOHARM (Normal-Ogive harmonic Analysis Robust Method, Fraser, 1988). The third approach, using all information in the data, and therefore labeled “‘Full Information Factor Analysis’”, was developed by Bock, Gibbons and Muraki (1988). This approach is a generalization of the marginal maximum likelihood (MML) estimation procedure for unidimensional IRT models (see, Bock & Aitkin, 1981), and has been implemented in TESTFACT (Wilson, Wood & Gibbons, 1991). MML estimates for MIRT models with subscales can be obtained using CONQUEST (Wu, Adams & Wilson, 1997). Finally, fully Bayesian approaches with computational methods based on the Gibbs sampler were proposed by Shi and Lee (1998) and Béguin and Glas (2001). For an overview of the relative merits of the various procedures refer to the latter two articles.

For the generalization of the MIRT model to polytomous items, the same three approaches are possible as in the unidimensional case: adjacent-category models, continuation-ratio models and cumulative probability models. All three possibilities are feasible, but only the former and the latter will be discussed here to explicate some salient points.

In the framework of the cumulative probabilities approach, a model for polytomous items with Mk ordered response categories can be obtained by assuming Mk standard normal random variables, and Mk cut-off points ηikm for m=1,…, Mk. The probability that the response is in category m is given by

Takane and de Leeuw (1987) point out that also this model is both equivalent to a MIRT model for graded scores (Samejima, 1969) and a factor analysis model for ordered categorical data (Muthén, 1984). This model can be estimated using standard software for factor analysis (see above) or using a fully Bayesian approach in combination with a MCMC algorithm (Shi & Lee, 1998).

In the framework of adjacent categories models, the logistic versions of the probability of a response in category m can be written as

where h(θi, ak, ek) is some normalizing factor that assures the sum over all possible responses on an item is equal to one. The probability pkm(θi) is determined by the compound so every item addresses the abilities of a respondent in a unique way.

Given this ability compound, the probabilities of responding a certain category are analogous to the unidimensional partial credit model by Masters (1982). Firstly, the factor m indicates that the response categories are ordered and that the expected item score increases as the ability compound increases. And secondly, the item parameters bkh are the points where the ability compound has such a value that the odds of scoring either in category m-1 or m are equal.

7.5 Multilevel IRT Model

Dalam dokumen PDF Educational Evaluation, Assessment, and Monitoring (Halaman 172-175)