README

The lordif package can be used to perform an ordinal (common odds-ratio) logistic regression differential item functioning (DIF) analysis using IRT theta estimates as the conditioning variable. The graded response model (GRM) or the generalized partial credit model (GPCM) is used for IRT parameter estimation. Items flagged for DIF are treated as unique to each group and group-specific item parameters are obtained. Non-DIF items serve as anchor items to the initial calibration. The procedure runs iteratively until the same set of items is flagged over two consecutive iterations, unless anchor items are specified a priori.

Quick start

Preparing data

Load the PROMIS Anxiety data (containing 766 observations on 29 five-point rating scale items measuring anxiety; try ?Anxiety for more information):

data("Anxiety")

The dataset also includes three demographic variables (age, gender, and education). Create a data frame containing only the item responses:

resp.data <- Anxiety[paste0("R", 1:29)]

gender <- Anxiety$gender

This numeric variable and coded as 0 = Male and 1 = Female. The lowest level serves as the reference group (0 = Male).

Running lordif

gender_DIF <- lordif(resp.data, gender)

By default, the graded response model is fitted to the response data and the likelihood-ratio \(\chi^2\) statistic is used to flag DIF items with \(\alpha = 0.01\).

Displaying DIF items

plot(gender_DIF, labels = c("Male", "Female"))

Saving DIF plots

pdf(...)
plot(gender_DIF, labels = c("Male", "Female"))
dev.off()

Flagging using empirical thresholds

The flagging criterion can be determined empirically through Monte Carlo simulations:

lordif(resp.data, gender, MonteCarlo = TRUE)

Estaglishing empirical thresholds through Monte Carlo simulations

The MonteCarlo = TRUE option above calls the montecarlo() function internally to generate and apply empirically determined thresholds in lieu of pre-specified or theoretically determined thresholds. We can call the montecarlo() function separately to generate empirical thresholds. The following generates empirical thresholds based on 200 DIF-free datasets (conforming to the dimension of the observed data, resp.data):

gender_DIF_MC <- montecarlo(gender_DIF, nr = 200)

Displaying empirical thresholds

plot(gender_DIF_MC)