Skip to contents

The text below is based on an ongoing simulation study, as of 2024-10-22.

Usage

RIbootRestscore(
  dat,
  iterations = 200,
  samplesize = 600,
  cpu = 4,
  output = "table",
  cutoff = 5
)

Arguments

iterations

How many bootstrap samples to run

samplesize

How large sample to use in each bootstrap

cpu

How many CPU's to use

output

Optional "dataframe", or "quarto" for knitr::kable() output

cutoff

Filter to include only rows with % of results above this value

data

Dataframe with only response data, with 0 as lowest response

Details

Item-restscore will often indicate false positives (item misfit when it is not misfitting) if the sample size is above 400 and there is one truly misfitting item in the data. If there is more than one misfitting item, false positives can occur at sample sizes of n = 150-200.

Conversely, when sample size is below n = 800, the detection rate of truly misfitting items is below 90%, particularly if misfitting items have location > 1.5 logits from the sample mean.

Thus, if one has a large dataset it may be useful to be able to use non-parametric bootstrapping with replacement to get a more nuanced view of the probability of items actually being misfit.