Q-matrix and variational autoencoders to estimate multidimensional item response theory models with correlated and independent latent variables

VAE Result Continue…

Mahbubul Hasan

11/18/2021

Research Questions

## Data

Data

Method

Research Model

(Research model: 1. low-high dimensional data from population, 2. VAE architecture, 3. synthesize data., 4. incorporate q-matrix,5. model parameter estimation)

What is latent variable?

What is Q-matrix and What can we do with Q-matrix?

Q-matrix & how to use it in VAE model?

Multidimensional item response theory(MIRT)

For example, if one student answers only questions 1 and 4 incorrect, and another student answers only questions 3 and 7 incorrect, they have the same percentage score. But it is not likely that the two students share the same latent trait values. Questions 3 and 7 may have tested a different skill than items 1 and 4, and could vary greatly in difficulty level. 

Multidimensional Logistic 2-Parameter (ML2P) model

ML2P-VAE model

\(Q(zi) = \frac{1}{1+e^{-z_i}}\)

Model parameters

Get parameter estimates for Model after training

Load in true values (included in this pacakge)

#disc_true <- as.matrix(disc_true) 
#diff_true <- as.matrix(diff_true)
#theta_true<- as.matrix(theta_true)

Assumes latent traits are correlated

(Small sample: ML2P-VAE parameter estimates for data set with 35 items and 6 correleted latent traits and 10000 student.Each color corresponds to discrimination parameters related to one of the 6 latent traits. for difficulty paramterter by itself with 35 items)

Assumes latent traits are correlated(continue…)

(Small sample: ML2P-VAE parameter estimates for data set with 40 items and 6 correlated latent traits and 18,000 student.Each color corresponds to discrimination parameters related to one of the 6 latent traits. For difficulty parameter by itself with 40 items)

Assumes latent traits are correlated(continue…)

(Large sample: ML2P-VAE parameter estimates for data set with 50 items and 6 correlated latent traits and 25,000 student.Each color corresponds to discrimination parameters related to one of the 6 latent traits. For difficulty parameter by itself with 50 items)

Assumes latent traits are correlated(continue…)

(Large sample: ML2P-VAE parameter estimates for data set with 200 items and 20 correlated latent traits and 60,000 student.Each color corresponds to discrimination parameters related to one of the 20 latent traits. For difficulty parameter by itself with 200 items)

Assumes latent traits are independent

(Small sample: ML2P-VAE parameter estimates for data set with 35 items and 6 independent latent traits and 10000 student.Each color corresponds to discrimination parameters related to one of the 6 latent traits. for difficulty parameter by itself with 35 items)

Assumes latent traits are independent(continue…)

(Small sample: ML2P-VAE parameter estimates for data set with 40 items and 6 independent latent traits and 18,000 student.Each color corresponds to discrimination parameters related to one of the 6 latent traits. For difficulty parameter by itself with 40 items)

Assumes latent traits are independent(continue…)

(Large sample: ML2P-VAE parameter estimates for data set with 50 items and 6 independent latent traits and 25,000 student.Each color corresponds to discrimination parameters related to one of the 6 latent traits. For difficulty parameter by itself with 50 items)

Assumes latent traits are independent(continue…)

(Large sample: ML2P-VAE parameter estimates for data set with 200 items and 20 independent latent traits and 60,000 student.Each color corresponds to discrimination parameters related to one of the 20 latent traits. For difficulty parameter by itself with 200 items)

Q-matrix Misspacification/Validation

(Small sample: Q-matrix Misspacification plots comparison with specified q-matrix vs under, over and mixed method for both 20 percent and 40 percent changes of items. Figure shows how data point goes below 45 degree once different method of misspacifications are applied. RMSE and Cor score also confirms this changes)

Q-matrix Misspacification/Validation(continue…)

(Large sample: Q-matrix Misspacification plots comparison with specified q-matrix vs under, over and mixed method for both 20 percent and 40 percent changes of items. Figure shows how data point goes below 45 degree once different method of misspacifications are applied. RMSE and Cor score also confirms this changes)

Q-matrix Misspacification/Validation(continue…)

(Error measures for ability (theta) parameters from various parameter estimation methods on two different data sets. Table shows how RMSE and BIAS and Cor score changes as we misfit q-matrix. other methods are kept blank intentionally)

Thank you!

Questions?