Q-matrix and variational autoencoders to estimate multidimensional item response theory models with correlated and independent latent variables

The Western Tennessee Chapter of the American Statistical Association:Annual Fall Chapter Symposium

Mahbubul Hasan(presenter), Faculty Mentors & Team Members- Dale Bowman, Lih Y Deng, Ching-Chi Yang, Jhon Sabatini, and John Hollander.

11/04/2021

Plan for this lecture

  1. Research Questions
  2. Data description
  3. Method:Q-matrix misspecification
  4. Generative model, why generative model?
  5. Autoencoder, Variational Autoencoder(VAE)
  6. Q-matrix & how to use it in VAE model?
  7. Multidimensional Logistic 2-Parameter (ML2P) Model
  8. VAE/deep learning frontiers in educational assessment
  9. Future directions or extensions

Which face is fake?

[Image:MITS191]

Research Questions

Data

Data

Method

Research Model

(Research model: 1. low-high dimensional data from population, 2. VAE architecture, 3. synthesize data., 4. incorporate q-matrix,5. model parameter estimation)

DataFrame(head)

Unsupervised learning

Generative model

Why generative model?

[Amini,2019]

What is latent variable?

What is autoencoder?

[Rocca,2019]

Why do we care about low dimentional Z?

What is VAE?

VAE optimization

Reparametrizing the sampling lyer

VAE: latent purturbation during training

VAE summary

What is Q-matrix and What can we do with Q-matrix?

Q-matrix & how to use it in VAE model?

What is difficulty and discrimination perameter?

Discrimination parameters

Multidimensional item response theory(MIRT)

For example, if one student answers only questions 1 and 4 incorrect, and another student answers only questions 3 and 7 incorrect, they have the same percentage score. But it is not likely that the two students share the same latent trait values. Questions 3 and 7 may have tested a different skill than items 1 and 4, and could vary greatly in difficulty level. 

Multidimensional Logistic 2-Parameter (ML2P) model

ML2P-VAE model

\(Q(zi) = \frac{1}{1+e^{-z_i}}\)

Model parameters

Get parameter estimates for Model after training

Load in true values (included in this pacakge)

#disc_true <- as.matrix(disc_true) 
#diff_true <- as.matrix(diff_true)
#theta_true<- as.matrix(theta_true)

Assumes latent traits are correlated

(Small sample: ML2P-VAE parameter estimates for data set with 35 items and 6 correleted latent traits and 10000 student.Each color corresponds to discrimination parameters related to one of the 6 latent traits. for difficulty paramterter by itself with 35 items)

Assumes latent traits are correlated(continue…)

(Small sample: ML2P-VAE parameter estimates for data set with 40 items and 6 correlated latent traits and 18,000 student.Each color corresponds to discrimination parameters related to one of the 6 latent traits. For difficulty parameter by itself with 40 items)

Assumes latent traits are correlated(continue…)

(Large sample: ML2P-VAE parameter estimates for data set with 50 items and 6 correlated latent traits and 25,000 student.Each color corresponds to discrimination parameters related to one of the 6 latent traits. For difficulty parameter by itself with 50 items)

Assumes latent traits are correlated(continue…)

(Large sample: ML2P-VAE parameter estimates for data set with 200 items and 20 correlated latent traits and 60,000 student.Each color corresponds to discrimination parameters related to one of the 20 latent traits. For difficulty parameter by itself with 200 items)

Assumes latent traits are independent

(Small sample: ML2P-VAE parameter estimates for data set with 35 items and 6 independent latent traits and 10000 student.Each color corresponds to discrimination parameters related to one of the 6 latent traits. for difficulty parameter by itself with 35 items)

Assumes latent traits are independent(continue…)

(Small sample: ML2P-VAE parameter estimates for data set with 40 items and 6 independent latent traits and 18,000 student.Each color corresponds to discrimination parameters related to one of the 6 latent traits. For difficulty parameter by itself with 40 items)

Assumes latent traits are independent(continue…)

(Large sample: ML2P-VAE parameter estimates for data set with 50 items and 6 independent latent traits and 25,000 student.Each color corresponds to discrimination parameters related to one of the 6 latent traits. For difficulty parameter by itself with 50 items)

Assumes latent traits are independent(continue…)

(Large sample: ML2P-VAE parameter estimates for data set with 200 items and 20 independent latent traits and 60,000 student.Each color corresponds to discrimination parameters related to one of the 20 latent traits. For difficulty parameter by itself with 200 items)

Q-matrix Misspacification/Validation

(Small sample: Q-matrix Misspacification plots comparison with specified q-matrix vs under, over and mixed method for both 20 percent and 40 percent changes of items. Figure shows how data point goes below 45 degree once different method of misspacifications are applied. RMSE and Cor score also confirms this changes)

Q-matrix Misspacification/Validation(continue…)

(Large sample: Q-matrix Misspacification plots comparison with specified q-matrix vs under, over and mixed method for both 20 percent and 40 percent changes of items. Figure shows how data point goes below 45 degree once different method of misspacifications are applied. RMSE and Cor score also confirms this changes)

Q-matrix Misspacification/Validation(continue…)

(Error measures for ability (theta) parameters from various parameter estimation methods on two different data sets. Table shows how RMSE and BIAS and Cor score changes as we misfit q-matrix. other methods are kept blank intentionally)

Corr latent traits model

Corr latent traits

VAE for educational assessment(summary)

Other thoughts/comments

Future directions

Thank you!

Questions?