TIMSS: Math Achievement

Introduction

Present research project attempts to investigate math achievement of eight grade russian students by using OLS regression analysis and EFA. There is a total of three predictor variables two of which are associated with the presence of tablets and one indicating the number of books at home. Several control variables are also chosen and included in the regression model. Prior to the model construction, bivariate tests for the variables are conducted alongside the plotting of bivariate distributions. Theoretical background is provided in order to justify the hypotheses. EFA is conducted for the subset that contains variables indicating presence of various objects and possessions of the family of respondents in order to identify latent variables. Factor scores are later added to the final linear predictive model.

Preliminary preparations

The initial dataset contained 4780 observations of 436 variables, with 4517 observations remaining after only 20 variables were chosen for the further analysis and NA were deleted. Only 9 variables were chosen for the OLS regression analysis and re-named in order to make the interpretation easier. Apart from the math achievement which is numeric, all variables are coded as factors.

Descriptive statistics

Math achievement

vars n mean sd median trimmed mad min max range skew kurtosis se
1 4517 538.6858 79.27568 541.0424 539.7548 81.4219 273.0573 819.8235 546.7662 -0.1254828 -0.2051389 1.179546

The values range from 273.06 to 819.82, with the mean being equal to 538.69. In comparison to the mean math achievement in Singapore (616.5), the mean math achievement in Russia is significantly lower. Similarly to the Singaporean case, median (541.04) is bigger than the mean (538.69) which indicates that the distribution is a bit skewed: there are more observations with high math achievement values. That is also indicated by the negative value of the skew (-0.13). Since the kurtosis is negative, the distribution is platykurtic (-0.21). Apart from that, the distribution looks surprisingly normal and resembles the bell curve.

According to the TIMSS report published on the website for TIMSS and PIRLS International Study center, the average math achievement in Russia was equal to 538 with Russia standing at the top of the rating for non-east-asian countries, but falling behind Japan by 48 points. Since the gap was equal to 31 in 2011, it appears to be widening with Singapore standing at the very top of the overall rating. In comparison to the results of the TIMSS conducted in 2011, math scores of students in Russia didn’t significantly improve and the country was placed in the “Same average achievement” category. When results of TIMSS conducted in 2015 and 1995 were compared, however, Russia was placed in the “Higher average achievement” category.

Predictor variables

Number of books at home

As it was already established by extensive research in the area, academic performance of children is strongly positively correlated with the number of books at one’s home (Evans, Sikora & Kelley, 2014). In general there appears to be two conflicting explanations for such a phenomena. On one hand, “scholarly culture thesis” implies that scholarly culture contributes to the development of cognitive skills of children, their tastes and preferences in such a way it enables them to perform well at school. Children might develop a hobby of reading for pleasure, enrich their vocabulary, develop argumentative and analytical skills through the discussion of what they’ve read with their parents. On the other hand, according to the “elite closure / social reproduction / cultural reproduction” argument, it’s less about children’s skills as they are and more about teachers providing more opportunities to children that signal their high elite/cultural status. However, as it is indicated in the article of Evans, Sikora and Kelley, there is not much empirical evidence in support of this claim and it appears to be the case of reading practices not working in the same way status signals related to art do.

Whether the first or the second argument is applied, higher number of books at home is usually believed to be associated with higher academic performance. Taken as a base for the hypothesis for this study, that would mean that we expect to find a positive correlation between the number of books at home and math achievement of students. However, it has to be mentioned that the number of books is reported by the students themselves and might be different from how it really is.

As observed on the left bar chart, the majority of respondents indicated that they have from 26 to 100 books at home, with 38.8% choosing this option. The second most popular option chosen by the respondents was the “11-25 books” one with 29.4% picking it. Only 6.5% of respondents chose “0-10 books”. While it might be hypothesized that there exists a culture for reading in Russia which results in quite a large share of participants choosing the “26-100 books” option, it also has to be mentioned that eight grade students are most likely unaware of the actual number of books at their home and could simply pick that option as the one being in the middle. If the former is the case, the effect of the number of books at home might be weaker in Russia than it is in countries where people rarely read or buy books. The latter being true would result in that particular level of the factor not contributing to a statistically significant increase in the math scores.

As depicted on the box plot on the right, as one moves along the factor levels of the “books” variable indicating higher number of books at home, the median math scores keep rising for the first four levels. The median math score of those, who indicated that there are more than 200 books in their home seems to be a bit lower than that of respondents with 101-200 books at home. A table with descriptive statistics for each of the “books” factor levels in terms of math achievement is presented below to check what the precise difference between their mean and median scores is. As it can be already seen, the data supports the hypothesis since the median math score rises with the number of books at home. Further in the regression we would expect to see the increase in the number of books at home being associated in the increase in the predicted math achievement score.

Books vars n mean sd median trimmed mad min max range skew kurtosis se
0–10 books 1 294 514.1171 86.21753 517.1523 514.4245 94.63953 273.0573 719.3618 446.3045 -0.0858379 -0.4653014 5.028308
11–25 books 1 1330 525.0976 77.81189 529.4087 526.1386 80.78908 282.6379 732.3903 449.7524 -0.1375348 -0.3035723 2.133635
26–100 books 1 1751 541.2183 76.57222 541.1637 541.5559 78.12570 308.1848 777.5889 469.4041 -0.0297452 -0.2491015 1.829903
101–200 books 1 707 557.1524 79.39983 563.6180 559.2957 76.77207 307.1516 819.8235 512.6719 -0.2368695 0.1062879 2.986138
More than 200 1 435 556.6290 77.84036 560.6942 558.7776 76.31691 312.7139 756.8425 444.1286 -0.2420404 -0.1728280 3.732159

Both mean and median math achievement scores of those with more than 200 books at home are a bit lower than those of respondents with 101-200 books at home. Apart from such an unexpected drop, there is a general upward trend observed. All distributions have a negative skew indicating that there are more high math achievement scores on each of the levels. Only the distribution of math achievement scores on the “101 - 200” books level is characterized by a positive kurtosis indicating that the distribution is leptokurtic. In order to check if the difference in math achievement scores in statistically significant for these levels ANOVA is performed.

##               Df   Sum Sq Mean Sq F value Pr(>F)    
## data1$books    4   815416  203854   33.37 <2e-16 ***
## Residuals   4512 27565991    6109                   
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

P-value < .001 (<2e-16) means that the null hypothesis should be rejected (the probability to obtain such data that we have if H0 was true for the population is low), thus, the difference in the math scores across five groups accordingly to the number of books at home is statistically significant.

Personal tablet

There exists a debate on whether the usage of tablet computers does increase students performance or, on the contrary, has a negative effect on it. Supporters of the former claim that by using tablets in classroom teachers can encourage active participation during the class which, in turn, results in better students performance (Enriquez, 2010). On the other hand, the problem of information overload might lead to the students not being able to concentrate on particular information presented to them because multiple stimulus results in inability to focus and memorize (McEwen & Dubé, 2015). The topic is rather controversial and empirical evidence in support of both arguments is found.

In addition, the question that was used for the TIMSS itself does not specify whether the tablet one has is used for mainly educational purposes. Therefore, three hypotheses can be formulated.

  1. If a tablet is mostly used not for educational purposes, the presence of a tablet in use is associated with lower math achievement scores. This could be explained by children being distracted by the tablet and most likely wasting a large amount of time on leisure activities instead of focusing on studying.

  2. If a tablet is mostly used for studying, the presence of a tablet in use is associated with higher math achievement scores. This can be explained by the learning becoming more interactive and multiple stimulus being used to help children learn.

  3. If a tablet is mostly used for studying, the presence of a tablet in use is associated with lower math achievement scores. This can be explained by the overload problem.

There was a striking imbalance in terms of the share of those respondents who did have a personal tablet and those who didn’t: the latter only accounted for 15.1%. However, the median math achievement score was higher for those respondents with no tablet in personal use.

Selftab vars n mean sd median trimmed mad min max range skew kurtosis se
Yes 1 3835 536.8037 78.49988 538.6908 537.8300 80.51024 273.0573 819.8235 546.7662 -0.1201961 -0.1856804 1.267612
No 1 682 549.2690 82.76995 556.1940 550.8288 83.98118 310.6374 762.4422 451.8048 -0.1937453 -0.2936750 3.169425

As can be seen in the descriptive table, the median math score of those with no personal tablet is equal to 556.1940 and is bigger than that on those with a personal tablet (538.6908). Similarly, the mean math achievement is higher for those with no tablet: 549.2690 > 536.8037. Distribution is skewed and platykurtic in both cases. This supports either the first or the third hypothesis since it is still unknown what is the main purpose of use of a tablet is for each of the respondents.

Since there are only two groups, we can carry out the T-test instead of performing the ANOVA in order to check whether the difference in mean math scores for these two groups is statistically significant.

One of the assumptions we have to check before carrying out the test itself is the homogeneity of variances. The two hypothesis would be:

  • H0: Variances do not differ.
  • H1: Variances differ.
## 
##  F test to compare two variances
## 
## data:  as.numeric(data1$math) by data1$selftab
## F = 0.89948, num df = 3834, denom df = 681, p-value = 0.06621
## alternative hypothesis: true ratio of variances is not equal to 1
## 95 percent confidence interval:
##  0.7996111 1.0070762
## sample estimates:
## ratio of variances 
##          0.8994823

As we can see, the p-value is bigger than 0.05 (0.06621 > 0.05). Therefore, we don’t have enough evidence to reject the null hypothesis and conclude that variances do not differ. We also indicate it in the T-test function in what follows.

Hypotheses for the test itself:

  • H0: In population, two means of two groups DO NOT differ.
  • H1: In population, two means of two groups DO differ.
## 
##  Two Sample t-test
## 
## data:  as.numeric(data1$math) by data1$selftab
## t = -3.7892, df = 4515, p-value = 0.0001531
## alternative hypothesis: true difference in means between group Yes and group No is not equal to 0
## 95 percent confidence interval:
##  -18.914574  -6.015952
## sample estimates:
## mean in group Yes  mean in group No 
##          536.8037          549.2690

Since the p-value is small (0.0001531 < 0.05) the null hypothesis shall be rejected (because the probability to get the result we have if H0 was true is very low). Therefore, we conclude that the difference between mean math score of these two groups is statistically significant.

Shared tablet

Since the question is the same as it is in case of the tablet in personal use, it is still unclear what the main purpose of using a shared tablet is for each of the respondents. At a higher level a hypotheses might simply be about the presence of a shared and a personal tablet having the same effect (in terms of the direction) on the math achievement or not. If the direction is the same, it can be explained in a similar way (positive <- engaged interacted learning, negative <- distraction or overload). However, if the direction is the opposite the question of what is so different about tablets in personal and shared use arises. One of the possible suggestions would be the difference in the purpose of use: it can be hypothesized that while personal tablets are more likely to be used by children for leisure activities, shared tables are used for educational purposes more often because in such cases children have to justify why they need a tablet.

Since we have already observed that mean math achievement scores were higher for students who didn’t have a personal table, we can hypothesize that the opposite is the case for shared tablets: if another sibling or a parent owns a tablet, a child might have to justify why he/she needs a tablet and using it for educational purposes would be more appealing.

Similarly to the case of the tablet in personal use, the vast majority of respondents indicated having a shared tables in use (83.8%). What is striking in case of this variable in comparison to the former is that the median math achievement score is, in fact, higher for those who do have a tabled shared with other people in home. This supports the hypothesis made above.

Sharedtab vars n mean sd median trimmed mad min max range skew kurtosis se
Yes 1 3784 542.6973 78.16561 544.1606 543.7260 79.77062 306.9265 819.8235 512.8970 -0.1167763 -0.1899183 1.270692
No 1 733 517.9772 81.75465 518.7066 518.5645 87.94073 273.0573 753.6532 480.5959 -0.0981142 -0.3478073 3.019677

As presented in the table above, the mean math achievement score of those who have a tablet shared with other people in family is equal to 542.6973 and is higher than that of respondents with no shared tablet (517.9772). A similar situation is observed for the median math achievement score (544.1606 > 518.7066). Both distributions are skewed by more respondents having higher math achievement scores, and both are leptokurtic.

In order to check if the difference in mean math achievement scores is statistically significant between these two groups as it was in case of the personal tablet usage, we run a T-test.

One of the assumptions we have to check before carrying out the test itself is the homogeneity of variances. The two hypothesis would be:

  • H0: Variances do not differ.
  • H1: Variances differ.
## 
##  F test to compare two variances
## 
## data:  as.numeric(data1$math) by data1$sharedtab
## F = 0.91413, num df = 3783, denom df = 732, p-value = 0.1094
## alternative hypothesis: true ratio of variances is not equal to 1
## 95 percent confidence interval:
##  0.8155238 1.0202617
## sample estimates:
## ratio of variances 
##           0.914127

As we can see, the p-value is bigger than 0.05 (0.1094 > 0.05). Therefore, we don’t have enough evidence to reject the null hypothesis and conclude that variances do not differ.

Hypotheses for the test itself:

  • H0: In population, two means of two groups DO NOT differ.
  • H1: In population, two means of two groups DO differ.
## 
##  Two Sample t-test
## 
## data:  as.numeric(data1$math) by data1$sharedtab
## t = 7.7778, df = 4515, p-value = 9.084e-15
## alternative hypothesis: true difference in means between group Yes and group No is not equal to 0
## 95 percent confidence interval:
##  18.48903 30.95109
## sample estimates:
## mean in group Yes  mean in group No 
##          542.6973          517.9772

Since the p-value is small (9.084e-15 < 0.05) the null hypothesis shall be rejected (because the probability to get the result we have if H0 was true is very low). Therefore, we conclude that the difference between mean math scores of these two groups is statistically significant.

Control variables

Several control variables were also added to the regression equation since we might suspect that they effect either predictor variables, outcome variable or both and want to remove their effects from the equation. This is also done to improve interpretation which might be confusing in case of, for instance, Simpson’s paradox where the overall trend is the opposite of trends in individual groups. Control variables include gender, parental education and country of origin.

Gender

Unlike two previous variables, the sample is balanced in terms of gender of the respondents: there are only sligthly more boys in it (51.8 > 48.2). Judjuing by the boxplot on the right, median math score seems to be slightly higher for boys.

Gender vars n mean sd median trimmed mad min max range skew kurtosis se
Girl 1 2179 535.7442 79.73808 538.5804 536.9520 79.94751 273.0573 819.8235 546.7662 -0.1407540 -0.1532673 1.708194
Boy 1 2338 541.4274 78.76046 542.8349 542.3563 82.14488 307.1516 786.5518 479.4002 -0.1084828 -0.2654966 1.628868

We find the precise median math score values in the table above: median math scores of boys are, in fact, higher (542.8349 > 538.5804). The same goes for the mean values: 541.4274 > 535.7442. Both distributions are skewed and platykurtic.

The difference in mean math scores seems to be quite small and has to be tested in order to find if it’s actually statistically significant or not.

One of the assumptions we have to check before carrying out the test itself is the homogeneity of variances. The two hypothesis would be:

  • H0: Variances do not differ.
  • H1: Variances differ.
## 
##  F test to compare two variances
## 
## data:  as.numeric(data1$math) by data1$gender
## F = 1.025, num df = 2178, denom df = 2337, p-value = 0.5577
## alternative hypothesis: true ratio of variances is not equal to 1
## 95 percent confidence interval:
##  0.9438002 1.1132763
## sample estimates:
## ratio of variances 
##           1.024979

As we can see, the p-value is equal to 0.5577 and exceeds the significance level of 0.05. In other words, there is not enough evidence to reject the null hypothesis and we conclude that variance don’t differ.

Hypotheses for the test itself:

  • H0: In population, two means of two gender groups DO NOT differ.
  • H1: In population, two means of two gender groups DO differ.
## 
##  Two Sample t-test
## 
## data:  as.numeric(data1$math) by data1$gender
## t = -2.4089, df = 4515, p-value = 0.01604
## alternative hypothesis: true difference in means between group Girl and group Boy is not equal to 0
## 95 percent confidence interval:
##  -10.308599  -1.057829
## sample estimates:
## mean in group Girl  mean in group Boy 
##           535.7442           541.4274

Since the p-value is small (0.01604 < 0.05) the null hypothesis shall be rejected (because the probability to get the result we have if H0 was true is very low). Therefore, we conclude that the difference between mean math scores of two gender groups is statistically significant.

Country

The imbalance in the sample for the country variable is quite apparent: only 3.3% of the respondents weren’t born in Russia. Their median math achievement score appears to be slightly lower than that of respondents born in Russia.

Country vars n mean sd median trimmed mad min max range skew kurtosis se
Born in Russia 1 4368 538.8316 79.27657 541.2203 539.9625 81.70736 273.0573 819.8235 546.7662 -0.1321644 -0.2125026 1.199509
Born not in Russia 1 149 534.4124 79.39743 531.1304 533.7157 74.30472 311.9141 735.0309 423.1168 0.0697964 0.0141724 6.504491

Apart from the difference in median math achievement score (541.2203 > 531.1304), the difference in means is also present with it being smaller among respondents born not in Russia (538.8316 > 534.4124). Even though both distributions are skewed, there are more respondents with lower math achievement scores among those born not in Russia and the distribution is leptokurtic.

In order to check whether the difference in mean math achievement scores among these two groups is significant, T-test is carried out.

## 
##  F test to compare two variances
## 
## data:  as.numeric(data1$math) by data1$country
## F = 0.99696, num df = 4367, denom df = 148, p-value = 0.9501
## alternative hypothesis: true ratio of variances is not equal to 1
## 95 percent confidence interval:
##  0.7801853 1.2418778
## sample estimates:
## ratio of variances 
##          0.9969578

As we can see, the p-value is equal to 0.9501 and exceeds the significance level of 0.05. In other words, there is not enough evidence to reject the null hypothesis and we conclude that variance don’t differ.

## 
##  Two Sample t-test
## 
## data:  as.numeric(data1$math) by data1$country
## t = 0.6691, df = 4515, p-value = 0.5035
## alternative hypothesis: true difference in means between group Yes and group No is not equal to 0
## 95 percent confidence interval:
##  -8.529349 17.367784
## sample estimates:
## mean in group Yes  mean in group No 
##          538.8316          534.4124

Since the p-value is big (0.5035 > 0.05) the null hypothesis cannot be rejected. Therefore, we conclude that the difference between means of these two groups is not statistically significant.

The results of the test may be questionable because of the striking imbalance in the dataset. In order to solve this problem, oversampling and undersampling are performed at the same time to make the number of respondents in each group nearly equal.

## 'data.frame':    4517 obs. of  2 variables:
##  $ math   : num  570 506 523 433 389 ...
##  $ country: Factor w/ 2 levels "Yes","No": 1 1 1 1 1 1 1 1 1 1 ...
## [1] 0.03411172
## 
##  F test to compare two variances
## 
## data:  as.numeric(newdata1$math) by newdata1$country
## F = 0.95326, num df = 2233, denom df = 2282, p-value = 0.2557
## alternative hypothesis: true ratio of variances is not equal to 1
## 95 percent confidence interval:
##  0.8777629 1.0352848
## sample estimates:
## ratio of variances 
##          0.9532579
## 
##  Two Sample t-test
## 
## data:  as.numeric(newdata1$math) by newdata1$country
## t = 2.6637, df = 4515, p-value = 0.007757
## alternative hypothesis: true difference in means between group Yes and group No is not equal to 0
## 95 percent confidence interval:
##   1.714889 11.277485
## sample estimates:
## mean in group Yes  mean in group No 
##          539.0832          532.5870

Unfortunately, since p-value is still bigger than the significance level (0.1232 > 0.05), we have to admit that the initial conclusion was correct and the difference between means of these two groups is not statistically significant.

Parental education

Interestingly, while the largest share of respondents reported not knowing their father’s education (28.6%) the share of respondents who didn’t know their mother’s education was lower than the share of those who indicated it to be a Bacherlor’s or equivalent (25.7% > 15%). ‘Bacherlor’s or equivalent’ was the second most common option for the variable indicating father’s education (17.2%). The third most popular category was the ‘Post-secondary, non-tertiary’ option for both father’s and mother’s education. Shares of respondents reporting their parent’s only having some primary or lower secondary education were extremely small (0.2% and 0.5%).

In terms of the box plots, the general trend seems to be similar in both cases of parental education. While the general trend seems to be upward with median math achievement score rising as parental education gets to higher levels, median math achievement score of respondents who reported their parents having some primary or lower secondary education was higher that the median math score of respondents with parents who obtained a lower secondary education. Median math score of those who didn’t know the education of their parents was one of the lowest: we can hypothesize that there is a possibility of students whose parents have lower education preferring to say that thay don’t know what the education of their parents is. If, in fact, their parents can be ascribed to the “some primary or lower secondary education” category, that would drive the median score of the latter lower, making the general trend easier to understand.

Ed vars n mean sd median trimmed mad min max range skew kurtosis se
Some Primary or Lower secondary or did not go to school 1 7 514.6946 108.48242 531.9383 514.6946 92.54371 312.7139 646.6540 333.9401 -0.6074477 -0.9160710 41.002499
Lower secondary 1 492 504.5873 80.98609 504.2409 504.0263 84.59516 309.9401 739.5915 429.6514 0.0603578 -0.3713925 3.651135
Upper secondary 1 604 507.0375 78.95424 502.2818 505.3105 83.71570 282.6379 735.4748 452.8369 0.1954891 -0.3887144 3.212602
Post-secondary, non-tertiary 1 678 548.0093 73.00858 548.8516 549.0753 73.64548 310.6374 738.3907 427.7533 -0.1417502 -0.3116524 2.803878
Short-cycle tertiary 1 432 546.0860 73.69947 548.6751 547.6520 71.25224 306.9265 762.4422 455.5158 -0.2163085 0.0845101 3.545867
Bachelor’s or equivalent 1 1160 558.6289 71.92260 560.3628 559.6322 71.76946 340.1206 819.8235 479.7029 -0.0714999 0.0353499 2.111720
Postgraduate degree 1 466 569.1063 81.17303 575.1782 570.9561 83.82083 307.1516 767.6349 460.4833 -0.2688936 -0.0790803 3.760268
Don’t know 1 678 522.8036 76.82149 528.0815 524.4496 76.06783 273.0573 705.8814 432.8241 -0.2251237 -0.2154175 2.950312

Descriptive statistics are summarized in the table above. In order to check whether the difference in math acievement scores for different levels of parental education is statisctially significant, we proceed to ANOVA.

##                  Df   Sum Sq Mean Sq F value Pr(>F)    
## data1$edmother    7  2327279  332468   57.54 <2e-16 ***
## Residuals      4509 26054127    5778                   
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##                  Df   Sum Sq Mean Sq F value Pr(>F)    
## data1$edfather    7  1921077  274440   46.77 <2e-16 ***
## Residuals      4509 26460330    5868                   
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

In both cases P-value < .001 (<2e-16) means that the null hypothesis should be rejected (the probability to obtain such data that we have if H0 was true for the population is low), thus, the difference in math scores across different levels of education of parents is statistically significant.

Another bivariate distribution of interest in this case is the distribution of the number of books for each of the education levels. We can suspect that higher levels of parental education are associated with a bigger number of books at home in both cases. Firstly, we take a look at distributions for mother’s education.

What’s striking about the lowest possible educational level, “some primary or lower secondary education”, is that the share of respondents reporting having more than 200 books at home is the highest in comparison to other educational levels (28.6%). However, we need to keep in mind that there were only a few respondents reporting this level of their mother’s education and the results are not that reliable. Apart from this level, the share of respondents reporting having more than 200 books at home was steadily increasing with higher levels of mother’s education. At the same time share of respondents reporting having less than 10 books declines. Shares of respondents reporting remaining three categories slightly fluctuated.

Since in this case we are dealing with two categorical variables, Chi-squared test has to be performed instead of T-test.

0–10 books 11–25 books 26–100 books 101–200 books More than 200
Some Primary or Lower secondary or did not go to school 3 2 0 0 2
Lower secondary 76 204 161 38 13
Upper secondary 56 245 207 65 31
Post-secondary, non-tertiary 38 199 305 98 38
Short-cycle tertiary 21 130 164 67 50
Bachelor’s or equivalent 31 253 475 243 158
Postgraduate degree 12 75 190 91 98
Don’t know 57 222 249 105 45
## 
##  Pearson's Chi-squared test
## 
## data:  ct
## X-squared = 446.31, df = 28, p-value < 2.2e-16

We obtained the Chi-square statistic of 446.31 and a p-value equal to 2.2e-16 having the degree of freedom 28. P-value is a lot smaller than the significance level of 0.05, meaning that the probability to obtain the observed, or more extreme, results if the null hypothesis is true (variables are independent) is extremely low. Therefore we conclude that variables are not independent: the education level of respondent’s mother and the number of books at home are not independent.

Now we proceed to go through the same steps for father’s education. Table with descriptive statistics is presented below.

Ed vars n mean sd median trimmed mad min max range skew kurtosis se
Some Primary or Lower secondary or did not go to school 1 23 523.0153 108.35070 536.7389 527.4363 94.83894 311.9141 719.3618 407.4476 -0.3755879 -0.6718116 22.592683
Lower secondary 1 466 505.9230 79.25411 505.8465 506.3999 79.21341 282.6379 739.5915 456.9536 -0.0212454 -0.2419399 3.671376
Upper secondary 1 518 516.9538 77.15240 513.3934 515.7255 84.36265 313.0166 762.4422 449.4257 0.1445043 -0.3873860 3.389882
Post-secondary, non-tertiary 1 709 548.2601 74.98492 549.5374 549.5523 77.16401 309.3283 777.5889 468.2606 -0.1401615 -0.2864373 2.816118
Short-cycle tertiary 1 385 547.1812 72.79266 550.2453 547.8284 73.48339 358.8245 739.9707 381.1462 -0.0793597 -0.3823686 3.709857
Bachelor’s or equivalent 1 779 562.0063 73.61017 565.4501 562.5096 70.62829 358.2006 819.8235 461.6229 -0.0162757 0.0524767 2.637358
Postgraduate degree 1 345 576.5189 76.28275 587.9637 579.7077 71.07793 307.1516 756.8425 449.6909 -0.5120903 0.4204483 4.106925
Don’t know 1 1292 527.5458 78.57615 531.6270 528.5375 79.86980 273.0573 738.3907 465.3334 -0.1461551 -0.2362484 2.186047

The problem with the “some primary or lower secondary education” level that was observed in case of mother’s education does not appear to be present here, however, the share of respondent claiming they have more than 200 books at home for that level is still bigger than the same share for the “lower secondary” level (8.7% > 4.7%). Apart from that the trend in shares of respondents saying they have more than 200 books at home seems to be upward and stable, while the trend for shares of respondents reporting having less than 10 books at home is downward.

0–10 books 11–25 books 26–100 books 101–200 books More than 200
Some Primary or Lower secondary or did not go to school 8 7 5 1 2
Lower secondary 47 200 160 37 22
Upper secondary 47 201 184 56 30
Post-secondary, non-tertiary 34 217 314 97 47
Short-cycle tertiary 18 100 157 68 42
Bachelor’s or equivalent 23 156 314 163 123
Postgraduate degree 9 44 132 89 71
Don’t know 108 405 485 196 98
## 
##  Pearson's Chi-squared test
## 
## data:  ct
## X-squared = 366.99, df = 28, p-value < 2.2e-16

Just as it was with mother’s education p-value is smaller than 0.05 and we conclude that the education level of respondent’s father and the number of books at home are not independent.

Summary

  • Higher math achievement scores are observed for factor levels indicating a bigger number of books at home with an exception of the very last level “more than 200” mean math achievement score of which is slightly lower than that of “101-200” books level. The difference in math achievement between these groups is statistically significant.

  • Higher math achievement scores are observed in a group of students who indicated not having a personal tablet while the situation was the opposite for the presence of a shared tablet: higher math achievement values were observed when students had a shared tablet. The difference in math achievement between the first two and the second two groups is statistically significant.

  • Higher math achievement scores are observed for boys and the gender difference in math achievement scores is statistically significant.

  • Higher math achievement scores are observed among students who were born in Russia. However, the difference in math achievement between two groups (born in Russia and born not in Russia) is not statistically significant and does not change after the oversampling and undersampling are applied.

  • Except for some ambiguity in case of the “Don’t know” level, higher math achievement scores are observed for students whose parents have obtained higher levels of education. The difference in math achievement scores is statistically significant for both father’s and mother’s educational levels. Additionally, it was shown that a larger number of books at home was observed in families where parents have obtained higher levels of education. The number of books and home and parental education was proven to be not independent.

Models

No interaction

For the sake of experiment we can compare two models based solely on the number of books at home and presence of either personal or shared tablet. Since models are not nested, we have to use AIC instead of ANOVA.

  math
Predictors Estimates CI p
(Intercept) 514.12 505.18 – 523.05 <0.001
books [11–25 books] 10.98 1.10 – 20.86 0.029
books [26–100 books] 27.10 17.44 – 36.76 <0.001
books [101–200 books] 43.04 32.40 – 53.67 <0.001
books [More than 200] 42.51 30.94 – 54.08 <0.001
Observations 4517
R2 / R2 adjusted 0.029 / 0.028
  math
Predictors Estimates CI p
(Intercept) 540.88 538.18 – 543.58 <0.001
selftab [No] 11.68 5.27 – 18.09 <0.001
sharedtab [No] -24.36 -30.59 – -18.14 <0.001
Observations 4517
R2 / R2 adjusted 0.016 / 0.016
## [1] 52203.08
## [1] 52257.89

AIC of the first model appears to be lower 52203.08 > 52257.89 and we conclude that it is better. Additionally, it explains slightly more variance in the outcome variable (0.02787 > 0.01557) while both models are better than no model at all, judjing by the p-value being less than 0.05.

We then proceed to include control variables in the equation and test whether the addition of two tablet predictor variables improves the model that is focused on the number of books at home only.

  math
Predictors Estimates CI p
(Intercept) 501.51 442.64 – 560.39 <0.001
books [11–25 books] 5.03 -4.54 – 14.59 0.303
books [26–100 books] 13.08 3.61 – 22.54 0.007
books [101–200 books] 24.21 13.72 – 34.69 <0.001
books [More than 200] 19.25 7.77 – 30.74 0.001
gender [Boy] 6.83 2.39 – 11.27 0.003
country [No] 1.53 -10.82 – 13.87 0.809
edmother [Lower
secondary]
1.95 -55.76 – 59.65 0.947
edmother [Upper
secondary]
0.29 -57.49 – 58.06 0.992
edmother [Post-secondary,
non-tertiary]
32.39 -25.42 – 90.20 0.272
edmother [Short-cycle
tertiary]
29.32 -28.57 – 87.22 0.321
edmother [Bachelor’s or
equivalent]
36.03 -21.65 – 93.70 0.221
edmother [Postgraduate
degree]
40.91 -16.90 – 98.72 0.165
edmother [Don’t know] 11.83 -46.00 – 69.67 0.688
edfather [Lower
secondary]
-20.85 -53.28 – 11.58 0.208
edfather [Upper
secondary]
-10.91 -43.40 – 21.57 0.510
edfather [Post-secondary,
non-tertiary]
2.77 -29.68 – 35.22 0.867
edfather [Short-cycle
tertiary]
0.51 -32.29 – 33.31 0.976
edfather [Bachelor’s or
equivalent]
10.28 -22.20 – 42.76 0.535
edfather [Postgraduate
degree]
18.79 -14.29 – 51.87 0.265
edfather [Don’t know] -7.19 -39.42 – 25.04 0.662
Observations 4517
R2 / R2 adjusted 0.106 / 0.102
  math
Predictors Estimates CI p
(Intercept) 502.65 444.03 – 561.27 <0.001
books [11–25 books] 5.25 -4.27 – 14.76 0.279
books [26–100 books] 12.83 3.41 – 22.25 0.008
books [101–200 books] 23.69 13.26 – 34.13 <0.001
books [More than 200] 18.67 7.24 – 30.10 0.001
selftab [No] 14.11 8.00 – 20.23 <0.001
sharedtab [No] -18.65 -24.61 – -12.69 <0.001
gender [Boy] 8.30 3.87 – 12.73 <0.001
country [No] -0.31 -12.58 – 11.96 0.961
edmother [Lower
secondary]
-1.62 -58.97 – 55.73 0.956
edmother [Upper
secondary]
-3.35 -60.78 – 54.08 0.909
edmother [Post-secondary,
non-tertiary]
27.96 -29.50 – 85.43 0.340
edmother [Short-cycle
tertiary]
25.48 -32.06 – 83.03 0.385
edmother [Bachelor’s or
equivalent]
31.32 -26.02 – 88.65 0.284
edmother [Postgraduate
degree]
36.28 -21.19 – 93.76 0.216
edmother [Don’t know] 7.96 -49.52 – 65.45 0.786
edfather [Lower
secondary]
-17.36 -49.60 – 14.88 0.291
edfather [Upper
secondary]
-8.09 -40.39 – 24.20 0.623
edfather [Post-secondary,
non-tertiary]
6.10 -26.17 – 38.36 0.711
edfather [Short-cycle
tertiary]
3.43 -29.17 – 36.04 0.837
edfather [Bachelor’s or
equivalent]
13.98 -18.31 – 46.27 0.396
edfather [Postgraduate
degree]
21.73 -11.15 – 54.61 0.195
edfather [Don’t know] -3.40 -35.44 – 28.64 0.835
Observations 4517
R2 / R2 adjusted 0.118 / 0.114
## Analysis of Variance Table
## 
## Model 1: math ~ books + gender + country + edmother + edfather
## Model 2: math ~ books + selftab + sharedtab + gender + country + edmother + 
##     edfather
##   Res.Df      RSS Df Sum of Sq      F    Pr(>F)    
## 1   4496 25361460                                  
## 2   4494 25028212  2    333248 29.919 1.237e-13 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Since P-value is small (1.237e-13) we conclude that the additon of two new predictors did improve the model. The final model includes both the number of books and presence of shared or personal tablet as well as control variables.

  math
Predictors Estimates CI p
(Intercept) 502.65 444.03 – 561.27 <0.001
books [11–25 books] 5.25 -4.27 – 14.76 0.279
books [26–100 books] 12.83 3.41 – 22.25 0.008
books [101–200 books] 23.69 13.26 – 34.13 <0.001
books [More than 200] 18.67 7.24 – 30.10 0.001
selftab [No] 14.11 8.00 – 20.23 <0.001
sharedtab [No] -18.65 -24.61 – -12.69 <0.001
gender [Boy] 8.30 3.87 – 12.73 <0.001
country [No] -0.31 -12.58 – 11.96 0.961
edmother [Lower
secondary]
-1.62 -58.97 – 55.73 0.956
edmother [Upper
secondary]
-3.35 -60.78 – 54.08 0.909
edmother [Post-secondary,
non-tertiary]
27.96 -29.50 – 85.43 0.340
edmother [Short-cycle
tertiary]
25.48 -32.06 – 83.03 0.385
edmother [Bachelor’s or
equivalent]
31.32 -26.02 – 88.65 0.284
edmother [Postgraduate
degree]
36.28 -21.19 – 93.76 0.216
edmother [Don’t know] 7.96 -49.52 – 65.45 0.786
edfather [Lower
secondary]
-17.36 -49.60 – 14.88 0.291
edfather [Upper
secondary]
-8.09 -40.39 – 24.20 0.623
edfather [Post-secondary,
non-tertiary]
6.10 -26.17 – 38.36 0.711
edfather [Short-cycle
tertiary]
3.43 -29.17 – 36.04 0.837
edfather [Bachelor’s or
equivalent]
13.98 -18.31 – 46.27 0.396
edfather [Postgraduate
degree]
21.73 -11.15 – 54.61 0.195
edfather [Don’t know] -3.40 -35.44 – 28.64 0.835
Observations 4517
R2 / R2 adjusted 0.118 / 0.114

The final model appears to be better than no model at all (p-value: < 2.2e-16) and explains 11% of the variance in the outcome variable. Three factor levels of the number of books at home (base level excluded) were coded as statistically significant, while none of the parental education variables were. Gender and both tablet variables were found to be statistically significant, while country of origin wasn’t.

While the regression equation would look a bit confusing with that many variables, the interpretation can be presented as:

  • When all variables are equal to 0 (meaning that respondent is a girl, has a personal tablet and a shared tablet, was born in Russia, has less than 10 books at home and both parents obtained a “some primary or lower secondary education” education) the predicted math score is equal to 502.6493.

  • The increase of the predicted math score in comparison to the base category of books (less than 10 books at home) is getting bigger as we move to levels indicating larger numbers of books at home, however, an increase associated with “More than 200” level is smaller than that of “101–200 books” (18.6710 < 23.6901).

  • If respondent doesn’t have a personal tablet, the predicted math score is higher by 14.1105.

  • If respondent doesn’t have a shared tablet, the predicted math score is lower by 18.6497.

  • If respondent is a boy, the predicted math score is higher by 8.3003.

  • If respondent wasn’t born in Russia, the predicted math score is lower by 0.3096.

  • Starting from the “Post-secondary, non-tertiary” change in the education level of mother and father is associated with an increase in the predicted math score. Increase attributed to the “Short-cycle tertiary” is lower than that of “Post-secondary, non-tertiary” and “Bachelor’s or equivalent” in both cases. While in case of mother’s education by ascribing to the “Don’t know” category” increases the predicted math score (7.9611), for father’s education it actually decreases it (-3.3976).

The findings are consistent with the hypothesis presented above: presence of a personal tablet is associated with a decrease in predicted math achievement score, presence of a shared tablet is associated with an increase in predicted math achievement score, increase in the number of books at home is associated with an increase in the predicted math achievement score.

Interaction

We now proceed to add a new variable as an interaction term, but before doing so, we need to take a closer look at it. The variable we want to include it’s respondents own educational aspirations, namely, the level of education he or she is planning to obtain. The reason behind the addition of this variable is the following: we add personal educational aspirations of a student as an interaction term to the varible indicating the number of books at home. We might hypothesize that the effect of the number of books at home on the math achievement will be different for different levels of educational aspiration: high educational aspirations (and willingness to work to achieve one’s goal) might compensate for the lack of either abilities or opportunities priveded by teachers at school that a small number of books at home led to.

The largest share of respondents stated that they want to obtain a Bachelor’s degree (36.7%), followed by the 19.8% of respondents claiming that they want to obtain a postgraduate degree. The general trend with the median math achievement was upward: the higher were the educational aspirations of respondents, the higher were their median math achievement score. That is also demonstrated in the descriptive table below.

Ed vars n mean sd median trimmed mad min max range skew kurtosis se
Finish Lower secondary 1 872 485.3189 78.03657 479.1925 483.7742 81.99048 273.0573 697.0503 423.9930 0.1549919 -0.4530134 2.642652
Finish Upper secondary 1 354 505.7826 71.45162 506.2134 505.5721 69.58748 282.6379 739.5915 456.9536 0.0251689 0.0432201 3.797611
Finish Post-secondary, non-tertiary 1 737 528.7562 72.55027 532.5299 530.0173 77.71502 307.1516 708.8806 401.7290 -0.1766014 -0.3357636 2.672423
Finish Bachelor’s or equivalent 1 1658 557.9863 68.93484 558.3862 558.1474 67.86502 323.8450 819.8235 495.9785 -0.0060154 0.0352018 1.692959
Finish Postgraduate degree 1 896 576.0760 72.60047 578.2024 577.3281 71.98893 353.2630 786.5518 433.2887 -0.1527531 -0.2237756 2.425411

Finally, we can perform ANOVA to check if the difference in math scores for these groups is statistically significant.

##                  Df   Sum Sq Mean Sq F value Pr(>F)    
## data1$futureed    4  4809646 1202412   230.2 <2e-16 ***
## Residuals      4512 23571760    5224                   
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Since P-value < .001 (<2e-16), we conlude that the difference in math scores across different levels of educational aspiration is statistically significant.

  math
Predictors Estimates CI p
(Intercept) 487.51 431.25 – 543.77 <0.001
books [11–25 books] 1.37 -13.60 – 16.34 0.858
books [26–100 books] 7.19 -8.25 – 22.64 0.361
books [101–200 books] -8.01 -28.33 – 12.31 0.440
books [More than 200] -2.32 -27.57 – 22.92 0.857
futureed [Finish Upper
secondary]
16.49 -12.56 – 45.53 0.266
futureed [Finish
Post-secondary,
non-tertiary]
47.19 23.01 – 71.37 <0.001
futureed [Finish
Bachelor’s or equivalent]
55.51 33.99 – 77.04 <0.001
futureed [Finish
Postgraduate degree]
54.55 26.88 – 82.23 <0.001
selftab [No] 12.08 6.23 – 17.92 <0.001
sharedtab [No] -15.49 -21.19 – -9.79 <0.001
gender [Boy] 13.04 8.78 – 17.30 <0.001
country [No] -3.20 -14.91 – 8.52 0.593
edmother [Lower
secondary]
-4.80 -59.81 – 50.22 0.864
edmother [Upper
secondary]
-7.90 -63.01 – 47.22 0.779
edmother [Post-secondary,
non-tertiary]
8.29 -46.88 – 63.45 0.768
edmother [Short-cycle
tertiary]
5.25 -49.97 – 60.47 0.852
edmother [Bachelor’s or
equivalent]
5.05 -49.99 – 60.10 0.857
edmother [Postgraduate
degree]
5.40 -49.84 – 60.64 0.848
edmother [Don’t know] -2.96 -58.12 – 52.21 0.916
edfather [Lower
secondary]
-13.98 -44.77 – 16.81 0.373
edfather [Upper
secondary]
-9.64 -40.50 – 21.22 0.540
edfather [Post-secondary,
non-tertiary]
-3.03 -33.88 – 27.81 0.847
edfather [Short-cycle
tertiary]
-4.74 -35.88 – 26.41 0.766
edfather [Bachelor’s or
equivalent]
0.92 -29.94 – 31.77 0.954
edfather [Postgraduate
degree]
4.71 -26.73 – 36.15 0.769
edfather [Don’t know] -6.35 -36.96 – 24.26 0.684
books [11–25 books] ×
futureed [Finish Upper
secondary]
5.49 -26.83 – 37.81 0.739
books [26–100 books] ×
futureed [Finish Upper
secondary]
0.37 -32.23 – 32.97 0.982
books [101–200 books] ×
futureed [Finish Upper
secondary]
22.97 -16.27 – 62.22 0.251
books [More than 200] ×
futureed [Finish Upper
secondary]
-11.87 -59.27 – 35.53 0.623
books [11–25 books] ×
futureed [Finish
Post-secondary,
non-tertiary]
-12.97 -39.80 – 13.86 0.343
books [26–100 books] ×
futureed [Finish
Post-secondary,
non-tertiary]
-18.36 -45.14 – 8.43 0.179
books [101–200 books] ×
futureed [Finish
Post-secondary,
non-tertiary]
9.44 -22.27 – 41.14 0.559
books [More than 200] ×
futureed [Finish
Post-secondary,
non-tertiary]
-7.49 -45.33 – 30.35 0.698
books [11–25 books] ×
futureed [Finish
Bachelor’s or equivalent]
2.46 -21.15 – 26.06 0.838
books [26–100 books] ×
futureed [Finish
Bachelor’s or equivalent]
0.84 -22.74 – 24.41 0.944
books [101–200 books] ×
futureed [Finish
Bachelor’s or equivalent]
32.01 4.22 – 59.80 0.024
books [More than 200] ×
futureed [Finish
Bachelor’s or equivalent]
13.92 -18.31 – 46.15 0.397
books [11–25 books] ×
futureed [Finish
Postgraduate degree]
10.76 -19.89 – 41.42 0.491
books [26–100 books] ×
futureed [Finish
Postgraduate degree]
18.60 -11.17 – 48.37 0.221
books [101–200 books] ×
futureed [Finish
Postgraduate degree]
41.34 8.08 – 74.60 0.015
books [More than 200] ×
futureed [Finish
Postgraduate degree]
40.36 3.46 – 77.25 0.032
Observations 4517
R2 / R2 adjusted 0.204 / 0.196

The interaction effect is significant on several levels: 101–200 books at home and aspired to finish Bachelor’s or equivalent; 101–200 books at home and aspired to obtain Postgraduate degree; more than 200 books at home and aspired to obtain Postgraduate degree. Three out of four levels of the variable educational aspiration (base level excluded) were found to be statistically significant, while the variable indicating the number of books at home lost it statistical significance. The model is still better than no model at all and explains around 20% of the variance in the outcome variable.

We can also take a look at the interaction plot presented above. Predcited math scores of those with higher educational aspirations are always higher and the lines do not overlap. Predicted math score drops at the “More than 200 books at home” level for three out of five educational aspiration types: Upper secondary, Post-secondary but non-tertiary and Bachelor’s. For those aspired to obrain Postgraduate degree or only finish lower secondary education the predicted math score increases at that level of number of books compared to the previous one. Another pecularity in case of the “Finish lower secondary” aspiration level is that the predicted math score drops at the level of 101-200 books at home in comparison to the previous one unlike it happens in all four remaining aspiration levels. In general it can be seen that the increase in predicted math score on each level compared to the previous one is more steep for such high aspiration level as Postgraduate degee and it is the only level for which the predicted math achievement always increases.

Now we want to compare the model with and without interaction to check which one if better.

## Analysis of Variance Table
## 
## Model 1: math ~ books + selftab + sharedtab + gender + country + edmother + 
##     edfather
## Model 2: math ~ books * futureed + selftab + sharedtab + gender + country + 
##     edmother + edfather
##   Res.Df      RSS Df Sum of Sq     F    Pr(>F)    
## 1   4494 25028212                                 
## 2   4474 22596800 20   2431412 24.07 < 2.2e-16 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Since P-value is small (2.2e-16) we conclude that the additon of the interaction did improve the model and claim it to be our final one.

Model diagnostics

In model diagnostics, we can first check for outliers.

## No Studentized residuals with Bonferroni p < 0.05
## Largest |rstudent|:
##       rstudent unadjusted p-value Bonferroni p
## 4173 -3.576513         0.00035189           NA

## 1712 4173 
## 1615 3944

We observe that there is an outlier observation - 4173 (at the low left corner of the QQ Plot). We will have to check whether it is an influential outlier later on the diagnostic plots.

In order to check for multicollinearity we calculate variance inflation factors for all the variables.

##                        GVIF Df GVIF^(1/(2*Df))
## books          1.085381e+03  4        2.395785
## futureed       6.492610e+04  4        3.995328
## selftab        1.019190e+00  1        1.009549
## sharedtab      1.028435e+00  1        1.014118
## gender         1.053147e+00  1        1.026229
## country        1.018837e+00  1        1.009374
## edmother       4.139937e+00  7        1.106805
## edfather       3.571689e+00  7        1.095194
## books:futureed 3.172123e+07 16        1.715605
Names Raw Squared
books 2.395785 5.739784
futureed 3.995328 15.962644
selftab 1.009549 1.019190
sharedtab 1.014118 1.028435
gender 1.026229 1.053147
country 1.009375 1.018837
edmother 1.106805 1.225017
edfather 1.095194 1.199450
books:futureed 1.715604 2.943299

In order to make GVIF^(1/(2 X DF)) analogous to the standard VIF measure, we need to square it. As we observe some squared values of GVIF^(1/(2 X DF)) exceed the threshold of 5 (15.962644 and 5.739784) which is a problem and indicates multicillinearity. Meaning, some of our explanatory variables are highly linearly correlated and it might be hard to distinguish their individual effects on the outome variable.

We then proceed to check for heteroscedasticity of residuals.

## Non-constant Variance Score Test 
## Variance formula: ~ fitted.values 
## Chisquare = 18.10096, Df = 1, p = 2.095e-05

Unfortunately, p-value is statistically significant (2.095e-05<0.05) which is a sign of heteroscedasticity (the residuals don’t have a constant variance).

One of possible explanations for this is that, heteroscedasticity is quite a common phenomena in cross-sectional. Additionally, Russia is a relatively big country with quite different regions and while extremely high values of some variables can be observed in one area, the other area might have extremely low values of the same variables. We will also observe heteroscedasticity on the diagnostic plots in what follows.

  • The horizontal line on the “Residuals vs Fitted” graph seems to be quite straight, which is good since here we are checking the linearity.

  • On the second graph we observe the outliers we’ve already mentioned. Apart from those, the line is actually quite straight.

  • “Scale-Location” graph is used to check the homogeneity of variance of the residuals and the ideal case would be a horizontal line with points equally spread around it - that is not our case. The line on our plot seems to have a slightly negative slope.

  • There are no points on the Line of Cook’s distance which means there are no high influence outliers that should be deleted from the model.

EFA

We now proceed to the factor analysis and create a separate dataset out of the variables that we’ve already selected. We also rename the variables for easier interpretation.

## 'data.frame':    4517 obs. of  11 variables:
##  $ selftab  : Factor w/ 2 levels "Yes","No": 1 1 1 1 1 1 1 1 2 1 ...
##  $ sharedtab: Factor w/ 2 levels "Yes","No": 1 1 1 1 1 1 2 1 1 1 ...
##  $ desk     : Factor w/ 2 levels "Yes","No": 1 1 1 1 1 1 1 1 1 1 ...
##  $ room     : Factor w/ 2 levels "Yes","No": 1 1 1 1 1 1 1 1 1 1 ...
##  $ internet : Factor w/ 2 levels "Yes","No": 1 1 2 1 1 1 1 1 1 1 ...
##  $ phone    : Factor w/ 2 levels "Yes","No": 1 1 1 1 1 1 1 1 1 1 ...
##  $ game     : Factor w/ 2 levels "Yes","No": 1 2 2 2 1 2 2 1 2 1 ...
##  $ music    : Factor w/ 2 levels "Yes","No": 2 2 2 2 1 2 2 1 1 2 ...
##  $ car      : Factor w/ 2 levels "Yes","No": 2 2 2 1 1 1 1 1 1 1 ...
##  $ flat     : Factor w/ 2 levels "Yes","No": 1 2 2 1 1 2 2 2 2 1 ...
##  $ dish     : Factor w/ 2 levels "Yes","No": 2 2 2 2 1 2 2 2 1 2 ...

As it can be seen, all variables in the dataset are factors. We first use hetcor() function that will calculate correlations between our variables taking into account their type and then present it in a form of a corrplot.

The corrplot does not look that promising: there are no clear highly correlated areas that we would hope to then see as factors. However, we can observe correlation between the variables “internet”, “phone” and two of the tablet variables and might suspect that they will load on the same factor.

We have to transform all our variables into numeric because we want to later use factor scores in the regression. In order to still note that our variables were not initially numeric we will use the cor=“mixed” specification. Prior to that the factor variables were also re-coded: initially the “No” response was coded as “2” and “Yes” was coded as “1”. For the sake of easier interpretation “No” was ascribed the value of 0, “Yes” - the value of 1.

## Parallel analysis suggests that the number of factors =  5  and the number of components =  4

We then build a screeplot to identify what a suggested number of factors would be. Since there are five trianges lying above the red line, we specify the number of factors equal to 5 in our first model.

No rotation, five factors

## Factor Analysis using method =  minres
## Call: fa(r = datanum, nfactors = 5, rotate = "none", cor = "mixed")
## Standardized loadings (pattern matrix) based upon correlation matrix
##            MR1   MR2   MR3   MR4   MR5   h2    u2 com
## selftab   0.56 -0.36 -0.09 -0.54  0.28 0.82 0.184 3.3
## sharedtab 0.24  0.54 -0.08  0.15  0.22 0.43 0.574 2.0
## desk      0.71 -0.38 -0.49  0.27 -0.18 1.00 0.005 2.9
## room      0.45 -0.31 -0.02  0.23  0.06 0.36 0.643 2.4
## internet  0.66  0.36 -0.19 -0.07  0.14 0.62 0.378 1.9
## phone     0.65  0.38 -0.12 -0.10 -0.13 0.60 0.395 1.8
## game      0.37 -0.06  0.35 -0.27 -0.18 0.37 0.629 3.3
## music     0.29  0.17  0.06  0.08 -0.17 0.15 0.849 2.7
## car       0.48  0.10  0.29  0.12  0.14 0.36 0.643 2.1
## flat      0.36 -0.23  0.41  0.34  0.27 0.53 0.468 4.4
## dish      0.54  0.01  0.38 -0.03 -0.30 0.52 0.477 2.4
## 
##                        MR1  MR2  MR3  MR4  MR5
## SS loadings           2.80 1.03 0.82 0.66 0.44
## Proportion Var        0.25 0.09 0.07 0.06 0.04
## Cumulative Var        0.25 0.35 0.42 0.48 0.52
## Proportion Explained  0.49 0.18 0.14 0.12 0.08
## Cumulative Proportion 0.49 0.66 0.81 0.92 1.00
## 
## Mean item complexity =  2.7
## Test of the hypothesis that 5 factors are sufficient.
## 
## df null model =  55  with the objective function =  2.41 with Chi Square =  10886.92
## df of  the model are 10  and the objective function was  0.03 
## 
## The root mean square of the residuals (RMSR) is  0.01 
## The df corrected root mean square of the residuals is  0.03 
## 
## The harmonic n.obs is  4517 with the empirical chi square  69.41  with prob <  5.8e-11 
## The total n.obs was  4517  with Likelihood Chi Square =  128.97  with prob <  7.6e-23 
## 
## Tucker Lewis Index of factoring reliability =  0.94
## RMSEA index =  0.051  and the 90 % confidence intervals are  0.044 0.059
## BIC =  44.81
## Fit based upon off diagonal values = 1
## Measures of factor score adequacy             
##                                                    MR1  MR2  MR3  MR4  MR5
## Correlation of (regression) scores with factors   0.95 0.87 0.86 0.85 0.74
## Multiple R square of scores with factors          0.90 0.76 0.74 0.73 0.54
## Minimum correlation of possible factor scores     0.81 0.53 0.48 0.46 0.09

Let’s first assess the model fit by the produced output.

  • Cumulative proportion of variance explained by all factors is equal to 0.52 which is relatively high.
  • ‘Proportion Var’ values are equal to 0.25, 0.09, 0.07, 0.06 and 0.04 with all of them being bigger 0.1.
  • We want the values of ‘Proportion Explained’ to be more or less similar. However, we observe that they range from 0.49 to 0.08 which is far from being ideal. Values for MR2 (0.18), MR3 (0.14), and MR4 (0.12) are actually quite similar and the problem is simply in the first factor having that high of a value (0.49).
  • The root mean square of the residuals (RMSR) is 0.01 which is good since we want it to be closer to zero and at least less than 0.05.
  • RMSEA index is equal to 0.051 which is, in fact, we good (<.08 acceptable, <.05 excellent).
  • Tucker Lewis index of factoring reliability is equal to 0.94 which also makes it acceptable (>.90 acceptable, >.95 excellent).

In general, it seems like the fit of the model is quite good apart from one of the factors having a too high ‘proportion explained’ value. However, if we look at the plot we observe that there are multiple problems present: two of the factors have only one variable loaded on them, while two have none. The majority of variables are loaded on the first factor which can also be seen in the output. It’s quite natural to have such a bad picture since we haven’t performed the rotation yet.

Varimax rotation, five factors

While keeping the number of factors to 5 we proceed to perform the varimax (orthogonal) rotation which assumes that factors are not correlated. We suppose that after the rotation the indicators will load high on one factor and one factor only - we will basically try to maximize each indicator’s largest factor loading and minimize others.

## Factor Analysis using method =  minres
## Call: fa(r = datanum, nfactors = 5, rotate = "varimax", cor = "mixed")
## Standardized loadings (pattern matrix) based upon correlation matrix
##             MR2   MR1   MR3   MR4   MR5   h2    u2 com
## selftab    0.06  0.22  0.15  0.85  0.09 0.82 0.184 1.2
## sharedtab  0.63 -0.09 -0.06 -0.11  0.10 0.43 0.574 1.2
## desk       0.16  0.96  0.10  0.16  0.07 1.00 0.005 1.1
## room       0.02  0.46  0.09  0.12  0.35 0.36 0.643 2.1
## internet   0.70  0.21  0.17  0.24  0.05 0.62 0.378 1.6
## phone      0.62  0.23  0.38  0.13 -0.07 0.60 0.395 2.1
## game       0.00 -0.03  0.55  0.24  0.11 0.37 0.629 1.5
## music      0.23  0.12  0.27 -0.10  0.04 0.15 0.849 2.7
## car        0.29  0.05  0.28  0.08  0.42 0.36 0.643 2.7
## flat      -0.01  0.12  0.13  0.03  0.71 0.53 0.468 1.1
## dish       0.12  0.14  0.67  0.04  0.21 0.52 0.477 1.4
## 
##                        MR2  MR1  MR3  MR4  MR5
## SS loadings           1.45 1.35 1.13 0.93 0.89
## Proportion Var        0.13 0.12 0.10 0.08 0.08
## Cumulative Var        0.13 0.25 0.36 0.44 0.52
## Proportion Explained  0.25 0.23 0.20 0.16 0.15
## Cumulative Proportion 0.25 0.49 0.68 0.85 1.00
## 
## Mean item complexity =  1.7
## Test of the hypothesis that 5 factors are sufficient.
## 
## df null model =  55  with the objective function =  2.41 with Chi Square =  10886.92
## df of  the model are 10  and the objective function was  0.03 
## 
## The root mean square of the residuals (RMSR) is  0.01 
## The df corrected root mean square of the residuals is  0.03 
## 
## The harmonic n.obs is  4517 with the empirical chi square  69.41  with prob <  5.8e-11 
## The total n.obs was  4517  with Likelihood Chi Square =  128.97  with prob <  7.6e-23 
## 
## Tucker Lewis Index of factoring reliability =  0.94
## RMSEA index =  0.051  and the 90 % confidence intervals are  0.044 0.059
## BIC =  44.81
## Fit based upon off diagonal values = 1
## Measures of factor score adequacy             
##                                                    MR2  MR1  MR3  MR4  MR5
## Correlation of (regression) scores with factors   0.85 0.99 0.78 0.88 0.77
## Multiple R square of scores with factors          0.72 0.98 0.60 0.78 0.59
## Minimum correlation of possible factor scores     0.44 0.96 0.20 0.56 0.19

We observe how the values of “Proportion explained” changed: while they used to range from 0.49 to 0.08, they now range from 0.15 to 0.25 which is a lot better (MR2 now has to biggest value). We also observe how the plot changed: now all factors have at least one variables loaded on them, but only one of the factors have a sufficient number of variables loading on it equal to three. Changes are also apparent for the factor loadings: for instance, “selftab” that was previously loaded on the MR1 factor is now mainly loaded on MR4 with 0.85 factor loading, “sharedtab” that was initially loaded on the MR2 factor with 0.54 is now loaded on it with 0.63. Changes in other varaibles are also presented in the output.

Since varimax (orthogonal) rotation assumes that factors are not correlated, the correlations are not seen in the output.

Oblimin rotation, five factors

## Factor Analysis using method =  minres
## Call: fa(r = datanum, nfactors = 5, rotate = "oblimin", cor = "mixed")
## Standardized loadings (pattern matrix) based upon correlation matrix
##             MR2   MR1   MR3   MR4   MR5   h2    u2 com
## selftab    0.02  0.01 -0.01  0.90  0.01 0.82 0.184 1.0
## sharedtab  0.69 -0.12 -0.14 -0.12  0.10 0.43 0.574 1.3
## desk       0.01  1.00 -0.01  0.01  0.00 1.00 0.005 1.0
## room      -0.05  0.43  0.01  0.07  0.32 0.36 0.643 1.9
## internet   0.68  0.11  0.03  0.19 -0.01 0.62 0.378 1.2
## phone      0.56  0.15  0.30  0.04 -0.16 0.60 0.395 1.9
## game      -0.07 -0.14  0.56  0.20  0.02 0.37 0.629 1.4
## music      0.19  0.11  0.27 -0.17 -0.01 0.15 0.849 2.9
## car        0.28 -0.05  0.21  0.05  0.38 0.36 0.643 2.5
## flat      -0.01  0.04  0.05  0.02  0.70 0.53 0.468 1.0
## dish       0.02  0.06  0.68 -0.06  0.11 0.52 0.477 1.1
## 
##                        MR2  MR1  MR3  MR4  MR5
## SS loadings           1.45 1.33 1.14 0.98 0.85
## Proportion Var        0.13 0.12 0.10 0.09 0.08
## Cumulative Var        0.13 0.25 0.36 0.45 0.52
## Proportion Explained  0.25 0.23 0.20 0.17 0.15
## Cumulative Proportion 0.25 0.48 0.68 0.85 1.00
## 
##  With factor correlations of 
##      MR2  MR1  MR3  MR4  MR5
## MR2 1.00 0.27 0.31 0.17 0.06
## MR1 0.27 1.00 0.27 0.40 0.17
## MR3 0.31 0.27 1.00 0.34 0.25
## MR4 0.17 0.40 0.34 1.00 0.12
## MR5 0.06 0.17 0.25 0.12 1.00
## 
## Mean item complexity =  1.6
## Test of the hypothesis that 5 factors are sufficient.
## 
## df null model =  55  with the objective function =  2.41 with Chi Square =  10886.92
## df of  the model are 10  and the objective function was  0.03 
## 
## The root mean square of the residuals (RMSR) is  0.01 
## The df corrected root mean square of the residuals is  0.03 
## 
## The harmonic n.obs is  4517 with the empirical chi square  69.41  with prob <  5.8e-11 
## The total n.obs was  4517  with Likelihood Chi Square =  128.97  with prob <  7.6e-23 
## 
## Tucker Lewis Index of factoring reliability =  0.94
## RMSEA index =  0.051  and the 90 % confidence intervals are  0.044 0.059
## BIC =  44.81
## Fit based upon off diagonal values = 1
## Measures of factor score adequacy             
##                                                    MR2  MR1  MR3  MR4  MR5
## Correlation of (regression) scores with factors   0.87 1.00 0.83 0.91 0.79
## Multiple R square of scores with factors          0.76 1.00 0.69 0.83 0.62
## Minimum correlation of possible factor scores     0.52 0.99 0.37 0.66 0.24

We then perform the “oblimin” rotation and immediately see the correlations between factors on the plot: MR1 and MR4 - 0.4, MR2 and MR3 - 0.3, MR3 and MR4 - 0.3. Since the results of the tests won’t change we have to look at the changes in SS loadings, proportion variance etc: however, the “Proportion Explained” values, in fact, almost didn’t change.

Since there is no significant improvement in comparison to the results of the varimax rotation, the second model appears to be preferable.

Varimax rotation, three factors

As we observed on the plots, the number of variables loaded on factors was not sufficient with only one factor having three variables loaded on it. In order to solve this problem we can reduce the number of factors. Since in case of the four factors some of them still have less than three variables loaded on them, we proceed to reduce the number of factors to three.

## Factor Analysis using method =  minres
## Call: fa(r = datanum, nfactors = 3, rotate = "varimax", cor = "mixed")
## Standardized loadings (pattern matrix) based upon correlation matrix
##             MR2   MR1   MR3   h2   u2 com
## selftab    0.13  0.40  0.25 0.24 0.76 1.9
## sharedtab  0.53 -0.10 -0.02 0.29 0.71 1.1
## desk       0.21  0.86  0.05 0.79 0.21 1.1
## room      -0.01  0.56  0.23 0.37 0.63 1.3
## internet   0.73  0.28  0.14 0.64 0.36 1.4
## phone      0.68  0.23  0.23 0.57 0.43 1.5
## game       0.07  0.07  0.54 0.30 0.70 1.1
## music      0.26  0.06  0.19 0.11 0.89 2.0
## car        0.27  0.13  0.46 0.30 0.70 1.8
## flat      -0.04  0.22  0.39 0.20 0.80 1.6
## dish       0.19  0.16  0.61 0.44 0.56 1.3
## 
##                        MR2  MR1  MR3
## SS loadings           1.52 1.47 1.26
## Proportion Var        0.14 0.13 0.11
## Cumulative Var        0.14 0.27 0.39
## Proportion Explained  0.36 0.34 0.30
## Cumulative Proportion 0.36 0.70 1.00
## 
## Mean item complexity =  1.5
## Test of the hypothesis that 3 factors are sufficient.
## 
## df null model =  55  with the objective function =  2.41 with Chi Square =  10886.92
## df of  the model are 25  and the objective function was  0.32 
## 
## The root mean square of the residuals (RMSR) is  0.05 
## The df corrected root mean square of the residuals is  0.08 
## 
## The harmonic n.obs is  4517 with the empirical chi square  1370.03  with prob <  9.6e-274 
## The total n.obs was  4517  with Likelihood Chi Square =  1421.89  with prob <  8.1e-285 
## 
## Tucker Lewis Index of factoring reliability =  0.716
## RMSEA index =  0.111  and the 90 % confidence intervals are  0.106 0.116
## BIC =  1211.5
## Fit based upon off diagonal values = 0.95
## Measures of factor score adequacy             
##                                                    MR2  MR1  MR3
## Correlation of (regression) scores with factors   0.85 0.89 0.78
## Multiple R square of scores with factors          0.72 0.80 0.61
## Minimum correlation of possible factor scores     0.44 0.59 0.22

  • The Cumulative proportion of variance explained by all factors dropped to 0.39.
  • ‘Proportion Var’ values are equal to 0.14, 0.13 and 0.11 with all of them being bigger 0.1.
  • The values of ‘Proportion Explained’ are quite similar and equal to 0.36, 0.34, 0.30.
  • The root mean square of the residuals (RMSR) increased to 0.05 which is worse than it was previously since we want it to be closer to zero and at least less than 0.05.
  • RMSEA index is equal to 0.111 and is now declared unacceptable.
  • Tucker Lewis index of factoring reliability is equal to 0.716 which also makes it not acceptable.

The model fit according to the tests worsened. However, the plot now seems to be easier to interpret as it makes more sense.

EFA Final

Factor Interpretation

  • MR1 - “Independance, personal space”

Such variables as “desk”, “room” and “selftab” are loaded on the first factor. One of the qualities that unites these variables and can be seen as a latent variable behind them is the independence of a child and personal space provided to him / her. It might be hypothesized that students that have higher ML1 factor scores and more personal space have higher math ahievement scores because they are not distracted by other family members and have more opportunity to concentrate on studying.

  • MR2 - “Digital exposure”

“internet”, “phone” and “shared” variables loaded on the second factor that can be interpreted as the one’s level of “digital exposure”. While the presence of a personal use tablet indiates independance and is loaded on the first factor, the presence of shared tablet is loaded on the MR2 together with “phone” which is of personal use. This might seem a bit confusing, but we can hypothesize that in case of the presence of a shared tablet it’s not the “shared” quality of it that matters but the presence and access to the internet itself. Access to the internet is also loaded on this factor supporting this claim. The hypothesis can go both ways in the same way it did for the use of tablets: high “Digital exposure” factor scores might be associated with low math achievement scores because children lose their ability to concenrate and to memorize information; on the other hand, usage of the internet for educational purposes can contribute to the opposite effect.

  • MR3 - “Wealth”

“dish”, “car” and “flat” variables are all, in fact, variables that were included in the dataset as indicators of wealth for Russia. Therefore, it would be natural to assume that the factor can be interpreted precisely as that. Interestingly, “game” variable is also loaded on this factor which shows that the possession of a gaming system can also be seen as an indicator of wealth. We can hypothesize that high factor “Wealth” scores are associated with higher math achievement scores because wealthy families can afford tutors and all the necessary studying materials.

Scales fit

We can now check the fit of all four scales.

## 
## Reliability analysis   
## Call: alpha(x = MR1, check.keys = TRUE)
## 
##   raw_alpha std.alpha G6(smc) average_r  S/N   ase mean   sd median_r
##       0.38      0.41    0.32      0.19 0.68 0.015 0.81 0.25     0.18
## 
##     95% confidence boundaries 
##          lower alpha upper
## Feldt     0.35  0.38  0.41
## Duhachek  0.35  0.38  0.41
## 
##  Reliability if an item is dropped:
##         raw_alpha std.alpha G6(smc) average_r  S/N alpha se var.r med.r
## desk         0.23      0.24    0.14      0.14 0.32    0.022    NA  0.14
## room         0.29      0.30    0.18      0.18 0.43    0.020    NA  0.18
## selftab      0.35      0.39    0.24      0.24 0.64    0.017    NA  0.24
## 
##  Item statistics 
##            n raw.r std.r r.cor r.drop mean   sd
## desk    4517  0.60  0.70  0.44   0.28 0.92 0.28
## room    4517  0.78  0.68  0.39   0.24 0.68 0.47
## selftab 4517  0.63  0.65  0.31   0.19 0.85 0.36
## 
## Non missing response frequency for each item
##            0    1 miss
## desk    0.08 0.92    0
## room    0.32 0.68    0
## selftab 0.15 0.85    0
## 
## Reliability analysis   
## Call: alpha(x = MR2, check.keys = TRUE)
## 
##   raw_alpha std.alpha G6(smc) average_r  S/N   ase mean   sd median_r
##       0.26      0.35    0.27      0.15 0.53 0.016 0.93 0.16     0.14
## 
##     95% confidence boundaries 
##          lower alpha upper
## Feldt     0.22  0.26  0.30
## Duhachek  0.23  0.26  0.29
## 
##  Reliability if an item is dropped:
##           raw_alpha std.alpha G6(smc) average_r  S/N alpha se var.r med.r
## internet       0.13      0.19    0.10      0.10 0.23    0.017    NA  0.10
## phone          0.20      0.24    0.14      0.14 0.32    0.019    NA  0.14
## sharedtab      0.34      0.35    0.21      0.21 0.54    0.019    NA  0.21
## 
##  Item statistics 
##              n raw.r std.r r.cor r.drop mean   sd
## internet  4517  0.55  0.68  0.40   0.20 0.97 0.18
## phone     4517  0.45  0.67  0.36   0.18 0.98 0.14
## sharedtab 4517  0.86  0.63  0.26   0.16 0.84 0.37
## 
## Non missing response frequency for each item
##              0    1 miss
## internet  0.03 0.97    0
## phone     0.02 0.98    0
## sharedtab 0.16 0.84    0
## 
## Reliability analysis   
## Call: alpha(x = MR3, check.keys = TRUE)
## 
##   raw_alpha std.alpha G6(smc) average_r  S/N   ase mean   sd median_r
##       0.43      0.43    0.37      0.16 0.76 0.014 0.41 0.28     0.15
## 
##     95% confidence boundaries 
##          lower alpha upper
## Feldt      0.4  0.43  0.45
## Duhachek   0.4  0.43  0.45
## 
##  Reliability if an item is dropped:
##      raw_alpha std.alpha G6(smc) average_r  S/N alpha se  var.r med.r
## dish      0.33      0.33    0.25      0.14 0.50    0.017 0.0044  0.13
## game      0.38      0.38    0.30      0.17 0.62    0.016 0.0014  0.16
## car       0.33      0.35    0.27      0.15 0.53    0.017 0.0054  0.14
## flat      0.38      0.39    0.30      0.17 0.63    0.016 0.0023  0.16
## 
##  Item statistics 
##         n raw.r std.r r.cor r.drop mean   sd
## dish 4517  0.59  0.63  0.41   0.27 0.21 0.40
## game 4517  0.58  0.59  0.34   0.21 0.26 0.44
## car  4517  0.63  0.62  0.39   0.26 0.68 0.47
## flat 4517  0.63  0.59  0.33   0.22 0.51 0.50
## 
## Non missing response frequency for each item
##         0    1 miss
## dish 0.79 0.21    0
## game 0.74 0.26    0
## car  0.32 0.68    0
## flat 0.49 0.51    0

Unfortunately, for all three factors standartized alpha is lower than the 0.7 treshold (0.41, 0.35, 0.43) which indicates a bad scale reliability. Moreover, when the “reliability if an item is dropped” is checked, there are no variables by dropping which higher alpha values could be reached which means that we shouldn’t get rid of any of them. Most importantly, by dropping any of the variables it would be still impossible to exceed the 0.7 treshold.

Factor scores in regression

We proceed to add factor scores to the first dataset that contains variables for the linear regression.

Before we actually put the factor scores into the regression we can take a look at bivariate distributions of these factors and the outcome variables.

## 
##  Pearson's product-moment correlation
## 
## data:  data1$math and data1$MR1
## t = 2.2642, df = 4515, p-value = 0.02361
## alternative hypothesis: true correlation is not equal to 0
## 95 percent confidence interval:
##  0.004517573 0.062779082
## sample estimates:
##        cor 
## 0.03367694
## 
##  Pearson's product-moment correlation
## 
## data:  data1$math and data1$MR2
## t = 10.26, df = 4515, p-value < 2.2e-16
## alternative hypothesis: true correlation is not equal to 0
## 95 percent confidence interval:
##  0.1223145 0.1793144
## sample estimates:
##       cor 
## 0.1509399
## 
##  Pearson's product-moment correlation
## 
## data:  data1$math and data1$MR3
## t = 4.5041, df = 4515, p-value = 6.833e-06
## alternative hypothesis: true correlation is not equal to 0
## 95 percent confidence interval:
##  0.03779188 0.09585879
## sample estimates:
##        cor 
## 0.06688197

We observe a positive correlations between these variables in all three cases with the strongest one observed for the MR2 factor (0.15 > 0.07 > 0.03).

We then proceed to include the MR2 factor scores in the regression equation as an independent variable.

  math
Predictors Estimates CI p
(Intercept) 487.57 431.30 – 543.84 <0.001
books [11–25 books] 1.52 -13.44 – 16.49 0.842
books [26–100 books] 7.31 -8.14 – 22.76 0.353
books [101–200 books] -7.86 -28.19 – 12.46 0.448
books [More than 200] -2.17 -27.42 – 23.08 0.866
futureed [Finish Upper
secondary]
16.97 -12.08 – 46.02 0.252
futureed [Finish
Post-secondary,
non-tertiary]
47.98 23.79 – 72.18 <0.001
futureed [Finish
Bachelor’s or equivalent]
55.45 33.92 – 76.97 <0.001
futureed [Finish
Postgraduate degree]
55.22 27.53 – 82.90 <0.001
selftab [No] 13.86 7.53 – 20.19 <0.001
MR3 -1.04 -3.72 – 1.64 0.447
sharedtab [No] -15.15 -20.89 – -9.42 <0.001
gender [Boy] 13.12 8.85 – 17.40 <0.001
country [No] -3.03 -14.75 – 8.68 0.612
edmother [Lower
secondary]
-5.25 -60.26 – 49.76 0.852
edmother [Upper
secondary]
-8.38 -63.50 – 46.73 0.766
edmother [Post-secondary,
non-tertiary]
7.86 -47.31 – 63.02 0.780
edmother [Short-cycle
tertiary]
4.64 -50.58 – 59.86 0.869
edmother [Bachelor’s or
equivalent]
4.61 -50.43 – 59.66 0.869
edmother [Postgraduate
degree]
4.91 -50.33 – 60.16 0.862
edmother [Don’t know] -3.36 -58.52 – 51.80 0.905
edfather [Lower
secondary]
-13.72 -44.51 – 17.07 0.382
edfather [Upper
secondary]
-9.27 -40.14 – 21.59 0.556
edfather [Post-secondary,
non-tertiary]
-2.78 -33.63 – 28.06 0.860
edfather [Short-cycle
tertiary]
-4.65 -35.80 – 26.50 0.770
edfather [Bachelor’s or
equivalent]
1.17 -29.70 – 32.05 0.941
edfather [Postgraduate
degree]
5.12 -26.34 – 36.58 0.749
edfather [Don’t know] -6.20 -36.81 – 24.42 0.692
books [11–25 books] ×
futureed [Finish Upper
secondary]
4.98 -27.34 – 37.30 0.763
books [26–100 books] ×
futureed [Finish Upper
secondary]
-0.23 -32.84 – 32.37 0.989
books [101–200 books] ×
futureed [Finish Upper
secondary]
22.77 -16.47 – 62.02 0.255
books [More than 200] ×
futureed [Finish Upper
secondary]
-12.51 -59.91 – 34.89 0.605
books [11–25 books] ×
futureed [Finish
Post-secondary,
non-tertiary]
-13.78 -40.62 – 13.07 0.314
books [26–100 books] ×
futureed [Finish
Post-secondary,
non-tertiary]
-19.45 -46.27 – 7.36 0.155
books [101–200 books] ×
futureed [Finish
Post-secondary,
non-tertiary]
8.83 -22.88 – 40.54 0.585
books [More than 200] ×
futureed [Finish
Post-secondary,
non-tertiary]
-8.32 -46.17 – 29.53 0.667
books [11–25 books] ×
futureed [Finish
Bachelor’s or equivalent]
2.49 -21.12 – 26.10 0.836
books [26–100 books] ×
futureed [Finish
Bachelor’s or equivalent]
0.93 -22.65 – 24.50 0.938
books [101–200 books] ×
futureed [Finish
Bachelor’s or equivalent]
31.96 4.17 – 59.75 0.024
books [More than 200] ×
futureed [Finish
Bachelor’s or equivalent]
14.12 -18.11 – 46.35 0.390
books [11–25 books] ×
futureed [Finish
Postgraduate degree]
10.14 -20.52 – 40.81 0.517
books [26–100 books] ×
futureed [Finish
Postgraduate degree]
17.89 -11.90 – 47.67 0.239
books [101–200 books] ×
futureed [Finish
Postgraduate degree]
40.43 7.16 – 73.71 0.017
books [More than 200] ×
futureed [Finish
Postgraduate degree]
40.03 3.14 – 76.92 0.033
selftab [No] × MR3 6.10 -1.09 – 13.30 0.096
Observations 4517
R2 / R2 adjusted 0.204 / 0.196

It appears to be the case that the interaction between the wealth factor and the variable indicating the presence of a personal tablet is statistically significant even though the MR3 variable itself is not. The model explains around 20% of the variable in the outcome variable and is better than no model at all (p-value: < 2.2e-16). In order to determine if the model with the MR3 variable is better of not we can carry out the ANOVA.

## Analysis of Variance Table
## 
## Model 1: math ~ books * futureed + selftab + sharedtab + gender + country + 
##     edmother + edfather
## Model 2: math ~ books * futureed + selftab * MR3 + sharedtab + gender + 
##     country + edmother + edfather
##   Res.Df      RSS Df Sum of Sq      F Pr(>F)
## 1   4474 22596800                           
## 2   4472 22582652  2     14148 1.4008 0.2465

Since the p-value isbigger than 0.05 we conclude that the addition of the MR3 variable didn’t significantly improve the model.

We can still look at the plot for the better understanding of the interaction effect. For the pink line of those who don’t have a personal tablet the line is going upwards: as the wealth of the family increases, the predicted math score increases as well. For the blue line (those who do have a tables) the trend is the opposite: the wealthier the family is the lower is the predicted math score. It can be hypothesized that if a child has a personal tablet, the wealthier the family is the more spoiled the child becomes and the more likely he or she is to spend time playing on the tablet instead of using it for studying. On the contrary, if a child does not have a personal tablet, the wealthier the family is the higher is the predicted math score: if a family is wealthy but does not buy a personal tablet for a child it might value education more and try to avoid contributing to the development of bad addictive gaming habits in their children.

Summary

  • The model based solely on the number of books at home performed better than the one based solely on the tablet-related predictors, but the addition of the latter predictors to the former model did significantly improve it. Therefore, the final model included both types of predictors.

  • The model that included the interaction between educational aspirations and the number of books at home was better than the one without it and the interaction was statistically significant. The final model explained around 20% of the variance.

  • There were three factors identified - “Digital exposure”, “Independence and personal space” and “Wealth”. The model fit was worse than that of a model with five factors, but the results were better in terms of the interpretation. None of the standardized alpha values exceeded 0.7 and couldn’t be improved if any of the variables were dropped.

  • The interaction effect of MR3 (Wealth) and presence of the personal tablet was significant, but the addition of this effect did not significantly improve the model.

In conclusion, we have observed that the model with both tablet-related predictors and the number of books at home performed better than the model based solely on the latter. Just as it was hypothesized, the predicted math scores increased with the increase in the number of books at home, decreased in presence of the personal tablet (most probably because kids are more likely to use it to play games and get distracted), increased in presence of the shared tablet (probably, because in this case the usage of the tablet is limited either by the purposes or by the time). The interaction between the number of books and educational aspirations was significant and proved that the predicted math scores were higher on all “books” variables levels for those who had higher aspirations. The addition of the interaction effect did significantly improve the model and was added to the final version of it. The addition of the factor scores of one of the three identifies factors did not significantly improve the model even though the interaction effect was statistically significant.

Sources

Enriquez, A. (2010). Enhancing Student Performance Using Tablet Computers. College Teaching, 58(3), 77-84.

Evans, M., Kelley, J., & Sikora, J. (2014). Scholarly Culture and Academic Performance in 42 Nations. Social Forces, 92, 1573-1605.

McEwen, N., & Dubé, A. K. (2015). Engaging or Distracting: Children’s Tablet Computer Use in Education. Journal of Educational Technology & Society, 18(4), 9-23.

TIMSS and PIRLS, International Study Center. Distribution of Mathematics Achievement URL: http://timssandpirls.bc.edu/timss2015/international-results/timss-2015/mathematics/student-achievement/