Michael S. Matthews, Matthew T. McBee, and Scott J. Peters
April 29, 2017
African American
Native American
Latino/a
Low-income
English language learners (ELLs)
The RI compares eaach demographic group’s representation in gifted education with their share of the population.
An RI of 1.0 indicates perfect proportionality.
\[\begin{equation} \text{RI}=\frac{\text{% gifted}}{\text{% total}} \end{equation}\]Group | RI |
---|---|
Male | 0.96 |
Female | 1.04 |
Latino/a | 0.67 |
Am Indian/Alaskan | 0.61 |
Asian | 2.02 |
Hawiian/Pacific Islander | 0.69 |
Black | 0.6 |
White | 1.13 |
Multiracial | 1.05 |
LEP/ELL | 0.27 |
IDEA | 0.25 |
Group membership related to test performance
Status changes over time
Tests and programs are generally language-intense
Teacher nominations particularly problemmatic
Data from the Office of Civil Rights
Teacher / Parent Nomination
Screening via group-administered ability test
Confirmatory testing via individually-administered ability test
Plan A
135 screening cutoff
130 confirmatory test cutoff
Plan B
120 screening cutoff
115 confirmatory test cutoff
non-ELL | ELL | |
---|---|---|
Screening test cutoff | 135 | 120 |
Test cutoff | 130 | 115 |
Population size | 175,268 | 24,732 |
Number nominated | 22,368 | 432 |
Number identified under Plan A | 11,184 | 92 |
Number identified under Plan B | 265 | |
Total number identified | 11,184 | 357 |
Nomination rate | 0.128 | 0.017 |
Identification rate under Plan A | 0.064 | 0.004 |
Total identification rate | 0.064 | 0.014 |
Nomination pass rate under Plan A | 0.500 | 0.213 |
Total nomination pass rate | 0.500 | 0.826 |
The classical test theory model is a reasonably accurate portrayal of measurement errors and resulting diagnostic classification errors.
The informal teacher or parent nomination process can be modeled as an implicit measurement that is compared against some cutoff (see McBee et al, 2016 for discussion).
The nomination, screening, and confirmatory test scores follow a multivariate normal distribution.
The ELL population mean cognitive ability score is 100.
The screening and confirmatory tests do not suffer from systematic bias. This means that their effective cutoffs as specified by the school district (\(\nu=1.33\), \(\tau=1.0\) in the standardized metric).
Structural equation model for generating the Theta matrix
Grade | Opportunities |
---|---|
Kindergarten | 1 |
1st | 2 |
2nd | 3 |
3rd | 4 |
4th | 5 |
5th | 6 |
The state vector (\(\boldsymbol{S}_i\)) describes the proportion of students in each possible state: true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN)
\(\boldsymbol{S}_0\) is the initial state, before any opportunities for identification. Therefore, all the gifted students begin as false negatives, the non-gifted as true negatives. Using the IQ=115 Plan B cutoff:
\[\begin{equation} \boldsymbol{S}_0 = \begin{bmatrix} 0 & 0.84 & 0 & 0.16 \end{bmatrix} \end{equation}\]where the order is TP, TN, FP, FN.
The transition matrix (\(\boldsymbol{T}\)) gives the probabilities of moving from one state to another. Rows describe the previous state, columns describe the next state.
\[\begin{equation}\label{transition} \boldsymbol{T} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & (1-\text{fpr}) & \text{fpr} & 0 \\ 0 & 0 & 1 & 0 \\ \text{sens} & 0 & 0 & (1-\text{sens}) \end{bmatrix} \end{equation}\]where the order is TP, TN, FP, FN; fpr is the false positive rate and sens the sensitivity.
Note: the values in this matrix assume that gifted identification is permanent once granted.
When there is no system memory, each chance is independent. The proportion of TPs, TNs, FPs, and FNs after \(i\) chances can be calculated using a Markov chain model.
\[\begin{equation} \boldsymbol{S}_i = \boldsymbol{S}_0 \boldsymbol{T}^i \end{equation}\]Gifted identification probably isn’t memoryless.
A student tested last year is probably less likely to be tested again this year.
This can be modeled by a generalized Markov model, introducing a memory parameter (\(m\)) to the transition matrix equation.
\[\begin{equation}\label{transition_m} \begin{split} \boldsymbol{T}_{i,m} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & (1-\text{fpr})^{i^m} & \text{fpr}^{i^m} & 0 \\ 0 & 0 & 1 & 0 \\ \text{sens}^{i^m} & 0 & 0 & (1-\text{sens})^{i^m} \end{bmatrix} \end{split} \end{equation}\]The implication is a different transition matrix for each \(i\) opportunity.
The state vector after k opportunities is:
\[\begin{equation}\label{statevector_m} \boldsymbol{S}_{k,m} = \boldsymbol{S}_0 \prod_{i=1}^k \boldsymbol{T}_{i,m} \end{equation}\]We knew the screening test cutoff (\(\nu=120\)), the confirmatory test cutoff (\(\tau=115\)) and could approximate the nomination reliability (\(\rho_{NN}=0.60\)), screener reliability (\(\rho_{SS}=0.90\)), and confirmatory test reliability (\(\rho_{CC}=0.95\)).
We did not know the screening test validity (\(r_{CS}\)), the implicit nomination cutoff (\(\kappa\)), the nomination validity coefficient (\(r_{NC}\)), or the memory parameter (\(m\)).
Using the previously-presented equations, we calculated the system performance statistics under every possible combination of:
After eliminating impossible correlations, we calculated the multiyear performance statistics for each of the previous combination of parameters at the following:
memory: 0 to 2 (increment .05)
For each, we calculated marginal perforance statistics by averaging over the statistics after each of six chances. So the marginal is based on \(\frac{1}{6}\) having one chance (Kindergarteners), \(\frac{1}{6}\) having two chances (1st graders), and so on.
Performance statistics were calculated for 41,123 combinations of parameters.
We then selected the subset of these best able to reproduce the ELL data. A total of 2,335 combinations had RMS error values smaller than 0.01.
Upper panel: nomination cutoff. Middle panel: memory parameter. Bottom panel: nomination validity coefficient
We considered: what would happen to the performance metrics under less restrictive nomination cutoffs?
We analyzed nomination cutoffs of \(\kappa \in \{100, 105, 110, 115\}\).
We used the plausible values of the nominatation validity and screener validity and updated the calculated system performance statistics.
We first considered single-opportunity performance statistics.
Upper left panel: sensitivity. Upper right panel: false positive rate. Lower left panel: nomination hit rate. Lower right panel: identification rate."
Then we considered multi-year performance statisics, marginal over six opportunities for identification.
Since the memory parameter \(m\) is unknown, we assumed values of \(m \in \{0, .5, 1, 1.5, 2\}\).
Panels display strength of memory parameter
Panels display strength of memory parameter. Horizontal reference line indicates actual non-ELL identification rate.
Panels display strength of memory parameter.