NICA-Exploratory Data Analysis


This is a table of the average, median, and standard deviation for each division.

Division Gender avgtime sdtime medtime
7 F 0 00:25:20 0 00:03:50 0 00:24:50
7D1 M 0 00:24:21 0 00:05:20 0 00:22:46
7D2 M 0 00:24:17 0 00:04:49 0 00:23:30
8 F 0 00:25:14 0 00:04:32 0 00:24:10
8D1 M 0 00:22:53 0 00:04:10 0 00:21:57
8D2 M 0 00:23:02 0 00:04:15 0 00:22:23
F F 0 00:48:07 0 00:11:00 0 00:46:15
FD1 M 0 00:42:41 0 00:07:20 0 00:40:49
FD2 M 0 00:43:21 0 00:07:23 0 00:42:12
JV2 F 0 00:52:10 0 00:10:24 0 00:50:06
JV2 M 0 00:40:56 0 00:04:35 0 00:39:42
JV2D1 M 0 00:43:33 0 00:06:43 0 00:41:51
JV2D2 M 0 00:43:38 0 00:06:59 0 00:42:36
JV3 F 0 01:06:31 0 00:08:32 0 01:05:18
JV3 M 0 00:57:26 0 00:06:40 0 00:56:47
V F 0 01:22:50 0 00:11:20 0 01:21:22
V M 0 01:11:05 0 00:09:19 0 01:09:12

Histograms of the total time in seconds for each division.

The Division System Exploration

As you can see below, all D1 and D2 groups have very similar means, medians and standard deviations. If you look at the density plots below you can also see that there is little differecne between the two divisions after accounting for the number of individuals in each division.

FD1/FD2

Division Gender avgtime sdtime medtime
FD1 M 0 00:42:41 0 00:07:20 0 00:40:49
FD2 M 0 00:43:21 0 00:07:23 0 00:42:12

JV2D1/JV2D2

Division Gender avgtime sdtime medtime
JV2D1 M 0 00:43:33 0 00:06:43 0 00:41:51
JV2D2 M 0 00:43:38 0 00:06:59 0 00:42:36

8D1/8D2

Division Gender avgtime sdtime medtime
8D1 M 0 00:22:53 0 00:04:10 0 00:21:57
8D2 M 0 00:23:02 0 00:04:15 0 00:22:23

7D1/7D2

Division Gender avgtime sdtime medtime
8D1 M 0 00:22:53 0 00:04:10 0 00:21:57
8D2 M 0 00:23:02 0 00:04:15 0 00:22:23

Missing Previous Race

Here are the results of missing the previous race. On average after missing one race it isnt a very big deal. Individuals who missed the previous race on average change about 1 more position than individuals who didnt miss the previous race.

missedone missednone
8.406279 7.428597

Missing Previous Two Races

Here are the results of missing the previous two races. Individuals who missed the previous two races on average change about 5 more position than individuals who didnt miss the previous race two. To me this seems to be a more serious problem.

missedtwo missednone
10.90444 6.634261

Mean Change in Position by number of races attended, gender, and division.

mean.change.in.position mean.sd.change.in.position Division Gender num.races.attended
6.225000 7.4312464 7D1 M 1
3.588889 2.7531973 7D1 M 2
5.944444 4.6657570 7D1 M 3
7.030556 5.5491471 7D1 M 4
4.532273 2.6070659 7D1 M 5
12.144444 9.6289218 7D2 M 1
5.340000 4.2427942 7D2 M 2
10.362362 5.3178415 7D2 M 3
9.325463 5.0047894 7D2 M 4
7.079246 5.5358342 7D2 M 5
7.000000 NaN 7 F 1
5.666667 0.0000000 7 F 2
5.791667 3.5783751 7 F 3
5.783862 4.0955716 7 F 4
3.576058 3.3734979 7 F 5
6.000000 4.3349061 8D1 M 1
10.600000 6.5142040 8D1 M 2
9.463333 5.7625429 8D1 M 3
7.000830 5.0740845 8D1 M 4
4.119279 3.9573973 8D1 M 5
15.819444 10.6722276 8D2 M 1
10.391667 6.4749712 8D2 M 2
16.108333 8.0994533 8D2 M 3
12.509785 7.4325588 8D2 M 4
7.828871 6.3400412 8D2 M 5
4.000000 NaN 8 F 1
5.566667 2.3261140 8 F 2
4.527778 4.5953619 8 F 3
3.271825 2.8224417 8 F 4
3.097084 2.3768596 8 F 5
15.166667 16.6950320 FD1 M 1
10.152778 3.9281821 FD1 M 2
15.693333 15.3265205 FD1 M 3
16.331867 15.0663368 FD1 M 4
8.920810 9.4689567 FD1 M 5
15.000000 18.9718488 FD2 M 1
18.858333 12.9380371 FD2 M 2
12.529960 9.4166509 FD2 M 3
11.962710 8.9167198 FD2 M 4
8.926891 8.6059967 FD2 M 5
7.000000 20.0000000 F F 1
2.854167 5.8555972 F F 2
4.100000 2.8988747 F F 3
2.902182 3.0136585 F F 4
5.616667 3.3891254 F F 5
18.250000 9.0000000 JV2D1 M 1
11.750000 19.0086982 JV2D1 M 2
14.642857 11.1397019 JV2D1 M 3
7.112958 7.6805836 JV2D1 M 4
8.576923 8.4478209 JV2D1 M 5
10.078571 10.9130209 JV2D2 M 1
18.224286 16.9837352 JV2D2 M 2
14.983333 12.1274941 JV2D2 M 3
15.176117 12.8094996 JV2D2 M 4
14.350582 12.9380485 JV2D2 M 5
10.125000 5.1516504 JV2 F 1
4.625000 2.0118446 JV2 F 2
9.760000 9.6757510 JV2 F 3
4.884343 3.7906291 JV2 F 4
2.918651 2.9816082 JV2 F 5
13.861111 11.3041256 JV2 M 1
2.500000 1.0606602 JV3 F 1
9.500000 1.4142136 JV3 F 3
3.281597 2.2470206 JV3 F 4
2.560000 1.9648970 JV3 F 5
25.750000 4.9497475 JV3 M 1
11.888889 6.8248960 JV3 M 2
19.375000 11.9412646 JV3 M 3
10.512170 7.6294121 JV3 M 4
7.534112 6.0217139 JV3 M 5
4.125000 2.1213203 V F 3
1.884444 1.7997278 V F 4
1.766667 0.9982501 V F 5
2.500000 NaN V M 1
9.500000 NaN V M 2
12.875000 3.3409903 V M 3
4.664355 3.0642992 V M 4
4.486499 3.6672310 V M 5

GEE Modeling

Using GEE modeling we can look to see if the number of races attended has a significant effect on the change in position.

With a p-value of 0.86 we fail reject the null hypothesis. The number of races attended does not have a significant effect on change in position. The race number, division, gender, and starting position all have a significant affect on the change in position.

## 
## Call:
## geeglm(formula = diffsf ~ numracesattended + Division + totaltime + 
##     startorder + race + Place, data = geedata, id = interaction(race, 
##     Division), corstr = "ar1")
## 
##  Coefficients:
##                    Estimate    Std.err   Wald Pr(>|W|)    
## (Intercept)       5.6897966  1.7130532 11.032 0.000896 ***
## numracesattended -0.0499713  0.2465218  0.041 0.839365    
## Division7D2M      0.6060399  1.0805450  0.315 0.574890    
## Division7F       -0.9943370  0.6906058  2.073 0.149923    
## Division8D1M     -0.2753780  1.0590214  0.068 0.794840    
## Division8D2M      1.0447334  1.2243237  0.728 0.393485    
## Division8F       -1.5292316  0.6958381  4.830 0.027972 *  
## DivisionFD1M      4.2669811  1.3985644  9.308 0.002281 ** 
## DivisionFD2M      1.6792618  1.1517053  2.126 0.144822    
## DivisionFF        0.3594046  1.6390854  0.048 0.826439    
## DivisionJV2D1M    3.3885625  1.4084135  5.789 0.016131 *  
## DivisionJV2D2M    7.2538030  1.5501945 21.896 2.88e-06 ***
## DivisionJV2F      1.4952427  1.7351989  0.743 0.388846    
## DivisionJV2M      3.5025205  1.0905591 10.315 0.001320 ** 
## DivisionJV3F      0.6595845  2.2038087  0.090 0.764717    
## DivisionJV3M      1.1727374  2.2752962  0.266 0.606258    
## DivisionVF        0.0312107  2.9550718  0.000 0.991573    
## DivisionVM        0.1360948  2.2681296  0.004 0.952153    
## totaltime        -0.0003262  0.0007747  0.177 0.673705    
## startorder        0.1975407  0.0351524 31.579 1.91e-08 ***
## race2             0.0667098  1.1408504  0.003 0.953371    
## race3            -2.1001106  0.8879189  5.594 0.018020 *  
## race4            -3.1215438  0.8336035 14.022 0.000181 ***
## race5            -2.6398537  0.8776793  9.047 0.002632 ** 
## race6            -3.0965613  0.9367611 10.927 0.000948 ***
## race7            -6.8502317  1.1414288 36.017 1.96e-09 ***
## Place            -0.0462183  0.0320856  2.075 0.149735    
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Estimated Scale Parameters:
##             Estimate Std.err
## (Intercept)    80.04   8.513
## 
## Correlation: Structure = ar1  Link = identity 
## 
## Estimated Correlation Parameters:
##       Estimate Std.err
## alpha  0.08135 0.02406
## Number of clusters:   105   Maximum cluster size: 118