NR Catalog Predictions

This report analyses the demographic prediction results of Boosted Decision Tree Model trained on the NR data.

Top 20 rows of the result set:

##                           title gender     age    score
## 2605 10 THINGS I HATE ABOUT YOU Female   [2-5] 1.802336
## 2583 10 THINGS I HATE ABOUT YOU Female   [6-8] 1.861817
## 2610 10 THINGS I HATE ABOUT YOU Female  [9-11] 1.912219
## 2581 10 THINGS I HATE ABOUT YOU Female [12-14] 2.428242
## 2601 10 THINGS I HATE ABOUT YOU Female [15-17] 3.265264
## 2604 10 THINGS I HATE ABOUT YOU Female [18-20] 3.016504
## 2607 10 THINGS I HATE ABOUT YOU Female [21-24] 2.320502
## 2584 10 THINGS I HATE ABOUT YOU Female [25-29] 3.021620
## 2593 10 THINGS I HATE ABOUT YOU Female [30-34] 3.567641
## 2594 10 THINGS I HATE ABOUT YOU Female [35-39] 4.210353
## 2595 10 THINGS I HATE ABOUT YOU Female [40-44] 4.662144
## 2596 10 THINGS I HATE ABOUT YOU Female [45-49] 5.088395

A summary of the prediction set:

##                          title          gender           age       
##  10 THINGS I HATE ABOUT YOU :   30   Female:36030   [2-5]  : 4804  
##  11.22.63                   :   30   Male  :36030   [6-8]  : 4804  
##  12 MONKEYS                 :   30                  [9-11] : 4804  
##  14 DIARIES OF THE GREAT WAR:   30                  [12-14]: 4804  
##  16 AND PREGNANT            :   30                  [15-17]: 4804  
##  1600 PENN                  :   30                  [18-20]: 4804  
##  (Other)                    :71880                  (Other):43236  
##      score        
##  Min.   :-0.2748  
##  1st Qu.: 3.0824  
##  Median : 4.3618  
##  Mean   : 4.8006  
##  3rd Qu.: 6.0395  
##  Max.   :26.7000  
## 

Count of Unique Titles in the Test Set:

## [1] 2402

Histograpm of Predicted Scores:

plot of chunk unnamed-chunk-5

The density of the Predicted Scores:

plot of chunk unnamed-chunk-6

Distributional behaviour of predicted scores:

plot of chunk unnamed-chunk-7

Mean Score of each demo bracket:

##    gender     age        x
## 4    Male   [6-8] 2.503623
## 2    Male   [2-5] 2.538721
## 1  Female   [2-5] 2.602004
## 6    Male  [9-11] 2.718975
## 3  Female   [6-8] 2.842069
## 5  Female  [9-11] 3.063976
## 12   Male [18-20] 3.327857
## 14   Male [21-24] 3.348244
## 8    Male [12-14] 3.446253
## 10   Male [15-17] 3.508210
## 7  Female [12-14] 3.782942
## 16   Male [25-29] 3.934580
## 11 Female [18-20] 3.976440
## 9  Female [15-17] 4.165474
## 18   Male [30-34] 4.392505
## 13 Female [21-24] 4.614908
## 20   Male [35-39] 4.850410
## 22   Male [40-44] 5.052668
## 15 Female [25-29] 5.127233
## 24   Male [45-49] 5.376008
## 17 Female [30-34] 5.728973
## 26   Male [50-54] 6.135201
## 19 Female [35-39] 6.139818
## 21 Female [40-44] 6.289342
## 30   Male   [65+] 6.305009
## 28   Male [55-64] 6.792366
## 23 Female [45-49] 7.230381
## 29 Female   [65+] 7.840718
## 25 Female [50-54] 7.843698
## 27 Female [55-64] 8.538608

Pie Chart Age brackets by the sum of Scores:

plot of chunk unnamed-chunk-9

Bar chart indicating the ranked Age brackets by the sum of scores:

plot of chunk unnamed-chunk-10

Pie Chart indicating Gender by the sum of Scores:

plot of chunk unnamed-chunk-11

Bar chart indicating the gender by the sum of scores: plot of chunk unnamed-chunk-12

Heat map of Demo by predictions scores on all titles:

## [1] "title"  "gender" "age"    "score"
##                         title Female_[2-5] Female_[6-8] Female_[9-11]
## 1  10 THINGS I HATE ABOUT YOU     1.802336     1.861817      1.912219
## 2                    11.22.63     2.974509     3.273336      3.428418
## 3                  12 MONKEYS     3.075230     3.259543      3.143986
## 4 14 DIARIES OF THE GREAT WAR     1.766652     1.881658      2.427419
## 5             16 AND PREGNANT     2.628726     2.792781      3.293288
## 6                   1600 PENN     3.608641     3.449332      3.976758
##   Female_[12-14] Female_[15-17] Female_[18-20] Female_[21-24]
## 1       2.428242       3.265264       3.016504       2.320502
## 2       4.223701       4.733591       4.231344       5.120004
## 3       4.574559       4.239665       4.333588       4.811667
## 4       3.157037       2.906526       2.265247       3.527208
## 5       3.354633       3.773571       3.666661       4.785141
## 6       4.712257       5.231903       4.554941       6.770905
##   Female_[25-29] Female_[30-34] Female_[35-39] Female_[40-44]
## 1       3.021620       3.567641       4.210353       4.662144
## 2       6.175825       6.855114       6.581566       6.647938
## 3       5.121606       5.933879       6.855699       6.261017
## 4       3.984225       5.444674       5.633984       5.756648
## 5       5.186256       5.998479       7.456518       7.782208
## 6       7.451282       7.386765       8.760273       8.356054
##   Female_[45-49] Female_[50-54] Female_[55-64] Female_[65+] Male_[2-5]
## 1       5.088395       6.517895       7.909960     7.264472   1.551064
## 2       6.643283       8.601527       7.759574     3.936278   2.999649
## 3       7.730012       7.718658       7.705619     7.282412   2.056509
## 4       7.416484       8.475748       9.624836     5.193168   1.704469
## 5       7.826052       7.168458       9.572624     9.521175   1.853084
## 6      10.791605      10.254705      10.919290    10.949645   2.508597
##   Male_[6-8] Male_[9-11] Male_[12-14] Male_[15-17] Male_[18-20]
## 1   1.551109    1.560587     1.980615     2.207429     2.180390
## 2   2.985757    3.267674     4.053395     4.298790     3.710979
## 3   2.243221    2.257262     2.742471     2.550031     2.462600
## 4   1.450483    2.051251     3.115283     2.888333     2.433157
## 5   2.082242    2.136810     2.743986     2.241293     2.055996
## 6   1.619166    2.335706     2.822180     2.981873     2.654560
##   Male_[21-24] Male_[25-29] Male_[30-34] Male_[35-39] Male_[40-44]
## 1     1.092943     1.964153     2.510173     3.144986     3.516248
## 2     3.949586     5.059438     5.385683     5.414477     5.195742
## 3     2.635873     2.960226     3.863172     4.629673     4.422003
## 4     2.892704     3.450329     4.573165     4.762475     4.874094
## 5     2.495761     2.830602     3.419202     4.960741     5.088940
## 6     3.586306     4.266683     4.202165     5.749847     5.171454
##   Male_[45-49] Male_[50-54] Male_[55-64] Male_[65+]
## 1     3.208461     4.641830     5.693545   5.644154
## 2     4.618092     6.600969     5.360947   2.208800
## 3     4.960439     4.993796     4.759609   4.767396
## 4     6.086612     7.486463     8.620275   4.370435
## 5     4.628096     4.251280     6.604912   6.737432
## 6     7.240077     6.788130     7.271012   7.616503
## [1] 2402   31
## [1] Female_[55-64] Female_[50-54] Female_[45-49] Female_[55-64]
## [5] Female_[55-64] Female_[65+]  
## 20 Levels: Female_[12-14] Female_[15-17] Female_[18-20] ... title

plot of chunk unnamed-chunk-13

Titles By Gender Split:

plot of chunk unnamed-chunk-14

Count of Female Titles:

## [1] 2110

Count of Male Titles:

## [1] 291

Count of Negative predictions:

## 
##    -1     1 
##     3 72057