Purpose/Context:

Understand person-level metrics for the Devontez lead using both the manually vetted ("golden") dataset and the output of our v2 matching processes. Compare the statistics derived from the v2 matching output against those derived from the manually vetted dataset in order to evaluate the performance of the v2 matching processes.
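As a rough illustration of how a per-person metric and its six-number summary could be produced for either dataset, here is a minimal Python sketch. It is not the code behind this report: the record layout, the field names (`person_id`, `ad_id`), and both helper functions are hypothetical, and the quartile method is chosen on the assumption that the summaries in this report use R's default quantile definition.

```python
from collections import defaultdict
from statistics import mean, quantiles


def per_person_counts(records, person_key, value_key):
    """Count the distinct values of value_key seen for each person."""
    seen = defaultdict(set)
    for rec in records:
        seen[rec[person_key]].add(rec[value_key])
    return [len(values) for values in seen.values()]


def six_number_summary(values):
    """Return min, quartiles, mean, and max for a list of numbers."""
    data = sorted(values)
    if len(data) < 2:
        # A single observation has no spread; every statistic equals it.
        only = data[0]
        return {"min": only, "q1": only, "median": only,
                "mean": only, "q3": only, "max": only}
    # method="inclusive" interpolates between the observed min and max,
    # which corresponds to R's default quantile type (assumed here).
    q1, median, q3 = quantiles(data, n=4, method="inclusive")
    return {"min": data[0], "q1": q1, "median": median,
            "mean": mean(data), "q3": q3, "max": data[-1]}


# Hypothetical toy records standing in for one of the two datasets.
toy_records = [
    {"person_id": "a", "ad_id": 1},
    {"person_id": "a", "ad_id": 2},
    {"person_id": "a", "ad_id": 3},
    {"person_id": "b", "ad_id": 4},
    {"person_id": "c", "ad_id": 5},
    {"person_id": "c", "ad_id": 6},
]

ads_per_person = per_person_counts(toy_records, "person_id", "ad_id")
print(six_number_summary(ads_per_person))
```

Under these assumptions, the same two helpers would be run once on the manually vetted records and once on the v2 output, swapping `value_key` for whichever attribute is being counted, so that the two summaries can be read side by side.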

A note on terminology/methods/decisions:

Ads Per Person

Manually Vetted Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##     1.0     4.0    13.0    57.7    43.0  1103.0
Processed Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##    1.00    1.00    5.00   83.24   22.50 2035.00

Cities Per Person

Manually Vetted Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   1.000   1.000   2.000   4.034   4.000  66.000
Processed Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   1.000   1.000   1.000   4.445   3.000  85.000

States Per Person

Manually Vetted Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   1.000   1.000   1.000   2.298   2.000  21.000
Processed Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   1.000   1.000   1.000   2.218   2.000  31.000

Phones Per Person

Manually Vetted Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   1.000   1.000   3.000   4.226   5.000  53.000
Processed Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   1.000   1.000   1.000   4.983   3.000 161.000

Days Elapsed Across Ads Per Person

Manually Vetted Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##    0.00    6.75   54.00  115.81  172.25  757.00
Processed Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##    0.00    0.00    6.00   84.84   76.00  772.00

Ages Per Person

Manually Vetted Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   1.000   1.000   2.000   2.274   3.000  12.000
Processed Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   1.000   1.000   1.000   1.916   2.000  12.000

Age Ranges Per Person

Manually Vetted Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   0.000   0.000   1.000   2.625   3.000  30.000
Processed Data
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##    0.00    0.00    0.00    3.05    2.00   30.00