Loading Testing Dataset
Data Size
length(obj$click)
[1] 38980658
Avg. CTR
mean(obj$click)
[1] 0.001297926
Check if all values are 0 and 1
range(obj$click)
[1] 0 1
Distribution
table(obj$click)
0 1
38930064 50594
range(obj$price)
[1] 1 1000
plot(hist(log(obj$price)))
Warning messages:
1: In grDevices::png(f) : unable to open connection to X11 display ''
2: In grDevices::png(f) : unable to open connection to X11 display ''
dim(obj$X)
[1] 38980658 1785106
range(features)
[1] 0 38980651
quantile(features.rate[features.rate != 0], seq(0, 1, 0.1))
0% 10% 20% 30% 40%
2.565375e-08 2.565375e-08 5.130750e-08 1.026150e-07 1.282687e-07
50% 60% 70% 80% 90%
1.795762e-07 2.565375e-07 3.591525e-07 5.643825e-07 1.103111e-06
100%
9.999998e-01
sum(features.rate > 0.5)
[1] 9
sum(features.rate > 0.6)
[1] 7
sum(features.rate > 0.7)
[1] 4
sum(features.rate > 0.8)
[1] 3
sum(features.rate > 0.9)