LEADS Stats Analysis

HTI Labs

Purpose/Context

We want to get an understanding of features and relationships between features in our online commerical sex data that's been aggregated for use in LEADS, Specifically, feature breakdowns by content classification and by domain.

By Content Classification

Units of Observation

Numer of Unique Ads/CS Profiles by Content Classification Category
Content Classification Count
Sex Provider 1334601
Remote only 41745
Business 33068

Multiple Sex Providers

Remote Only

Services

Venues

Category

Price Per Hour

Descriptive Statistics: Price Per Hour Rate by Content Classification
Content Classification Min Q1 Median Mean Q3 Max Std. Deviation
Sex Provider 0 150 200 224.7706 280 700000 1455.858
Remote only 0 50 120 161.6723 225 2000 148.526
Business 0 50 60 170.1607 160 31374 1224.481

By Domain

Units of Observation

Numer of Unique Ads/CS Profiles by Domain Category
Domain Count
listcrawler 630610
skipthegames 553766
adultsearch 109208
backpage 102409
sipsap 9527
theeroticreview 3893

Multiple Sex Providers

Remote Only

Services

Venues

Category

Price Per Hour

Descriptive Statistics: Price Per Hour Rate by Domain
Content Classification Min Q1 Median Mean Q3 Max Std. Deviation
theeroticreview 6 400 467 505.5365 600 10229 547.9293
adultsearch 1 200 200 254.1630 300 32767 255.8609
listcrawler 0 120 200 214.3640 280 10000 171.2622
skipthegames 0 120 200 206.6457 250 10752 191.6172
sipsap 1 100 150 357.0455 200 700000 10114.4844
backpage 1 50 60 182.3874 120 31374 1432.5794