How to build BigQuery ML classification

1 Objectives

  • The tasks are to perform:
    • Use BigQuery to find public datasets
    • Query and explore the ecommerce dataset
    • Create a training and evaluation dataset to be used for batch prediction
    • Create a classification (logistic regression) model in BQML
    • Evaluate the performance of your machine learning model
    • Predict and rank the probability that a visitor will make a purchase

1.1 Google Cloud Platform Console

1.2 Explore online google cloud ecommerce data

1.5 Improve model performance with Feature Engineering

  • create a second machine learning model called classification_model_2:

    • How far the visitor got in the checkout process on their first visit
    • Where the visitor came from (traffic source: organic search, referring site etc..)
    • Device category (mobile, tablet, desktop)
    • Geographic information (country)
  • Evaluate this new model for predictive power:

1.6 Predict which new visitors will come back and purchase

  • The prediction query below uses the improved classification model to predict the probability that a first-time visitor to the Google Merchandise Store will make a purchase in a later visit:

2 Conclusions:

  • More than 6% make a purchase in a later visit.
  • These users represent nearly 50% of all first-time visitors who make a purchase in a later visit.
  • Only 0.7% of first-time visitors make a purchase in a later visit.
  • Targeting the top 6% of first-time increases marketing ROI by 9 times instead of targeting them all!

DK WC

2020-01-31