WhatsNext Capstone Project Pitch

Mike Wehinger
Mar 2019

Introduction

This presentation introduces WhatsNext, a text prediction app built for the Capstone project of the Johns Hopkins University Data Science Specialization, offered through Coursera in partnership with SwiftKey. It covers:

  • How to use the app
  • How it works
  • Performance Summary

The App

The WhatsNext App demonstrates a text prediction algorithm. The app is available on shinyapps.io.

Instructions

  1. Enter text in the top box.

  2. The three most likely next words are displayed.

  3. The app also shows how it processes the text and calculates the probabilities.

WhatsNext App

Under The Hood


Determining the most likely next words

  1. Clean incoming utterances
  2. Match to n-gram frequency tables, backing off to lower-order n-grams when no match is found
  3. Calculate probabilities
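
The three steps above can be sketched in a few lines of Python. This is only an illustration, not the app's actual code: the function names (clean, build_tables, predict), the cleaning rules, and the use of plain relative frequencies with simple back-off are all assumptions made for the example. The app may use a different smoothing or scoring scheme.

```python
import re
from collections import Counter, defaultdict


def clean(text):
    """Lowercase, keep only letters, apostrophes and spaces, then split into tokens."""
    text = re.sub(r"[^a-z' ]+", " ", text.lower())
    return text.split()


def build_tables(corpus, max_order=4):
    """For each order n, map a context of n-1 words to a Counter of next words."""
    tables = {n: defaultdict(Counter) for n in range(1, max_order + 1)}
    for line in corpus:
        tokens = clean(line)
        for n in range(1, max_order + 1):
            for i in range(len(tokens) - n + 1):
                *context, word = tokens[i:i + n]
                tables[n][tuple(context)][word] += 1
    return tables


def predict(tables, utterance, k=3):
    """Return up to k (word, probability) pairs, backing off to lower orders."""
    tokens = clean(utterance)
    for n in range(max(tables), 0, -1):
        context = tuple(tokens[-(n - 1):]) if n > 1 else ()
        if len(context) < n - 1:
            continue  # utterance is too short for this order
        counts = tables[n].get(context)
        if counts:
            total = sum(counts.values())
            # Relative frequency of each candidate within the matched context
            return [(word, count / total) for word, count in counts.most_common(k)]
    return []


# Hypothetical toy corpus, for illustration only
corpus = [
    "in the early part of the day",
    "in the early part of the year",
    "the early part was slow",
]
tables = build_tables(corpus)
print(predict(tables, "In the early part"))
```

With this toy corpus the highest-order match is found immediately, so no back-off is needed. An utterance ending in an unseen context would fall through to the trigram, bigram, and finally unigram tables.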

The table (right) shows the n-gram matches for the utterance "in the early part".

The word "of" has the highest probability of being the next word.

nGram Matches

Performance

The user experience should be quick and responsive.

A balance between predictor accuracy and app processing time was found by measuring two processes:

  • Loading models upon arrival
  • Finding utterance matches in models
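
As a rough sketch of how the two processes could be timed, the Python snippet below records repeated wall-clock measurements and reduces them to the five-number summary that a box plot displays. The functions load_models and find_matches are hypothetical stand-ins, not the app's real code, and the repeat count is an arbitrary choice.

```python
import statistics
import time


def time_calls(func, repeats=5):
    """Return the wall-clock duration in seconds of each of `repeats` calls to func()."""
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        func()
        durations.append(time.perf_counter() - start)
    return durations


def summarize(durations):
    """Five-number summary of the durations (the values a box plot draws)."""
    q1, median, q3 = statistics.quantiles(durations, n=4)
    return {
        "min": min(durations),
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": max(durations),
    }


# Hypothetical stand-ins for the two measured processes
def load_models():
    return {("the", "early", "part"): {"of": 2, "was": 1}}


def find_matches():
    return load_models().get(("the", "early", "part"))


for name, func in [("load models", load_models), ("find matches", find_matches)]:
    print(name, summarize(time_calls(func, repeats=10)))
```

Repeating each measurement matters because a single timing is noisy. Comparing the summaries across model sizes is one way to see where larger n-gram tables stop being worth the extra load time.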

Optimal app performance & accuracy

Performance Box Plot