Juan Pablo Moraga
31/12/2016
Predict the next word
This presentation is created as part of the requirement for the Coursera Data Science Capstone Course. The goal of the project is to build a predictive text model combined with a shiny app that predict the next word as the user types a sentence similar to the way most smart phone keyboards are implemented today.
Before building the word prediction algorithm, data are first processed and cleaned as steps below:
The prediction model for next word is based on the Katz Back-off algorithm. Explanation of the next word prediction flow is as below: