Coursera Capstone Project Presentation

Next Word Prediction Model

Katherine

April 6, 2019

Overview of the model algorithm

  1. Explain the core algorithm of the model.
  2. Summarize the performance of the prediction algorith.
  3. Explain how to use the Shiny application that runs the model.

Katz’s back off model is a generative n-gram language model that estimates the conditional probability of a word given its history in the n-gram. It accomplishes this estimation by “backing-off” to models with smaller histories under certain conditions. By doing so, the model with the most reliable information about a given history is used to provide the better results.

Details of the model algorithm

Performance of the model

The performance of the model was measure in three criterias: size, speed, and accuracy.

  1. Katz’s Back-off Model: http://rpubs.com/salus0324/484129
  2. Stupid Back-off Model: http://rpubs.com/salus0324/484133

Shiny application