JMFlin
The purpose of this project is to build a natural language model that predicts the next unseen word in a user specified sentence input. Three types of data including twitter, news and blogs were used to train the model.
The following steps were performed to clean the data files
The next word prediction model is based on the Katz Back-off algorithm.
A Shiny application was developed based on the next word prediction model described previously.