Justin Nafe
August 18, 2015
NextWord by Justin Nafe (available on GitHub at “justinnafe/NextWord”) is an R package that lets users build models for predicting the next word. The package includes an example model, which is used in the showcase Shiny app referenced on the last slide.
The application uses token frequencies and parts of speech (POS) to predict the next word.
Building the model is a multi-step process:
The prediction algorithm uses word frequencies and the parts of speech (POS) of words drawn from the blogs corpus.
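To make the idea of frequency-plus-POS prediction concrete, here is a minimal base R sketch. It is not the NextWord package's code or API: the toy corpus, the hand-written `pos_lookup` table, and the `predict_next` function are all hypothetical names introduced for illustration, and the way POS is applied here (as an optional filter on candidate words) is an assumption rather than the package's documented method.

```r
# Toy corpus standing in for the blogs corpus (assumption: illustration only).
corpus <- c("the cat sat on the mat",
            "the cat ate the fish",
            "the dog sat on the rug")

# Split each sentence into tokens and collect adjacent word pairs (bigrams).
tokens <- strsplit(tolower(corpus), "\\s+")
bigrams <- do.call(rbind, lapply(tokens, function(w) {
  data.frame(first = head(w, -1), second = tail(w, -1),
             stringsAsFactors = FALSE)
}))

# Count how often each bigram occurs.
bigram_freq <- aggregate(n ~ first + second,
                         data = transform(bigrams, n = 1L), FUN = sum)

# Hypothetical hand-written POS lookup; a real model would use a POS tagger.
pos_lookup <- c(the = "DET", cat = "NOUN", dog = "NOUN", mat = "NOUN",
                fish = "NOUN", rug = "NOUN", sat = "VERB", ate = "VERB",
                on = "ADP")
bigram_freq$second_pos <- unname(pos_lookup[as.character(bigram_freq$second)])

# Return the most frequent follower of `word`, optionally restricted to
# followers whose POS tag is in `allowed_pos`. Ties break alphabetically.
predict_next <- function(word, allowed_pos = NULL) {
  cand <- bigram_freq[bigram_freq$first == tolower(word), ]
  if (!is.null(allowed_pos)) {
    cand <- cand[cand$second_pos %in% allowed_pos, ]
  }
  if (nrow(cand) == 0) return(NA_character_)
  cand <- cand[order(-cand$n, as.character(cand$second)), ]
  as.character(cand$second[1])
}

predict_next("the")                        # "cat" (seen twice after "the")
predict_next("sat")                        # "on"
predict_next("sat", allowed_pos = "NOUN")  # NA: no noun follows "sat" here
```

With this toy data, raw frequency alone picks the follower; the POS filter only narrows the candidates. A real model would presumably combine the two signals in a more nuanced way.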