Emma Sun
Feb 22nd, 2017
The goal of this exercise is to create a product to highlight the prediction algorithm that you have built and to provide an interface that can be accessed by others. For this project you must submit:
A Shiny app that takes as input a phrase (multiple words) in a text box and outputs a prediction of the next word.
A slide deck of no more than 5 slides, created with RStudio Presenter, pitching your algorithm and app as if you were presenting to your boss or an investor.
To experience the app, click here: https://emmacourserahwork.shinyapps.io/textprediction/
The process is as follows:
n-gram models are widely used in statistical natural language processing. Below are some papers and lecture notes you can check out if you are interested in this topic.
Wikipedia article on n-gram models: https://en.wikipedia.org/wiki/N-gram#n-gram_models
Michael Collins's notes on n-gram language models: http://www.cs.columbia.edu/~mcollins/courses/nlp2011/notes/lm.pdf
n-grams based on the largest publicly available English corpus: http://www.ngrams.info/
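To make the idea of n-gram next-word prediction concrete, here is a minimal sketch in Python. It is not the code behind the Shiny app, which is not shown in this write-up. It assumes a toy three-sentence corpus, whitespace tokenization, and a simple fallback from longer to shorter contexts; the names build_model and predict_next are made up for this illustration.

```python
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def build_model(corpus, max_n=3):
    """Count, for every context of 0 to max_n - 1 words, which word follows it."""
    model = defaultdict(Counter)
    for sentence in corpus:
        tokens = tokenize(sentence)
        for n in range(1, max_n + 1):
            for i in range(len(tokens) - n + 1):
                # The last token of the n-gram is the word to predict;
                # everything before it is the context.
                *context, word = tokens[i:i + n]
                model[tuple(context)][word] += 1
    return model


def predict_next(model, phrase, max_n=3):
    """Return the most frequent follower of the longest context seen in training.

    Tries the last max_n - 1 words first, then shorter contexts, and finally
    the empty context, which amounts to the most common single word.
    """
    tokens = tokenize(phrase)
    for k in range(max_n - 1, -1, -1):
        if k > len(tokens):
            continue
        context = tuple(tokens[len(tokens) - k:])
        candidates = model.get(context)
        if candidates:
            return candidates.most_common(1)[0][0]
    return None  # only reached when the model is empty


# Toy corpus, assumed purely for illustration.
corpus = [
    "the cat sat on the mat",
    "the cat sat on the sofa",
    "the dog ate the bone",
]
model = build_model(corpus)

print(predict_next(model, "the cat"))     # sat
print(predict_next(model, "purple dog"))  # ate (falls back to the context "dog")
print(predict_next(model, "zzz"))         # the (falls back to the most common word)
```

A real predictor would be trained on a much larger corpus and would usually apply a smoothing or discounting scheme, such as those covered in the lecture notes listed above, rather than this plain fallback to shorter contexts.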