Capstone Word Predictor Presentation

Gonzalo Andres Moreno
September 24, 2016

Prediction of the next word

In this capstone project, we apply data science to natural language processing:

  • The project is based on a corpus built from tweets, blogs and news sources.
  • Cleaning and data processing techniques are applied to the corpus.
  • We build a predictive model to come up with the next word.
  • Finally, we build an interactive app that uses the model to predict the next word a user would likely type.
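
As an illustration of the cleaning step, here is a minimal sketch in Python. The slides do not list the exact cleaning rules, so everything below is an assumption: the function names `clean_text` and `tokenize` are hypothetical, and the rules (lowercasing, removing URLs, keeping only letters and apostrophes) are common choices rather than the ones used in the app.

```python
import re

def clean_text(text):
    """Lowercase, drop URLs and non-letter characters, collapse whitespace."""
    text = text.lower()
    text = re.sub(r"https?://\S+", " ", text)  # remove URLs (assumed rule)
    text = re.sub(r"[^a-z' ]+", " ", text)     # keep letters, apostrophes, spaces
    return re.sub(r"\s+", " ", text).strip()   # collapse repeated spaces

def tokenize(text):
    """Split cleaned text into word tokens."""
    return clean_text(text).split()

sample = "Check THIS out: http://example.com  it's GREAT!!!"
tokens = tokenize(sample)
print(tokens)
```

The resulting token list is what the n-gram frequency tables would be built from.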

How does this Shiny App work?

1) It uses n-grams with n=1,2,3,4

2) It uses a frequency table for each n-gram size

3) It finds the three best suggested words using a backoff algorithm
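
The three steps above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the app's actual code: the names `build_tables` and `predict` are hypothetical, the toy corpus is made up, and the backoff shown is a plain count-ranked fallback without any discounting, since the slides do not say which backoff variant is used.

```python
from collections import Counter, defaultdict

def build_tables(tokens, max_n=4):
    """Return {n: {context_tuple: Counter(next_word)}} for n = 1..max_n."""
    tables = {n: defaultdict(Counter) for n in range(1, max_n + 1)}
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            gram = tokens[i:i + n]
            context, word = tuple(gram[:-1]), gram[-1]
            tables[n][context][word] += 1
    return tables

def predict(tables, text, k=3, max_n=4):
    """Suggest up to k next words, backing off from the longest context."""
    words = text.lower().split()
    suggestions = []
    for n in range(max_n, 0, -1):
        if len(words) < n - 1:
            continue  # not enough typed words for this n-gram size
        context = tuple(words[len(words) - (n - 1):])
        counts = tables[n].get(context)
        if not counts:
            continue  # context never seen: back off to a shorter one
        for word, _ in counts.most_common():
            if word not in suggestions:
                suggestions.append(word)
            if len(suggestions) == k:
                return suggestions
    return suggestions

# Toy corpus (made up for illustration only)
corpus = "i love new york i love new york i love new shoes i like new york"
tables = build_tables(corpus.split())
print(predict(tables, "i love new"))
```

With this toy corpus, "york" and "shoes" are found from the 4-gram table, and the remaining suggestion is filled in by backing off to shorter contexts.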

How the Shiny App is designed

  • The App uses n-grams with n=1,2,3 (see http://en.wikipedia.org/wiki/N-gram) to predict the next word in a text. If no word is predicted with the n-gram of size 3, the algorithm attempts to predict the next word with the n-gram of size 2.

  • The App is coded so that the predicted word appears in the right panel in real time. While you type, the App gives you one suggestion of what the next word can be.

  • Enjoy playing around :-). The next slide provides a simple example of how things work. Give it a try!

How to use the Shiny App

1) Write your text in the Input Text Box of the main panel. Avoid trailing spaces.

2) If you find a suggested word (without spaces) that matches your preferences, please type it so that the number of successes and failures is taken into account.

3) If none of the choices fits your preferences, keep writing in the Input Text Box.