Mauricio Ramos
May 24, 2018
This is a prototype Shiny application required by the capstone project of the Johns Hopkins University's Data Science Specialization.
The application takes a phrase as input and outputs a prediction of the next word using a Natural Language Processing (NLP) machine learning model trained from a large corpora of English texts from Blogs, News and Twitter available here.
It was based on technologies and techniques cited in the last slide. The overall steps taken are:
Currently it's a basic prediction algorithm that prioritizes the responsiveness and the low-memory usage.
It limits to the maximum of 4 predictor words.
It seeks the last k predictor words in the k+1 n-gram table.
If the last k predictor words weren't found it seeks the last k-1 predictor words in the k+2 n-gram tables and so on.
Type in one ore more words
Press the button
See the predicted word just bellow the button.