The application was created as the capstone project for Coursera’s JHU Data Science Specialization.
The idea was to take a data set provided by JHU containing some millions of text lines from news sites, twitteer and blogs, and use it to make a text prediction application, based on Natural Language Processing.
This is, given that certain word combinations (ngrams) are used more often than others, we can make an educated guess of which word(s) will follow.