This slide deck briefly ‘decodes’ how a simple word-prediction application, “textPredCoursera”, works. The app was:
- created for the Capstone Project of the Data Science Specialization, a 10-course series on Coursera;
- designed to predict the word most likely to follow a given piece of text, along with a few alternatives. For example, when we type “It was nice”, the app suggests “to” as the most likely next word, with alternatives such as “meeting” and “seeing” (this example is shown when the app starts);
- based on N-gram frequencies computed from written text sourced from Internet blogs, news, and Twitter.
The raw data set is available here: Coursera-SwiftKey.zip.
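
To make the N-gram idea concrete, here is a minimal Python sketch of frequency-based next-word prediction. It is not the app's actual code: the four-sentence toy corpus is invented for illustration (it is not taken from the SwiftKey data), the function names `tokenize`, `build_ngram_table`, and `predict_next` are hypothetical, and the simple "fall back to a shorter context" rule is an assumption about how unseen phrases might be handled.

```python
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def build_ngram_table(sentences, max_n=3):
    """Map each context (0 to max_n - 1 preceding words) to next-word counts."""
    table = defaultdict(Counter)
    for sentence in sentences:
        tokens = tokenize(sentence)
        for i, word in enumerate(tokens):
            for k in range(max_n):
                if i - k < 0:
                    break  # not enough preceding words for a context of size k
                context = tuple(tokens[i - k:i])
                table[context][word] += 1
    return table


def predict_next(table, text, top_k=3, max_n=3):
    """Return up to top_k likely next words, trying the longest context first."""
    tokens = tokenize(text)
    for k in range(max_n - 1, -1, -1):
        if k > len(tokens):
            continue
        context = tuple(tokens[len(tokens) - k:])
        candidates = table.get(context)
        if candidates:
            return [word for word, _ in candidates.most_common(top_k)]
    return []


# Toy corpus, invented for this example only.
corpus = [
    "it was nice to meet you",
    "it was nice to see you",
    "it was nice meeting you",
    "it was nice seeing them",
]
table = build_ngram_table(corpus)
suggestions = predict_next(table, "It was nice")
print(suggestions)
```

In this toy corpus, “to” follows “was nice” twice while “meeting” and “seeing” each follow it once, so “to” is ranked first and the other two appear as alternatives, mirroring the example in the list above. A real application would need a much larger corpus, better tokenization, and probably some form of smoothing.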