Data Science Capstone Coursera

Sofian Hamiti

The aim of the project is to develop a Shiny app that can predict the next word of a user typing a sentence.

This function is particularly useful for keyboard applications such as Swiftkey.

The prediction model is based on an n-gram model built from 3 very large corpora: Twitter, News, and Blogs.

Due to technology limitations, the model is trained on a subset of the data.

The App implements the Naive Bayes model, which has the following advantages:

Future improvements: In the future, this app will include the following improvements:

An option to choose different prediction models, such as Kneser-Ney and Back off
Top 5 predictions table
Better performance and accuracy