Xavier Valdayron
25/02/2018
The data science course from the Johns Hopkins University closes with a capstone project inspired by the Swiftkey technology: creating an app that predicts the next word in a sentence.
This app buils on a training dataset from HC Corpora.
Main stages of the project:
Tha app can be found here: https://xvalda.shinyapps.io/PredNext/
We retained the stupid backoff model, here's a possible example of how it works:
I documented the whole process in more details, see the references section.
This project could undergo some further developments:
Project files: https://github.com/xvalda/PredNext—Predict-Next-Words-with-language-models
Shiny app: https://xvalda.shinyapps.io/PredNext/
I listed the many references I used during this project in the following document (3_PredNext_Language_Models), the most essential reference is:
You can find me on linkedin if you have any question or would like to connect: https://www.linkedin.com/in/xavier-valdayron-9707231