Luis Hernandez
2019-12-09
Coursera Data Science Capstone Project
Predicting next words from a sentence is a very useful capability, used by text processors and modern keyboards.
Using data science processes and machine learning, this application predicts the next word after a sentence is entered.
The application also plots the five most likely next words, ranked by predicted probability.
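As an illustration of the idea, here is a minimal sketch of next-word prediction from word-pair (bigram) counts. It is written in Python for brevity, while the project itself was built in R. The function names, the tiny corpus, and the choice of a plain bigram model are assumptions made for this example, not the application's actual model.

```python
from collections import Counter, defaultdict

def build_bigram_model(sentences):
    """Count which word follows each word in the training sentences."""
    followers = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            followers[prev][nxt] += 1
    return followers

def predict_next(model, phrase, k=5):
    """Return up to k (word, probability) pairs for the word after `phrase`."""
    words = phrase.lower().split()
    if not words or words[-1] not in model:
        return []  # nothing to condition on, or last word never seen
    counts = model[words[-1]]
    total = sum(counts.values())
    # Relative frequency serves as the estimated probability.
    return [(w, c / total) for w, c in counts.most_common(k)]

# Hypothetical toy corpus, standing in for the SwiftKey data.
corpus = [
    "thanks for the help",
    "thanks for the follow",
    "thanks for your time",
    "see you at the party",
]
model = build_bigram_model(corpus)
top = predict_next(model, "Thanks for")
for word, prob in top:
    print(f"{word}: {prob:.2f}")
```

The returned list is already sorted by probability, so it could feed a five-bar chart directly.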
This experiment uses data from SwiftKey and was created to demonstrate the capability to predict words based on previously entered phrases.
A database was created so that the engine runs fast enough for real-time prediction. These are the tables:
This optimized method allowed the software to work around the restrictions in R and to perform well on large data sets.
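A database-backed lookup of this kind might look like the sketch below. The project's real tables are not reproduced here, so the `bigrams` table, its columns, and the index name are hypothetical, and Python with SQLite stands in for whatever database the R application used.

```python
import sqlite3

# In-memory database for the example; a real deployment would use a file.
conn = sqlite3.connect(":memory:")

# Hypothetical schema: one row per (previous word, next word) pair.
conn.execute("CREATE TABLE bigrams (prev TEXT, next TEXT, freq INTEGER)")
# The index lets a lookup touch only the rows for one previous word.
conn.execute("CREATE INDEX idx_bigrams_prev ON bigrams (prev)")
conn.executemany(
    "INSERT INTO bigrams VALUES (?, ?, ?)",
    [("for", "the", 2), ("for", "your", 1), ("thanks", "for", 3)],
)
conn.commit()

def lookup(conn, prev_word, k=5):
    """Return up to k (next word, frequency) rows, most frequent first."""
    return conn.execute(
        "SELECT next, freq FROM bigrams "
        "WHERE prev = ? ORDER BY freq DESC, next ASC LIMIT ?",
        (prev_word, k),
    ).fetchall()

candidates = lookup(conn, "for")
print(candidates)
```

Keeping the counts in an indexed table means only the matching rows are read per query, instead of holding the whole n-gram set in memory.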
Several methods were used to improve performance, including reducing the number of words and removing 'badwords'.
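The two reductions mentioned above can be sketched as follows, again in Python rather than the project's R. The `min_count` threshold, the function name, and the placeholder badword list are assumptions for illustration; the source does not state the actual cutoffs or word list.

```python
from collections import Counter

def clean_vocabulary(tokens, badwords, min_count=2):
    """Keep words seen at least min_count times that are not in badwords."""
    counts = Counter(tokens)
    return {
        word: count
        for word, count in counts.items()
        if count >= min_count and word not in badwords
    }

# Hypothetical sample text and a placeholder badword list.
tokens = "the cat and the dog and the darn darn bird".split()
badwords = {"darn"}
vocab = clean_vocabulary(tokens, badwords)
print(vocab)
```

Dropping rare words shrinks the tables that must be searched, and the badword filter keeps unwanted terms out of the suggestions.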
Thanks for reviewing my project!