Michael Lee
January 17, 2016
Coursera / Johns Hopkins University Data Science Specialization Capstone Project
Our goal was to create an algorithm for predicting the next word given one or more words as input. A large corpus of more than 4 million documents was loaded and analyzed. N-grams were extracted from the corpus and then used for building the predictive model. Various methods of improving the prediction accuracy and speed were explored.