Aadesh Neupane
2016-03-19
For completion of Coursera Data Science Specialization from Johns Hopkins University
We need to build an predictive algorithm which can predict upcoming words give some words as input. For this task, Coursera in collabration with Switfkey has provided us with the data set from which we will be build a english language corpus and predictive model.
Steps followed for this project:
n-gram with Good-Turing
Reassign the probability mass of all events that occur n times in the training data to all events that occur n-1 times
Features
This capstone project provided me with the oppurtunity to explore NLP , train and build models. Also, It allowed to me to implement the skill that I have learned during previous nine course in this specilization. This application Shinyapp is the result of the knowledge and skill earned during this data science specilization.
Future Enhancements
References