Yusak Rabin
April 26th, 2015
Capstone Project for Data Science Specialization
by Johns Hopkins University - Coursera
in partnership with
Minimize the number of key presses by predicting the next word in an incomplete sentence.
Solution: https://yrabin.shinyapps.io/wordpredict/
The application uses an N-gram model with a heuristic method to build its database.
Using tm and RWeka, 2-, 3-, and 4-grams are extracted from over 700K sampled lines.
N-grams are built from the dataset both with and without stopwords removed; this is to improve interactivity.
The application implements NLP (Natural Language Processing) for next-word prediction with a good balance of size, speed, and accuracy.
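The N-gram lookup described above can be illustrated with a minimal sketch. It is written in Python rather than with the tm and RWeka packages the application uses, and it is not the application's actual code. The names `tokenize`, `build_ngram_table`, and `predict_next`, the toy corpus, and the longest-context-first backoff rule are all assumptions made for illustration. The stopword-removed variant of the table is left out.

```python
from collections import Counter, defaultdict

PUNCT = ".,!?;:\"'()"

def tokenize(line):
    # Lowercase, split on whitespace, and strip surrounding punctuation.
    words = (w.strip(PUNCT) for w in line.lower().split())
    return [w for w in words if w]

def build_ngram_table(lines, orders=(2, 3, 4)):
    # Map each (n-1)-word context to a Counter of the words that follow it.
    table = defaultdict(Counter)
    for line in lines:
        tokens = tokenize(line)
        for n in orders:
            for i in range(len(tokens) - n + 1):
                context = tuple(tokens[i:i + n - 1])
                table[context][tokens[i + n - 1]] += 1
    return table

def predict_next(table, text, max_context=3, top_k=3):
    # Try the longest available context first, then back off to shorter ones.
    tokens = tokenize(text)
    for size in range(min(max_context, len(tokens)), 0, -1):
        context = tuple(tokens[-size:])
        if context in table:
            return [w for w, _ in table[context].most_common(top_k)]
    return []  # no known context matched

# Hypothetical toy corpus standing in for the sampled lines.
corpus = [
    "thank you for the help",
    "thank you for the update",
    "thank you for the help today",
    "see you for the game",
]
table = build_ngram_table(corpus)
print(predict_next(table, "Thank you for the"))
```

With this toy corpus the top suggestion after "Thank you for the" is "help", since it follows that context most often. Storing counts per context keeps lookup to a few dictionary accesses, which is one way to trade table size against speed.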
A word-completion feature is added for better interactivity.
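The slides do not say how word completion is implemented. One plausible approach, sketched below in Python under that assumption, is to match the partially typed word against a vocabulary ranked by frequency. The function name `complete_word` and the sample vocabulary are hypothetical.

```python
from collections import Counter

def complete_word(vocab_counts, prefix, top_k=3):
    # Return the most frequent vocabulary words that start with the prefix.
    prefix = prefix.lower()
    if not prefix:
        return []
    matches = [(w, c) for w, c in vocab_counts.items() if w.startswith(prefix)]
    # Sort by descending count, then alphabetically to break ties.
    matches.sort(key=lambda wc: (-wc[1], wc[0]))
    return [w for w, _ in matches[:top_k]]

# Hypothetical word counts standing in for the application's vocabulary.
vocab_counts = Counter("the then there the they the then".split())
print(complete_word(vocab_counts, "the"))  # ['the', 'then', 'there']
```

A linear scan is adequate for a small vocabulary; a larger one would likely call for a sorted list with binary search or a trie.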
Future improvements require more computing power for: