The app was created as part of the final project for Coursera’s Data Science Capstone by Johns Hopkins University and is available here: https://www.coursera.org/learn/data-science-project/peer/EI1l4/final-project-submission/submit
The training data set for the prediction model was prepared beforehand, as described in the Milestone report, available at https://rpubs.com/aaturki/Milestone. In short, I removed non-Latin characters, punctuation marks, digits, stopwords, and swear words (based on Google’s publicly available Bad Words list) from the raw data.
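
The cleaning steps listed above can be sketched roughly as follows. This is an illustrative Python sketch, not the code used for the Milestone report. The function name `clean_text` and the tiny `STOPWORDS` and `BAD_WORDS` sets are hypothetical placeholders; a real run would load a full stopword list and the Bad Words list from files. The sketch also assumes that "non-Latin" means anything outside the unaccented letters a to z, so accented letters are dropped as well.

```python
import re

# Hypothetical placeholder lists (assumptions for illustration only).
# A real pipeline would load a full stopword list and the Bad Words list.
STOPWORDS = {"the", "a", "an", "is", "of", "and", "to", "in"}
BAD_WORDS = {"darn", "heck"}


def clean_text(text, stopwords=STOPWORDS, bad_words=BAD_WORDS):
    """Return the cleaned tokens of `text` as a list of lowercase words."""
    text = text.lower()
    # Replace everything except a-z and whitespace with a space.
    # This drops non-Latin characters, punctuation marks, and digits at once.
    text = re.sub(r"[^a-z\s]", " ", text)
    tokens = text.split()
    # Filter out stopwords and swear words.
    return [t for t in tokens if t not in stopwords and t not in bad_words]


if __name__ == "__main__":
    print(clean_text("The 3 quick foxes, Привет! darn jumped in 2020."))
```

With the placeholder lists above, the example call prints `['quick', 'foxes', 'jumped']`. Replacing characters with a space rather than deleting them keeps words separated when punctuation sits between them without surrounding whitespace.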