Paul van der Kooy
17 December 2016
The objective is to develop a text prediction application like those used on mobile telephones and app's like WhatsApp.
Further improvements considered but not implemented because the results were not significant or due to time pressure to deliver the product in the allocated time.
agrep or Levenshtein distance to overcome small differences in spelling caused by typo's or grammarMost drawbacks of the N-grams method are related to its statistical background and missing context of the provided text.
"lot of food" versus "lot of different""Often I go running, because I love ???"."love you" before "love it".
predictScore = The percentage correctly predicted words
maxNgrams = The highest level of N-grams used for the test
nGramsUsed = The percentage of the N-grams data used for this test
Use the following link to start the end product: https://paulkooy.shinyapps.io/capstoneNLP/
Code for this work can be found on GitHub: https://github.com/paulkooy/Data-science-capstone