This is a five-page web presentation, built with R Markdown, for a Natural Language Processing project. The objective is to deploy a Shiny web app that takes a phrase (multiple words) typed into a text box and outputs a prediction of the ‘next word’. For example, when someone types:
I went to the …
The web app will predict what the next word might be, for example gym, store, or restaurant.
How to achieve the objective:
- Data: Blogs, News & Twitter from a corpus called HC Corpora
- Tools: RStudio with the R packages tm, quanteda, wordcloud, tidyr, stringi/stringr, and tidytext
- Modelling: Markov chain (n-gram) model with Katz backoff
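
To make the modelling step concrete, here is a minimal sketch of next-word prediction with an n-gram Markov chain that backs off to shorter contexts. It is written in Python for brevity and is not the project's R code. The function names (`train`, `predict`), the toy corpus, and the trigram limit are all assumptions made for illustration. It is also a simplified backoff: it only falls back to a shorter context when the longer one was never seen, and it leaves out the discounting and probability redistribution that full Katz backoff adds.

```python
from collections import Counter, defaultdict


def train(sentences, max_order=3):
    """Count which word follows each context of 0 to max_order-1 words."""
    # model[context] is a Counter of the words seen right after that context.
    # The empty context () holds plain unigram counts.
    model = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, word in enumerate(tokens):
            for n in range(max_order):
                if i - n >= 0:
                    model[tuple(tokens[i - n:i])][word] += 1
    return model


def predict(model, phrase, k=3, max_order=3):
    """Return up to k candidate next words for the phrase."""
    tokens = phrase.lower().split()
    # Try the longest available context first, then back off to shorter ones.
    for n in range(min(max_order - 1, len(tokens)), -1, -1):
        context = tuple(tokens[len(tokens) - n:])
        if context in model:
            return [w for w, _ in model[context].most_common(k)]
    return []


# Hypothetical toy corpus standing in for the blogs, news and Twitter data.
corpus = [
    "i went to the gym",
    "i went to the store",
    "she went to the gym",
    "we drove to the restaurant",
]
model = train(corpus)
suggestions = predict(model, "I went to the")
print(suggestions)
```

With this toy corpus the context "to the" has been seen, so the suggestions come straight from the trigram counts. A phrase such as "walk into the" has no matching two-word context, so the sketch backs off to the one-word context "the" and ranks candidates from there.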