jeffzfw
20-8-2015
This application is the capstone project for the Coursera Data Science Specialization, taught by professors at Johns Hopkins University in cooperation with SwiftKey.
The main goal of the capstone project is to build a Shiny application that predicts the next word from the words a user has typed.
Some tasks of the project:
Read in the data and perform basic analysis: load the Twitter, news, and blog data.
Build a corpus, clean and tokenize it, then create a term-document matrix for each n-gram order (unigram, 2-gram, 3-gram, and 4-gram).
Convert each term-document matrix to a data frame, one per n-gram order, containing the term, its count, and the probability that it occurs in the data.
Create an application that performs the prediction and returns the most likely next word.
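The n-gram tables described in the tasks above can be sketched roughly as follows. This is a minimal illustration in Python, not the project's own code (the original was built in R with Shiny). The function name `ngram_table`, the whitespace tokenizer, and the sample sentences are all assumptions made for the example, and "probability" is taken here to mean a term's count divided by the total count of all terms of the same order.

```python
from collections import Counter


def ngram_table(texts, n):
    """Return {term: (count, probability)} for every n-gram in texts.

    Hypothetical helper: tokenization is a plain lowercase whitespace
    split, far simpler than a real corpus-cleaning pipeline.
    """
    counts = Counter()
    for text in texts:
        tokens = text.lower().split()
        # Slide a window of n tokens across the line.
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    total = sum(counts.values())
    # Probability = share of this term among all n-grams of order n.
    return {term: (c, c / total) for term, c in counts.items()}


# Made-up sample lines standing in for the Twitter, news, and blog data.
sample = ["I love data science", "I love data"]
bigrams = ngram_table(sample, 2)
fourgrams = ngram_table(sample, 4)
```

Each returned dictionary plays the role of one data frame: the key is the term, and the value holds its count and probability.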
Type more than one word into the text input box under “Input words please:”, then click Submit or press the Enter key. The predicted next word appears in the main panel on the right, followed by the algorithm used to predict it. To go to the app, please click here.
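How a prediction might be returned together with the algorithm that produced it can be sketched as below. The source does not say which algorithm the app uses, so the simple back-off scheme here (try the longest context first, then fall back to shorter ones) is an assumption, as are the function name `predict_next`, the table layout, and the counts, which are invented for illustration. The sketch is in Python, whereas the app itself was built in R with Shiny.

```python
def predict_next(words, tables):
    """Return (predicted_word, algorithm_label) using simple back-off.

    `tables` maps a context length k to {context: {next_word: count}}.
    This is an assumed layout, not the app's actual data structure.
    """
    tokens = words.lower().split()
    # Try the longest available context first, then back off.
    for k in sorted(tables, reverse=True):
        if len(tokens) < k:
            continue
        context = " ".join(tokens[-k:])
        candidates = tables[k].get(context)
        if candidates:
            best = max(candidates, key=candidates.get)
            # A context of k words plus the predicted word is a (k+1)-gram.
            return best, f"{k + 1}-gram"
    return None, "no match"


# Invented counts, for illustration only.
tables = {
    3: {"thanks for the": {"follow": 5, "help": 2}},
    2: {"for the": {"first": 4, "follow": 3}},
    1: {"the": {"same": 9, "first": 6}},
}

result_full = predict_next("Thanks for the", tables)
result_backoff = predict_next("wait for the", tables)
```

With these invented tables, the first call matches a three-word context and the second falls back to a two-word context, so the label returned alongside the word tells the user which n-gram order was used.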