Tom Withey
02/01/19
I have developed an app which predicts the next word given an input chain of at least two words. This presentation includes:
The algorithm relies on the quanteda package to extract a 'corpus' of text data. The data used has been downloaded from the link here. This data is prepared by taking the following steps:
Before running the algorithm the six ngram frequency tables are read from the local drive as data tables, using the data.tables package. These are stored in memory. Then, based on a given input word chain, the algorithm does the following (see next slide):