Dave Hurst
12-Dec-2014
The prediction app uses N-grams with a simple backoff strategy.
Data Preparation Steps:
Prediction Steps:
load(file=ptm.squashed.file)
phrase <- tcorpus[[1]]$content[56]
phrase
plotPhraseScore( pmat.s, phrase)
[1] "Behold the graphic design equivalent of saying \"white Hispanic\" :"
MAX_NGRAMS <- 100000
MAX_TERMS <- 15000
object_size(pmat.s) #n-gram data [1-5]
35.1 MB