Create a lightweight text prediction application that anticipates the next word in a phrase.
Given three large (~ 200 Mb) txt files from blog, news and twitter scrapes, contruct a codebase for cleaning, crunching and showing the best guesses for the next word.
Utilize ngrams and a Kat'z back-off model to estimate the next word based on observed frequencies.