Word Prediction Application
Robert Granger
Apr 18, 2016
What is the NLP Prediction Application?
- Simple: Predicts the next word in a phrase or sentence
- Fast: Small dataset (81 MB), quick to load
- Easy: The user just starts typing
How it Works (algorithm)
- Data: millions of lines from Twitter, blogs, and news articles
- Data is converted into thousands of n-grams (quadrigrams, trigrams, bigrams)
- Only n-grams that meet a certain criterion are kept
- Algorithm searches for matches and returns the three most frequent occurrences
- If fewer than three occurrences are found in the quadrigram dataset, it searches the trigram dataset, then the bigram dataset
- If fewer than three occurrences are found in the bigram dataset, the remaining slots are filled with the most frequent words: “the”, “to”, “and”
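The back-off search described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the application's actual code: the function names `build_ngram_table` and `predict_next`, the whitespace tokenization, the toy corpus, and the `min_count` pruning threshold are all hypothetical (the slides do not say which criterion is used to keep an n-gram). Only the three fallback words come from the slides.

```python
from collections import Counter, defaultdict

# Default fillers named in the slides.
FALLBACK_WORDS = ["the", "to", "and"]


def build_ngram_table(sentences, n, min_count=1):
    """Map each (n-1)-word context to a Counter of the words that follow it."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.lower().split()  # assumed tokenization: whitespace
        for i in range(len(tokens) - n + 1):
            context = tuple(tokens[i:i + n - 1])
            counts[context][tokens[i + n - 1]] += 1
    # Hypothetical pruning rule: keep only n-grams seen at least min_count times.
    pruned = {}
    for context, followers in counts.items():
        kept = Counter({w: c for w, c in followers.items() if c >= min_count})
        if kept:
            pruned[context] = kept
    return pruned


def predict_next(phrase, tables, k=3):
    """Return up to k guesses, backing off from the longest n-grams to the shortest."""
    tokens = phrase.lower().split()
    guesses = []
    for n in sorted(tables, reverse=True):  # e.g. 4, then 3, then 2
        if len(tokens) < n - 1:
            continue  # phrase too short to form this context
        context = tuple(tokens[-(n - 1):])
        for word, _ in tables[n].get(context, Counter()).most_common():
            if word not in guesses:
                guesses.append(word)
            if len(guesses) == k:
                return guesses
    # Fill any remaining slots with the most frequent words.
    for word in FALLBACK_WORDS:
        if word not in guesses:
            guesses.append(word)
        if len(guesses) == k:
            break
    return guesses


# Toy corpus, for illustration only.
corpus = [
    "thanks for the follow",
    "thanks for the follow",
    "thanks for the help",
    "waiting for the bus",
]
tables = {n: build_ngram_table(corpus, n) for n in (4, 3, 2)}
print(predict_next("thanks for the", tables))  # ['follow', 'help', 'bus']
```

In this toy run the quadrigram table supplies only two guesses, so the third comes from the trigram table; an unseen phrase falls through to the three default words.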
Using the User Interface
- User simply types in the white text box labeled “Type Here:”
- Top three guesses for the next word will appear on the right
How can this Application be used?
- Could be built into a texting application to speed up texting.
- Useful for guessing missing words in written documents that have become smeared or illegible.
- Application is small and not computationally intensive, so it could be used on almost any device.
NOT USEFUL for EXTREME ACCURACY
- While what is considered “good” accuracy is debatable, this application sacrifices accuracy for speed and low memory use.