Yann Claudel
22/03/2017
The purpose is to implement an algorithm that predicts an upcoming word given the first words of a sentence.
The algorithm will predict a list of potential words based on a corpus.
This corpus is built from three input files.
The content comes from Twitter, blogs, and news.
The probability of a word depends only on the k previous words.
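In symbols, this assumption can be written as below. The notation is introduced here for illustration only: w_i stands for the i-th word of the sentence.

```latex
P(w_i \mid w_1, \ldots, w_{i-1}) \approx P(w_i \mid w_{i-k}, \ldots, w_{i-1})
```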
The algorithm searches the 5-grams, 4-grams, 3-grams, and 2-grams.
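The search over n-gram tables could be sketched in Python as follows. This is a minimal illustration, not the application's actual code: the function names build_ngram_tables and predict_next are hypothetical, tokens are split on whitespace, candidates are ranked by raw counts with no smoothing, and the longest n-grams are assumed to be tried first.

```python
from collections import Counter, defaultdict


def build_ngram_tables(sentences, max_n=5):
    """Map each (n-1)-word context to a Counter of the words that follow it."""
    tables = {n: defaultdict(Counter) for n in range(2, max_n + 1)}
    for sentence in sentences:
        tokens = sentence.lower().split()  # assumption: whitespace tokenization
        for n in range(2, max_n + 1):
            for i in range(len(tokens) - n + 1):
                context = tuple(tokens[i:i + n - 1])
                tables[n][context][tokens[i + n - 1]] += 1
    return tables


def predict_next(tables, text, top=3, max_n=5):
    """Return up to `top` candidate words, trying the longest context first."""
    tokens = text.lower().split()
    for n in range(max_n, 1, -1):
        if len(tokens) < n - 1:
            continue  # not enough typed words for this n-gram order
        context = tuple(tokens[-(n - 1):])
        followers = tables[n].get(context)
        if followers:
            return [word for word, _ in followers.most_common(top)]
    return []  # no n-gram of any order matches the typed words


# Toy corpus, made up for this example only.
corpus = [
    "i want to go home",
    "i want to go out",
    "i want to sleep",
    "you want to go home",
]
tables = build_ngram_tables(corpus)
print(predict_next(tables, "they want to go", top=1))
```

In this toy run the 5-gram table has no entry for "they want to go", so the search falls back to the 4-gram context "want to go" before returning a candidate.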
See the application
https://yclaudel.shinyapps.io/appNextWord