Manfredi Ruggeri
01/03/2022
This report describes the steps involved in building a data-driven web application that aims to predict the next word in a phrase.
Getting and cleaning Data
The raw data, three big text files provided from SwiftKey, was preprocessed:
Processing Data
This phase aims to generate tables that contain the information useful for the working of Data product.
Building the prediction algorithm
This last phase aims to show the prediction to the user. A stupid backoff model was implemented. In summary, the software starts to calculate mle for n-grams and if can't find any match, goes back to (n-1)-grams where mle has to be multiplied for a factor equal to 0.4 in every step.
Implementation details:
The app is as simple as possible and shows a good reactivity: write a text, select the number of possible words and click predict!
Go to Next Word Prediction and try it!
Thanks for reading