Cvetan Veljanovski
19/03/2021
This application “Predictor” is used to predict the next word when a user inputs series of words in the input field. Once the user inserts the series of words, the application based on that input gives suggestion words that would complete the sentence. There are three predictions:
This application can be used for different purposes as long as the need is to predict the next word from series of already inserted words.
The User Interface of the application “Predictor” is organized in two parts, as it can be seen on the left:
The next word is guessed using the frequency of the word in combinations of two, three and four words.
The frequency is calculated using text that is extracted from blogs, news and twitter that SwiftKey has provided for this project.
When using the application you will notice that the word 'it' is given, this is a default value for the prediction when there are no hint for guessing of the next word due to missing combination of the words.
The data preparation for this model normally takes around 30 minutes, where the the original data composed by news, blogs and twitts is organized and selects 50.000 random lines to create the frequency tables with the combination of two, three, and four words.
The output of this process is the data used by this app. The total size is 3Mb.
The time required by the app to search the next word in the frequency tables is roughly 0.5 seconds in average.