Low Wei Hong
04 Sept 2017
TheNextWord is a quick and easy text prediction application.
TheNextWord can be implemented on mobile devices and offers advantages over standard text typing:
TheNextWord allows you to enter a custom word or phrase. Once you click “Predict Next Word”, TheNextWord displays your selected input before and after processing.
TheNextWord will output the most likely word in red text and a list of possible alternatives.
TheNextWord uses the HC Corpora data set determining word frequency.
The HC Corpora data set is screened and processed to removed extraneous characters and then is categorized into the most frequent word combinations (N-grams).
Using these N-gram frequencies TheNextWord can take the user submitted sentences and quickly calculate the most likely next word.
The code to the application can be found on Github
The source HC Corpora data set and associated ReadMe.