P. Fleer
September 5, 2017
Idea SwiftKey, the corporate partner in the capstone of Johns Hopkins University's Data Science Specialization, aims at making it easier for people to type on their mobile devices. One cornerstone of their technology is predictive text models. The idea of the capstone project is to build a shiny App that emulates this technique. Starting point is a large corpus of text documents from blogs, news and twitter sources provided by Swiftkey.
Goals
Data base: Uni-, bi- and trigram tables
Stupid backoff model for prediction:
Developments: The App could be developed in different directions. In particular, we could think of feeding back the choices made by users into the n-gram tables, which would enable the App to learn from these choices and enhance predictive quality.
References (Selection)
Thanks
Particular thanks to Michael Szczepaniak, Len Greski and Fiona Elisabeth Young who provided much valuable advise.
Hope you'll like the App! (Try building the longest meaningful word sequence just by clicking on the word buttons, maybe choosing different starting words.)