Ramon Schildknecht
September 17, 2017
“The goal of data scientists is to provide data products that people love. A lot of people will benefit from this added value. Hopefully the result of the established product gives the people some added value.”

We need to develop a model that predicts the next best word or rather next probable word given one to several input words. The problem is part of the services that SwiftKey offers.
These days people are more engaged than ever. The goal is to save those end users of various mobile devices their precious time either to get more done or have more time to relax.
Example: “I went to the” and the next probable words are “gym”, “store” or “restaurant”.
I like to try a rough estimation about a possible business value:
Assume your company hast 1'000 employees and they use five mobile company apps. If we increase their text input speed by 1% we can save 220 (yearly working days) x 1000 (employees) x 1 (daily app usage hours) x 0.01 (percent) = 2'200 hours. Then we multiply this value with a $35 per hour salary.
The result: $77'000 dollar added value.
Every 1% increase of saved time results in approximately $77'000 a year!
1 Vocabulary = total number of words, word types = unique words, TTP = word type divided by TTP vocabulary, diversity = measure of vocabulary diversity = word types divided by sqrt(2*vocabulary)
The final accuracy is at about 30% and you have to type at least one word. The accuracy of Swiftkey is 33% but you have to usually type just two letters.
Example: “I am going to new York”
