Csaba Farago
2021-05-24
This is the final project of Data Science Specialization course series at Coursera.
The task was to try to guess the next word of a started sentence. More specifically:
Main idea: divide the corpus into word triplets, create a hash which calculates the number of occurrences, and the results could be used for the next word guess.
There were several issues with this approach:
hash was used instead.The algorithm is split into 2 major parts.
I. Create cache. It is divided into 4 major steps:
II. Use cache in the app. The Shiny app reads the cached input at startup which takes a few seconds only.
The user should write something into the left-hand side input field.
Ideas for developing further: consider more words, other languages, better GUI.