Zoey Le
September 28, 2018
Coursera Capstone Project
Large databases of text in a target language are commonly used to build language models for various purposes. In this project, we explore the major features of the text data provided for the Coursera Data Science Capstone, offered through Johns Hopkins University. The project is sponsored by SwiftKey.
The ultimate purpose of this project is to build a Shiny app that suggests possible words as users type arbitrary sentences.
Details can be found in these links:
The algorithm follows the steps below:
However, because the computers' RAM is limited, the prediction model is constrained by the amount of training data it can process. Hence, the results are not as accurate as expected.
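One common way to work within a RAM limit like the one noted above is to train on a random sample of the corpus rather than the full text. The sketch below is only an illustration of that idea, not the method used in this project: it uses reservoir sampling to draw a fixed number of lines from a stream while holding at most that many lines in memory. It is written in Python for brevity even though the app itself targets Shiny, and the function name `sample_lines`, the parameter `k`, and the toy corpus are all hypothetical.

```python
import random
from typing import Iterable, List


def sample_lines(lines: Iterable[str], k: int, seed: int = 0) -> List[str]:
    """Return up to k lines chosen uniformly at random from a stream.

    Reservoir sampling: at most k lines are held in memory at any time,
    so the input can be far larger than available RAM.
    """
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    reservoir: List[str] = []
    for i, line in enumerate(lines):
        if i < k:
            # Fill the reservoir with the first k lines.
            reservoir.append(line)
        else:
            # Replace a stored line with probability k / (i + 1).
            j = rng.randint(0, i)  # randint is inclusive on both ends
            if j < k:
                reservoir[j] = line
    return reservoir


# Hypothetical stand-in for a large text file read line by line.
corpus = (f"line {n}" for n in range(1000))
subset = sample_lines(corpus, k=10)
print(len(subset))
```

In practice the generator would be replaced by an open file handle, so the full corpus never has to be loaded at once. The trade-off is the one described above: a smaller sample fits in memory but gives the model less evidence, which lowers accuracy.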