The Word Prediction Application was created as the final Capstone Project of the Data Science Specialization offered by Johns Hopkins University on Coursera.
The data comes from a corpus called HC Corpora, which contains texts collected from blogs, Twitter, and news sources.
This project uses the English dataset, which consists of three files:
- en_US.blogs.txt
- en_US.twitter.txt
- en_US.news.txt
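
As a rough illustration of how the three files listed above might be read into memory, here is a minimal Python sketch. It is not the project's actual loading code: the function name `load_corpus`, the directory name `data`, and the choice of UTF-8 decoding with replacement of undecodable bytes are all assumptions made for this example.

```python
from pathlib import Path

# File names as listed above; where they live on disk is an assumption.
CORPUS_FILES = ["en_US.blogs.txt", "en_US.twitter.txt", "en_US.news.txt"]


def load_corpus(data_dir):
    """Return {file name: list of lines} for each corpus file found in data_dir.

    Hypothetical helper for illustration only.
    """
    corpus = {}
    for name in CORPUS_FILES:
        path = Path(data_dir) / name
        if not path.is_file():
            continue  # skip files that are not present
        # Assumed encoding; undecodable bytes are replaced rather than raising.
        with path.open(encoding="utf-8", errors="replace") as fh:
            corpus[name] = [line.rstrip("\n") for line in fh]
    return corpus


if __name__ == "__main__":
    # "data" is a placeholder directory name.
    for name, lines in load_corpus("data").items():
        print(name, len(lines))
```

Keeping each source in its own list makes it easy to sample blogs, tweets, and news separately before building a prediction model.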