- This project is part of the tenth and final course of the Coursera Data Science Specialization. The project focuses on the analysis of several huge files with text, to analyze their structure and on this analysis create a model to predict the next word written by a user.
- Contents
- Text data analysis: analysis of the corpus to understand the relationship of words and word pairs
- Predictive modeling: build basice n-gram models and develop algorithms to facilitate text prediction
- Shiny app development: produce a web-based Shiny app to predict next words