This report analyzes the HC Corpora dataset.
The datasets used are:

- Blogs
- News
- Twitter

The data was explored to understand text patterns and word frequencies.
This analysis will be used to build a next-word prediction model.
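
As a rough illustration of how word frequencies can feed a next-word predictor, the sketch below counts words and adjacent word pairs (bigrams) and suggests the most frequent follower of a given word. It is a minimal sketch and not the report's actual method: the function names (`tokenize`, `word_frequencies`, `bigram_table`, `predict_next`) are hypothetical, the tokenizer is a simplifying assumption, and the two sample sentences are made up rather than taken from the HC Corpora files.

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and extract runs of letters and apostrophes.

    Assumption: a deliberately simple tokenizer, for illustration only.
    """
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(lines):
    """Count how often each word occurs across all lines."""
    counts = Counter()
    for line in lines:
        counts.update(tokenize(line))
    return counts


def bigram_table(lines):
    """Map each word to a Counter of the words that directly follow it."""
    table = defaultdict(Counter)
    for line in lines:
        tokens = tokenize(line)
        for prev, nxt in zip(tokens, tokens[1:]):
            table[prev][nxt] += 1
    return table


def predict_next(table, word):
    """Return the most frequent follower of `word`, or None if unseen."""
    followers = table.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]


# Made-up sample text standing in for lines read from a corpus file.
sample = [
    "the cat sat on the mat",
    "the cat ate the fish",
]
freqs = word_frequencies(sample)
table = bigram_table(sample)

print(freqs.most_common(2))
print(predict_next(table, "the"))
```

On real data, the same functions could be applied to lines read from the Blogs, News, and Twitter files, although a corpus of that size would likely call for sampling and for longer n-grams with some form of smoothing or backoff.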