Introduction

This report analyzes the HC Corpora dataset.

Data

The datasets used are:

- Blogs
- News
- Twitter

Exploratory Analysis

The data was explored to understand text patterns and word frequencies.
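
As an illustration of the word-frequency step, the following is a minimal sketch in Python and not the code used for this report. The tokenization rule (lowercase, letters and apostrophes only), the function names, and the two sample lines are all assumptions made for the example; in practice the lines would come from the blogs, news, and Twitter texts.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return its word tokens (letters and apostrophes)."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(lines):
    """Count how often each token occurs across an iterable of text lines."""
    counts = Counter()
    for line in lines:
        counts.update(tokenize(line))
    return counts


# Hypothetical sample lines standing in for lines read from the corpus.
sample_lines = [
    "The cat sat on the mat.",
    "The dog sat on the log.",
]

freqs = word_frequencies(sample_lines)
top_words = freqs.most_common(3)  # the three most frequent tokens with counts
print(top_words)
```

Counting line by line keeps memory use proportional to the vocabulary instead of the corpus size, which matters for large text files.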

Conclusion

This analysis will be used to build a next-word prediction model.
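
One simple way such a model could work is to count which word most often follows each word (a bigram model). The sketch below assumes that approach purely for illustration; the report does not state which modeling technique will be used, and the function names and training lines here are hypothetical.

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and return its word tokens (letters and apostrophes)."""
    return re.findall(r"[a-z']+", text.lower())


def build_bigram_model(lines):
    """Map each word to a Counter of the words observed immediately after it."""
    model = defaultdict(Counter)
    for line in lines:
        tokens = tokenize(line)
        for prev, nxt in zip(tokens, tokens[1:]):
            model[prev][nxt] += 1
    return model


def predict_next(model, word, k=3):
    """Return up to k of the most frequent followers of `word`, or [] if unseen."""
    followers = model.get(word.lower())
    if not followers:
        return []
    return [w for w, _ in followers.most_common(k)]


# Hypothetical training lines standing in for corpus text.
training_lines = [
    "thanks for the follow",
    "thanks for the help",
    "thanks for the follow back",
]

model = build_bigram_model(training_lines)
suggestions = predict_next(model, "the")
print(suggestions)
```

A bigram model only looks one word back; longer contexts (trigrams and beyond) with a fallback to shorter ones are a common extension when more data is available.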