Marco Adamo
06-08-2020
The data used for this project is a collection of text aggregated by a web crawler from publicly available Twitter posts, blogs, and news sites. Only the English dataset has been used in this example.
The dataset is downloadable here: https://d396qusza40orc.cloudfront.net/dsscapstone/dataset/Coursera-SwiftKey.zip
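As a minimal sketch of how the English subset could be pulled out of the downloaded archive, the Python snippet below reads every English text file from a zip. The function name `read_english_lines`, the `max_lines` parameter, and the assumption that English files carry `en_US` in their path are illustrative choices, not something stated in this document, so adjust them to the actual archive layout.

```python
import io
import zipfile


def read_english_lines(archive, max_lines=None):
    """Return {member_name: [lines]} for each English .txt file in a zip.

    Assumption: English files have "en_US" somewhere in their path.
    `archive` may be a file path or a binary file-like object.
    """
    result = {}
    with zipfile.ZipFile(archive) as zf:
        for name in zf.namelist():
            # Skip other languages and non-text members.
            if "en_US" not in name or not name.endswith(".txt"):
                continue
            lines = []
            with zf.open(name) as raw:
                # Decode as UTF-8, replacing bytes that cannot be decoded.
                text = io.TextIOWrapper(raw, encoding="utf-8", errors="replace")
                for i, line in enumerate(text):
                    if max_lines is not None and i >= max_lines:
                        break
                    lines.append(line.rstrip("\n"))
            result[name] = lines
    return result
```

Once the archive has been saved locally (for example as `Coursera-SwiftKey.zip`, a hypothetical file name), calling `read_english_lines("Coursera-SwiftKey.zip", max_lines=1000)` would return the first 1000 lines of each English file, which keeps memory use low while exploring the data.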
The following steps were taken to prepare the dataset:
The algorithm works as follows:
Work instruction: