The purpose of this project is to answer the following question:
Initial exploration shows clear differences in most frequent word stems between 1-star and 5-star ratings (1-star on left, 5-star on right):
Prediction performance at a glance using test data: