Coursera_Data_Science_Capstone_Final

Radu Popescu

22/12/2020

Introduction

This presentation is created as part of the requirement for the Coursera Data Science Capstone Course.

The goal of the project is to build a predictive text model combined with a shiny app UI that will predict the next word as the user types a sentence similar to the way most smart phone keyboards are implemented today using the technology of Swiftkey.

Shiny App: https://radupopescu.shinyapps.io/Coursera_Data_Science_Capstone_Final/

Github Repo: https://github.com/rppopescu/Coursera_Data_Science_Capstone_Final

Getting & Cleaning the Data

Before building the word prediction algorithm, data are first processed and cleaned as steps below:

Word Prediction Model

The prediction model for next word is based on the Katz Back-off algorithm. Explanation of the next word prediction flow is as below:

Shiny Application

A Shiny application was developed based on the next word prediction model described previously as shown below.