Coursera Data Science Capstone - Final Project

Maxime Verges

This presentation is part of the final assignment of the online course Data Science Capstone (https://www.coursera.org/learn/data-science-project).
The purpose of these slides is to provide key information about the final project.
As required, it was generated using RStudio (https://www.rstudio.com) and Slidify (http://slidify.org).

The main challenge is to create an application that predicts the next word from a sequence of words entered by the user.
We need three datasets, bigram.RData, trigram.RData and quadgram.RData, which can be obtained from the milestone report (http://rpubs.com/maximeverges/495853). They contain, respectively, sequences of 2, 3 and 4 words.
The source code of the application and of this presentation can be found at https://github.com/maximeverges/Data-Science-Capstone. Five files are included: ui.R, server.R, bigram.RData, trigram.RData and quadgram.RData.
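
To give a rough idea of what such n-gram datasets hold, here is a minimal base R sketch that counts word sequences in a toy sentence. The helper `make_ngrams` and the sample text are hypothetical and only illustrate the concept; the milestone report may build its datasets differently.

```r
# Hypothetical helper: split a text into overlapping sequences of n words.
make_ngrams <- function(text, n) {
  # Lowercase, then split on anything that is not a letter or apostrophe
  words <- strsplit(tolower(text), "[^a-z']+")[[1]]
  words <- words[nzchar(words)]
  if (length(words) < n) return(character(0))
  starts <- seq_len(length(words) - n + 1)
  vapply(starts,
         function(i) paste(words[i:(i + n - 1)], collapse = " "),
         character(1))
}

# Toy corpus (an assumption, not the course data)
sample_text <- "the cat sat on the mat and the cat slept"

# Frequency of each 2-word sequence, most frequent first
bigram_counts <- sort(table(make_ngrams(sample_text, 2)), decreasing = TRUE)
print(head(bigram_counts, 3))
```

The same call with `n = 3` or `n = 4` would give 3-word and 4-word sequences.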

The prediction model for the next word is based on a back-off method, detailed below:
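
As an illustration of the general back-off idea (not necessarily the exact logic in server.R), the sketch below first looks for a match on the last three words in the 4-word table, then falls back to the last two words in the 3-word table, and finally to the last word in the 2-word table. The toy data frames, their column names (`w1`, `w2`, `w3`, `nxt`, `freq`) and the functions `lookup` and `predict_next` are all assumptions made for this example; the real .RData files may be laid out differently.

```r
# Hypothetical n-gram tables; the real bigram.RData, trigram.RData and
# quadgram.RData may use other column names and structures.
bigram <- data.frame(w1 = c("the", "the", "of"),
                     nxt = c("end", "cat", "the"),
                     freq = c(5, 3, 9), stringsAsFactors = FALSE)
trigram <- data.frame(w1 = c("end", "one"), w2 = c("of", "of"),
                      nxt = c("the", "them"), freq = c(4, 2),
                      stringsAsFactors = FALSE)
quadgram <- data.frame(w1 = "the", w2 = "end", w3 = "of",
                       nxt = "days", freq = 2, stringsAsFactors = FALSE)

# Most frequent continuation of `context` in `ngrams`, or NA if none.
lookup <- function(ngrams, context) {
  keep <- rep(TRUE, nrow(ngrams))
  for (i in seq_along(context)) {
    keep <- keep & ngrams[[paste0("w", i)]] == context[i]
  }
  hits <- ngrams[keep, , drop = FALSE]
  if (nrow(hits) == 0) return(NA_character_)
  hits$nxt[which.max(hits$freq)]
}

# Back off from the longest context (3 words) to the shortest (1 word).
predict_next <- function(input) {
  words <- strsplit(tolower(trimws(input)), "\\s+")[[1]]
  words <- words[nzchar(words)]
  for (ngrams in list(quadgram, trigram, bigram)) {
    k <- ncol(ngrams) - 2  # context words = all columns except nxt, freq
    if (length(words) >= k) {
      guess <- lookup(ngrams, tail(words, k))
      if (!is.na(guess)) return(guess)
    }
  }
  NA_character_  # no table contains a matching context
}

predict_next("at the end of")  # matched in the 4-word table
predict_next("kind of")        # falls back to the 2-word table
```

Returning `NA` when nothing matches is one possible design choice; an application could instead fall back to the most frequent word overall.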

A basic tutorial is shown below:

Tutorial

In addition, the help tab explains how the application works, while the about tab provides useful information such as links.