Data Science Capstone

QinQin0912
Jun.04,2016

Description of the Alogorithm used to make the prediction

The goal of this Shiny App is to create a product to highlight the prediction algorithm that we have built and to provide an interface that can be accessed by others via a Shiny app.

N-Grams

An n-gram is a contiguous sequence of n items from a given sequence of text or speech. The items can be phonemes, syllables, letters, words or base pairs according to the application. The n-grams typically are collected from a text or speech corpus. An n-gram of size 1 is referred to as a “unigram”; size 2 is a “bigram” (or, less commonly, a “digram”); size 3 is a “trigram”. Larger sizes are sometimes referred to by the value of n, e.g., “four-gram”, “five-gram”, and so on.

Describe the App and how it functions.

  1. The users can get a sentense from twitter or facebook.
  2. Enter the sentense or copy the sentense to the text box input.
  3. The text box output will give the prediction of the next word.

Experience of using this app.

The app is deployed to shinyappsio via Rstudio.

It is a very handy,fast-responsive web app for word prediction.

Accuracy increases as N increases.

However, this app can still be further developed for machine learning predictive behaviour.

Welcome Testing!