Elijah Appiah
2022-01-08
This project involves Natural Language Processing. The critical task is to take a user's input phrase (group of words) and to output a predicted next word.
Project deliverables:
The next word prediction model uses the principles of “tidy data” applied to text mining in R. Key model steps:
Benefits: easy to read code; uses “pipes”; fast processing of training data; able to sample up to 25% of original corpus; relatively small output repos
The next word prediction app provides a simple user interface to the next word prediction model.
Key Features:
Key Benefits:
Tidy Data
“http://vita.had.co.nz/papers/tidy-data.html”
Text Mining with R: A Tidy Approach
“http://tidytextmining.com/index.html”
Shiny App
“https://mblackmo.shinyapps.io/ngram_match/”
Shiny App Source Code repository on Github
“https://github.com/mark-blackmore/JHU-Data-Science-Capstone/tree/master/ngram_match”
Data Specialization Capstone repository on Github
“https://github.com/mark-blackmore/JHU-Data-Science-Capstone”