This application predicts the next word using a backoff n-gram language model trained on English text from blogs, news articles, and Twitter.
The goal is to demonstrate a data product that applies natural language processing to make predictions in real time and is deployed via Shiny.
We built a backoff n-gram model:
Data source: en_US.blogs.txt, en_US.news.txt, en_US.twitter.txt
Text processing: tidytext
Try it out: https://sathvikreddy.shinyapps.io/capstoneshinyapp/
The model is loaded from .rds files for efficiency and uses minimal memory when deployed.
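For readers unfamiliar with backoff, the lookup idea can be sketched in a few lines. This is an illustrative Python sketch, not the app's R code: the toy corpus, the function names (tokenize, build_tables, predict_next), and the maximum order of 3 are all assumptions made for the example, and it uses plain frequency backoff with no discounting or smoothing.

```python
from collections import Counter, defaultdict

def tokenize(text):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return text.lower().split()

def build_tables(sentences, max_order=3):
    """Map each context tuple of length 0..max_order-1 to a Counter of next words."""
    tables = defaultdict(Counter)
    for sentence in sentences:
        tokens = tokenize(sentence)
        for i, word in enumerate(tokens):
            for n in range(max_order):  # n is the context length
                if i - n < 0:
                    break  # not enough preceding words for this context length
                context = tuple(tokens[i - n:i])
                tables[context][word] += 1
    return tables

def predict_next(tables, text, max_order=3):
    """Try the longest context first, then back off to shorter ones."""
    tokens = tokenize(text)
    for n in range(max_order - 1, -1, -1):
        if n > len(tokens):
            continue  # input is shorter than this context length
        context = tuple(tokens[len(tokens) - n:])
        if context in tables:
            return tables[context].most_common(1)[0][0]
    return None  # only reached when the tables are empty

# Hypothetical toy corpus standing in for the blog, news, and Twitter text.
corpus = [
    "thanks for the follow",
    "thanks for the follow",
    "thanks for the help",
    "see you at the game",
    "the match starts soon",
]
tables = build_tables(corpus)

print(predict_next(tables, "Thanks for the"))  # trigram context is found
print(predict_next(tables, "over the"))        # backs off to the bigram context
print(predict_next(tables, "zzz"))             # backs off to unigram counts
```

In this sketch the first call is answered from the two-word context, the second falls back to the one-word context because "over the" was never seen, and the third falls back to the most frequent single word. Precomputing tables like these once and loading them at startup is what keeps prediction fast at request time.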
The app is designed to be:
Efficient: Memory-friendly and fast
Accurate: Uses real-world English sources
Scalable: Can expand to larger models or deep learning
Deployed: Fully functional on shinyapps.io
Ready to integrate into real-world applications
This app could be adapted to improve keyboard prediction, chatbots, or voice assistants.
Would you hire me?
With this app, I’ve shown I can analyze large text datasets, build predictive models, and deploy data products effectively.