Goal: Build a model that can predict the next word from a given phrase using real-world language data.
Data Source: - SwiftKey Dataset - Text from blogs, news, and Twitter
Method: - Tokenize text into Bigrams - Clean and normalize input - Match user input to known patterns
Prediction Logic:
```r predict_next_word <- function(input, ngram_data_list) { # match user phrase to bigrams }
👉 Hosted on shinyapps.io
Performance: - Predicts common word pairs well - Example: - Input: "hello" → Output: "world"
Next Improvements: - Add Trigram & Quadgram support - Implement Katz Backoff or Stupid Backoff - Allow mobile deployment