Problem Statement

  • Predict the next word given a phrase
  • Useful for text input assistance and NLP systems

Data & Preprocessing

  • Sample English text data
  • Converted to lowercase
  • Removed punctuation
  • Tokenized into word pairs (bigrams)

Prediction Algorithm

  • Bigram language model
  • Uses frequency-based next-word prediction
  • Simple and efficient approach

Shiny Application

  • User enters a phrase
  • Clicks Predict
  • App outputs one predicted next word

Conclusion

  • Fast and easy to use
  • Demonstrates core NLP concepts
  • Can be extended using larger datasets