The app preprocesses your phrase before looking it up in the n-gram models:
- The start of the phrase is marked with the token [START].
- Numbers and times are replaced with the tokens [NUMBER] and [HOUR], so their position in the phrase is preserved.
- Dashes and apostrophes that are part of words (e.g. don't or e-mail) are retained, but other special characters are removed.
- Text is converted to lower case (so “The” and “the” are treated as the same word).
- N-grams with low predictive value are removed, and other minor transformations are applied to improve accuracy.
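
For the curious, the first four steps above might look roughly like the Python sketch below. This is only an illustration, not the app's actual code: the function name `preprocess`, the two regular expressions, and the rule that a "time" is digits, a colon, then two digits are all assumptions. The removal of low-value n-grams and the other minor transformations are not shown.

```python
import re

# Assumed patterns; the app's real rules for numbers and times may differ.
HOUR_RE = re.compile(r"\d{1,2}:\d{2}")          # e.g. 5:30 or 17:45
NUMBER_RE = re.compile(r"\d+(?:[.,]\d+)*")      # e.g. 3, 1,000 or 2.5


def preprocess(phrase):
    """Return the phrase as a list of tokens, starting with [START]."""
    # Lower-case first, so the upper-case placeholder tokens are left alone.
    text = phrase.lower().replace("\u2019", "'")  # treat curly apostrophes as straight

    # Replace times before numbers, since a time also contains digits.
    text = HOUR_RE.sub(" [HOUR] ", text)
    text = NUMBER_RE.sub(" [NUMBER] ", text)

    tokens = ["[START]"]
    for raw in text.split():
        if raw in ("[HOUR]", "[NUMBER]"):
            tokens.append(raw)
            continue
        # Keep letters, apostrophes, and dashes; drop other special characters.
        word = re.sub(r"[^a-z'-]", "", raw)
        # Apostrophes or dashes at the edge of a word are not part of it.
        word = word.strip("'-")
        if word:
            tokens.append(word)
    return tokens


print(preprocess("Don't e-mail me at 5:30, call 3 times!"))
```

Note that the placeholders are inserted after lower-casing, which is why [HOUR] and [NUMBER] stay in upper case while the rest of the phrase does not.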
Thank you for using my word prediction app!