NWPredictor

2026-02-20

The model

Built on four orders of n-grams (unigrams through tetragrams)
Markov assumption: the probability of the next event depends only on the immediately preceding event(s), not on the entire event history
Backoff strategy: tetragrams → trigrams → bigrams → unigrams
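
As a rough illustration of the Markov assumption for the highest order used here (tetragrams), the next word is conditioned on the three preceding words only. The symbols below are assumptions introduced for this sketch: $w_i$ is the word at position $i$ and $C(\cdot)$ is a count in the training corpus. The count-ratio estimate is the usual maximum-likelihood form and is not necessarily the exact scoring this model uses.

```latex
P(w_i \mid w_1, \dots, w_{i-1}) \approx P(w_i \mid w_{i-3}, w_{i-2}, w_{i-1})
\qquad
\hat{P}(w_i \mid w_{i-3}, w_{i-2}, w_{i-1})
  = \frac{C(w_{i-3}\, w_{i-2}\, w_{i-1}\, w_i)}{C(w_{i-3}\, w_{i-2}\, w_{i-1})}
```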

The model predicts the next word with a 4-gram backoff strategy.

1) Try the tetragram (4-gram): if not found, then

↓

2) back off to the trigram (3-gram): if not found, then

↓

3) back off to the bigram (2-gram): if not found, then

↓

4) back off to the unigram (1-gram): if not found, then

↓

5) fall back to the most common words
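
The steps above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the actual NWPredictor implementation: the names `build_tables` and `predict_next`, the toy corpus, and the `fallback` word list are all hypothetical. In this sketch the unigram step uses an empty context, so it already returns the most frequent word; the final fallback list is only reached when the tables are empty.

```python
from collections import Counter, defaultdict

def build_tables(tokens, max_order=4):
    """For each order n, map a context of n-1 words to a Counter of next words."""
    tables = {n: defaultdict(Counter) for n in range(1, max_order + 1)}
    for n in range(1, max_order + 1):
        for i in range(len(tokens) - n + 1):
            context = tuple(tokens[i:i + n - 1])   # empty tuple for unigrams
            next_word = tokens[i + n - 1]
            tables[n][context][next_word] += 1
    return tables

def predict_next(tables, history, max_order=4, fallback=("the",)):
    """Return (predicted word, order used); order 0 means the fallback list."""
    for n in range(max_order, 0, -1):              # 4-gram, 3-gram, 2-gram, 1-gram
        k = n - 1                                  # context length for this order
        if len(history) < k:
            continue                               # not enough history for this order
        context = tuple(history[len(history) - k:])
        candidates = tables[n].get(context)        # .get avoids creating empty entries
        if candidates:
            return candidates.most_common(1)[0][0], n
    return fallback[0], 0                          # hypothetical list of common words

# Toy corpus, invented purely for illustration.
tokens = "the cat sat on the mat and the cat sat on the rug".split()
tables = build_tables(tokens)

print(predict_next(tables, ["cat", "sat", "on"]))  # tetragram context is found
print(predict_next(tables, ["dog", "sat", "on"]))  # backs off to the trigram
print(predict_next(tables, ["x", "y", "z"]))       # backs off to the unigram
```

Returning the order alongside the word makes it easy to see how far the predictor had to back off for a given input.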