Predictive Text Generator

20/04/2020

Overview

The objective of the project is to generate predictive text based on the text available in several datasets.
We accept an input of a string and give three possible suggestions for the next word.

Algorithms

We use the n-gram model with Kneser-Ney smoothing and back-off.
We retrieve the last n-1 letters of the input and look to find a match in the list of n-grams.
If we find a match with sufficient probability, then we return the last word of the n-gram.
However if we don’t, we look out in the (n-1)-gram and so on.

Sample text

“Every day is a new”: day/one/York
“I love”: you/it/them
“What are you”: doing/having/eating
“You are an”: amazing/excellent/awesome

Conclusion

You can find the web app here.
Here is the documentation.
Thank you for reading!