-
The Next Word Prediction application uses Simple Good-Turing (SGT) estimator, devised by late William A. Gale and Geoffrey Sampson[1] in 1995.
-
SGT estimator deals with frequencies of frequencies of events and designed to smooth a probability distribution in such a way that it accounts reasonably for events that have not occurred.
-
This technique was chosen for the project because it is straightforward and not as much complex and computationally extensive.
- Algorithm tab panel of the application describes the method in detail.
[1] Good-Turing Frequency Estimation Without Tears (JOURNAL OF QUANTITATIVE LINGUISTICS, vol. 2, pp. 217-37 -- reprinted in Geoffrey Sampson, EMPIRICAL LINGUISTICS, Continuum, 2001).
Website