The process is as follows:
N-gram models are widely used in statistical natural language processing. Below are some papers and lectures you can check out if you are interested in this topic.
Wikipedia Source: https://en.wikipedia.org/wiki/N-gram#n-gram_models
Michael Collins’s Notes on N-gram Language Models: http://www.cs.columbia.edu/~mcollins/courses/nlp2011/notes/lm.pdf
N-gram Data Based on the Corpus of Contemporary American English (COCA): http://www.ngrams.info/