Arranging words into a network is a common way of visualizing all the relationships among words simultaneously, rather than just the top few at a time. Pairs of consecutive words might capture structure that isn’t present when one is just counting single words, and may provide context that makes tokens more understandable (for example, “pulteney street”, in Northanger Abbey, is more informative than “pulteney”).

The graph below shows a network of bigrams (one word immediately followed by another) in six major books of Jane Austen. The arrow represents the order of bigrams and the thickness the frequency of the bigrams. It shows those that occurred more than 20 times and where neither word was a stop-word. The analysis can be extended to n-grams (e.g., trigrams).

Interpretation