Derek Corcoran
January 24, 2016
To analyze the data, the quanteda package was used. The main reasons to use this package were:
Below is an example of the frequencies of the most common five-grams, four-grams, and trigrams.
Five-gram | n | Four-gram | n | Trigram | n |
---|---|---|---|---|---|
at the end of the | 286 | the end of the | 717 | one of the | 2693 |
in the middle of the | 158 | at the end of | 550 | a lot of | 2509 |
for the first time in | 116 | the rest of the | 543 | to be a | 1471 |
the end of the day | 111 | for the first time | 484 | the end of | 1394 |
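
To illustrate how frequency counts like those in the table can be produced, here is a minimal sketch in plain Python. It is not the quanteda workflow used for this report: the function names `ngram_counts` and `top_ngrams`, the lowercasing, the naive whitespace tokenization, and the two sample sentences are all assumptions made for illustration only.

```python
from collections import Counter


def ngram_counts(texts, n):
    """Count n-grams (as space-joined strings) across an iterable of texts."""
    counts = Counter()
    for text in texts:
        # Assumption: lowercase and split on whitespace; a real tokenizer
        # would also handle punctuation, numbers, and so on.
        tokens = text.lower().split()
        # Slide a window of length n over the tokens of this text only,
        # so n-grams never span two different texts.
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return counts


def top_ngrams(texts, n, k=4):
    """Return the k most frequent n-grams as (term, count) pairs."""
    return ngram_counts(texts, n).most_common(k)


# Hypothetical sample data, not the corpus analyzed in this report.
sample = [
    "At the end of the day we went home",
    "It happened at the end of the week",
]

for n in (5, 4, 3):
    print(n, top_ngrams(sample, n, k=2))
```

Running the same counting for n = 5, 4, and 3 and placing the results side by side gives a table with the same layout as the one above.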