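By using data.table, an R package implemented in C with OpenMP multithreading (it does not depend on Rcpp), this application can combine a gzipped 4-gram, 3-gram, and Bayesian “multigram.” The app scales to the number of threads available on the system, which makes it faster than some canned text analysis packages. The timings below compare base R's read.csv with data.table's fread on the same file.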
system.time(dt <- read.csv("testcsv.csv")) # regular read.csv
##    user  system elapsed
##   1.485   0.056   1.544
library(data.table)
system.time(dt <- fread("testcsv.csv")) # data.table
##    user  system elapsed
##   0.707   0.022   0.105
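In the fread timing, user time is higher than elapsed time, which is what you would expect when the work is spread across several threads. As a rough sketch of how thread control and n-gram lookup might fit together, the snippet below uses data.table's getDTthreads() and setDTthreads() together with a keyed table. The 3-gram table, its column names (w1, w2, w3, n), and the temporary file are made up for illustration and are not the app's actual data. It writes plain CSV to stay self-contained; recent data.table versions can also read .gz files directly with fread when the R.utils package is installed.

```r
library(data.table)

# Number of threads data.table is currently set to use.
getDTthreads()

# Hypothetical 3-gram frequency table (illustrative data only).
trigrams <- data.table(
  w1 = c("of", "in", "a"),
  w2 = c("the", "the", "lot"),
  w3 = c("year", "world", "of"),
  n  = c(120L, 95L, 80L)
)

# Write it to a temporary CSV so fread has something to load.
path <- tempfile(fileext = ".csv")
fwrite(trigrams, path)

# Cap data.table at two threads for this session, then read the file back.
setDTthreads(2)
loaded <- fread(path)

# Key on the first two words so prefix lookups use binary search.
setkey(loaded, w1, w2)

# Look up the continuation(s) recorded for the prefix "of the".
loaded[.("of", "the")]
```

Keying on the leading words is one possible design choice: it keeps lookups fast as the table grows, at the cost of a one-time sort when the key is set.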