David Manero
September 28, 2015
Best Spanish Books Word Cloud
Coursera Developing Data Products
The objective of this project:
I intended to use the tm and wordcloud libraries to analyze Twitter hashtags, but the Twitter API cannot be accessed online without exposing my credentials, so I decided to use public documents instead: some of the best Spanish books ever written.
Loading the Libraries:
library(tm)
library(wordcloud)
Loading the book:
text <- readLines("../lazarillo.txt", encoding="UTF-8")
Generating the Corpus:
myCorpus <- Corpus(VectorSource(text))
Next, several transformations clean the corpus and prepare the data for the word cloud. The result is a data frame, dm, with one row per word and the columns word and freq.
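The slides do not list the transformations, so the following is only a sketch of how the data frame dm could be built with tm. The cleaning steps (lowercasing, removing punctuation, numbers and Spanish stop words), the helper names clean_corpus and word_frequencies, and the two sample lines standing in for the book are all assumptions for illustration, not the original code.

```r
library(tm)

# Hypothetical stand-in for the book text, so the sketch runs without lazarillo.txt.
sample_text <- c("La fortuna y la adversidad del Lazarillo",
                 "El ciego y el Lazarillo de Tormes")
sample_corpus <- Corpus(VectorSource(sample_text))

# Assumed cleaning steps; the original presentation may have used others.
clean_corpus <- function(corpus) {
  corpus <- tm_map(corpus, content_transformer(tolower))
  corpus <- tm_map(corpus, removePunctuation)
  corpus <- tm_map(corpus, removeNumbers)
  corpus <- tm_map(corpus, removeWords, stopwords("spanish"))
  corpus <- tm_map(corpus, stripWhitespace)
  corpus
}

# Count how often each term occurs across all documents, most frequent first.
word_frequencies <- function(corpus) {
  tdm <- TermDocumentMatrix(corpus)
  freq <- sort(rowSums(as.matrix(tdm)), decreasing = TRUE)
  data.frame(word = names(freq), freq = as.numeric(freq),
             stringsAsFactors = FALSE)
}

dm <- word_frequencies(clean_corpus(sample_corpus))
```

With the real book, the same two helpers would be applied to myCorpus instead of sample_corpus, giving the dm$word and dm$freq vectors passed to wordcloud().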
Generating the Word Cloud:
wordcloud(dm$word, dm$freq, random.order=FALSE, colors=brewer.pal(8, "Dark2"))
All the code and this presentation are available at:
Other Links: