## Package version: 2.1.2
## Parallel computing: 2 of 8 threads used.
## See https://quanteda.io for tutorials and examples.
##
## Attaching package: 'quanteda'
## The following object is masked from 'package:utils':
##
## View
## Loading required package: usethis
##
## Attaching package: 'quanteda.textmodels'
## The following object is masked from 'package:quanteda':
##
## data_dfm_lbgexample
##
## Attaching package: 'seededlda'
## The following object is masked from 'package:stats':
##
## terms
##
## Attaching package: 'rsconnect'
## The following object is masked from 'package:devtools':
##
## lint
##
## Attaching package: 'packrat'
## The following objects are masked from 'package:devtools':
##
## install, install_local
#read the PDF files with readtext
## readtext object consisting of 19 documents and 1 docvar.
## # Description: df[,3] [19 x 3]
## doc_id text docvar1
## <chr> <chr> <chr>
## 1 Padania1.pdf "\" ‘When I’m\"..." Padania1
## 2 Padania10.pdf "\" \"..." Padania10
## 3 Padania11.pdf "\" \"..." Padania11
## 4 Padania12.pdf "\" Northern \"..." Padania12
## 5 Padania13.pdf "\" Popul\"..." Padania13
## 6 Padania14.pdf "\" \"..." Padania14
## # ... with 13 more rows
#create corpus
## [1] 19
## docvar1
## 1 Padania1
## 2 Padania10
## 3 Padania11
## 4 Padania12
## 5 Padania13
## 6 Padania14
## Corpus consisting of 19 documents, showing 19 documents:
##
## Text Types Tokens Sentences docvar1
## Padania1.pdf 1593 5007 171 Padania1
## Padania10.pdf 386 884 31 Padania10
## Padania11.pdf 225 367 15 Padania11
## Padania12.pdf 445 923 28 Padania12
## Padania13.pdf 503 1093 36 Padania13
## Padania14.pdf 296 550 18 Padania14
## Padania15.pdf 564 1262 49 Padania15
## Padania16.pdf 240 473 14 Padania16
## Padania17.pdf 580 1340 52 Padania17
## Padania18.pdf 580 1340 52 Padania18
## Padania19.pdf 466 1031 37 Padania19
## Padania2.pdf 166 275 12 Padania2
## Padania3.pdf 368 836 23 Padania3
## Padania4.pdf 368 836 23 Padania4
## Padania5.pdf 353 776 24 Padania5
## Padania6.pdf 444 1134 37 Padania6
## Padania7.pdf 208 429 16 Padania7
## Padania8.pdf 292 557 18 Padania8
## Padania9.pdf 225 377 11 Padania9
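
The corpus step above can be sketched on toy data. This is a minimal sketch, not the original chunk: the two short texts and the `docvar1` values are stand-ins (assumptions) for the 19 PDFs, which are not available here.

```r
library(quanteda)

# Toy stand-in for the PDF texts: a named character vector
# (the names become the document ids).
txt <- c(doc1 = "The League won the regional vote. The party celebrated.",
         doc2 = "Voters in the region backed the independence referendum.")

corp <- corpus(txt)

# Attach a document-level variable, mirroring docvar1 in the output above.
docvars(corp, "docvar1") <- c("doc1", "doc2")

ndoc(corp)           # number of documents
head(docvars(corp))  # document-level variables
summary(corp)        # types, tokens and sentences per document
```

`summary()` on a corpus produces the Types / Tokens / Sentences table shown above, one row per document.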
#create dfm
## Length Class Mode
## 48127 dfm S4
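
The `Length` that `summary()` reports for a dfm is its number of cells, documents times features; 48127 is consistent with 19 documents and 2533 features. A minimal sketch on toy texts (assumed stand-ins, not the original data):

```r
library(quanteda)

# Toy texts standing in for the PDF corpus.
txt <- c(doc1 = "The League won the regional vote. The party celebrated.",
         doc2 = "Voters in the region backed the independence referendum.")

# Tokenize first, then build the document-feature matrix.
dfmat <- dfm(tokens(txt))

ndoc(dfmat)    # rows: documents
nfeat(dfmat)   # columns: features
length(dfmat)  # cells: ndoc * nfeat, the Length shown by summary()
```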
#Cleaning up using tokens
## Length Class Mode
## Padania1.pdf 2311 -none- character
## Padania10.pdf 431 -none- character
## Padania11.pdf 188 -none- character
## Padania12.pdf 476 -none- character
## Padania13.pdf 561 -none- character
## Padania14.pdf 322 -none- character
## Padania15.pdf 624 -none- character
## Padania16.pdf 234 -none- character
## Padania17.pdf 680 -none- character
## Padania18.pdf 680 -none- character
## Padania19.pdf 513 -none- character
## Padania2.pdf 152 -none- character
## Padania3.pdf 422 -none- character
## Padania4.pdf 422 -none- character
## Padania5.pdf 387 -none- character
## Padania6.pdf 555 -none- character
## Padania7.pdf 214 -none- character
## Padania8.pdf 284 -none- character
## Padania9.pdf 200 -none- character
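
One way to do this kind of clean-up is sketched below. The exact options used for the output above are not shown in the document, so the choices here (dropping punctuation and numbers, lowercasing, removing English stopwords, stemming) are assumptions, and the texts are toy stand-ins.

```r
library(quanteda)

# Toy texts standing in for the PDF corpus.
txt <- c(doc1 = "The League won the regional vote. The party celebrated.",
         doc2 = "Voters in the region backed the independence referendum.")

raw <- tokens(txt)  # unprocessed tokens, kept for comparison

toks <- tokens(txt, remove_punct = TRUE, remove_numbers = TRUE)
toks <- tokens_tolower(toks)
toks <- tokens_remove(toks, stopwords("en"))  # drop English stopwords
toks <- tokens_wordstem(toks)                 # reduce words to their stems

ntoken(raw)    # tokens per document before cleaning
ntoken(toks)   # tokens per document after cleaning
summary(toks)  # Length column = tokens remaining per document
```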
#kwic() does not accept a dfm, so it has to be run on a tokens object

| docname | from | to | pre | keyword | post | pattern |
|---|---|---|---|---|---|---|
| Padania9.pdf | 6 | 6 | warns govt budget | choices | ansa english media | `choice*` |
| Padania9.pdf | 142 | 142 | warns govt budget | choices | rome alway seen | `choice*` |
| Padania1.pdf | 1619 | 1619 | matter term econom | decision-mak | financ european affair | `decision*` |
| Padania15.pdf | 225 | 225 | new state area | occupi | forgotten kingdom lottingen | `occupi*` |
| Padania1.pdf | 408 | 408 | whole berlusconi increas | discredit | gone salvini won | `discr*` |
| Padania3.pdf | 105 | 105 | auster polici parti | discredit | last year corrupt | `discr*` |
| Padania4.pdf | 105 | 105 | auster polici parti | discredit | last year corrupt | `discr*` |
| Padania1.pdf | 791 | 791 | italian whole nation | oppressor | intrud leagu abandon | `oppress*` |
| Padania17.pdf | 366 | 366 | sardinian stand opposit | oppress | obsess pro-migr eu | `oppress*` |
| Padania18.pdf | 366 | 366 | sardinian stand opposit | oppress | obsess pro-migr eu | `oppress*` |
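
A keyword-in-context search with glob patterns and a three-word window, as in the results above, might look like the sketch below. The texts and the subset of patterns are assumptions chosen for illustration.

```r
library(quanteda)

# Toy texts standing in for the PDF corpus.
txt <- c(doc1 = "The government defended its budget choices after the decision.",
         doc2 = "Critics tried to discredit the party over its choice of allies.")

# kwic() works on tokens, not on a dfm.
toks <- tokens(txt, remove_punct = TRUE)

# Glob patterns ("*" matches any ending); window = words shown on each side.
kw <- kwic(toks, pattern = c("choice*", "decision*", "discr*"), window = 3)

kw_df <- as.data.frame(kw)  # columns include docname, from, to, pre, keyword, post, pattern
kw_df
```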
##
## [Padania13.pdf, 79:80] fastest-growing powerful | political party |
## [Padania13.pdf, 184:185] fastest-growing powerful | political party |
## [Padania19.pdf, 531:532] since Oct 1 | independence referendum |
## [Padania5.pdf, 748:749] allowing Scotland hold | independence referendum |
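
Multi-word matches such as "political party" need `phrase()`, otherwise each pattern is treated as a single token. A minimal sketch on an assumed toy text:

```r
library(quanteda)

# Toy text standing in for the PDF corpus.
txt <- c(doc1 = "The League is a political party. It backed the independence referendum.")

toks <- tokens(txt, remove_punct = TRUE)

# phrase() marks whitespace-separated patterns as token sequences.
kw_phrase <- kwic(toks,
                  pattern = phrase(c("political party", "independence referendum")),
                  window = 2)
kw_phrase
```

The `from:to` positions in the printed result span both tokens of each match, which is why the output above shows ranges such as 79:80.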
##
## Italy FULL TEXT
## Italy tapped
## Lombardy's boss
## autumn 2014
multiword
#create dfm from the tokens built in the first steps
#word cloud
#co-occurrence
## leagu region eu parti berlusconi like nation
## 27121 18753 18433 17373 15384 14782 14445
## state vote call
## 13341 12892 12391
## [1] "leagu" "region" "eu" "parti" "berlusconi"
## [6] "like" "nation" "state" "vote" "call"
## [11] "media" "lega" "end" "ministri" "said"
## [16] "one" "renzi" "differ" "facebook" "morisi"
## [21] "say" "union" "italian" "european" "northern"
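
The co-occurrence counts and top feature names above can be reproduced in outline as follows. This is a sketch on toy texts (assumptions); the original chunk may have used a different co-occurrence context or a different number of features.

```r
library(quanteda)

# Toy texts standing in for the cleaned, stemmed tokens.
txt <- c(doc1 = "league region vote party league region",
         doc2 = "party state vote league nation call",
         doc3 = "region eu state league party vote")

toks <- tokens(txt)

# Feature co-occurrence matrix; built from a dfm, it counts
# co-occurrence within the same document.
fcmat <- fcm(dfm(toks))

top  <- topfeatures(fcmat, 5)  # named vector of the largest totals
feat <- names(top)             # just the feature names

# Keep only the top features, e.g. before plotting a network.
fcmat_top <- fcm_select(fcmat, pattern = feat)
fcmat_top
```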
