## Package version: 2.1.2
## Parallel computing: 2 of 8 threads used.
## See https://quanteda.io for tutorials and examples.
## 
## Attaching package: 'quanteda'
## The following object is masked from 'package:utils':
## 
##     View
## Loading required package: usethis
## 
## Attaching package: 'quanteda.textmodels'
## The following object is masked from 'package:quanteda':
## 
##     data_dfm_lbgexample
## 
## Attaching package: 'seededlda'
## The following object is masked from 'package:stats':
## 
##     terms
## 
## Attaching package: 'rsconnect'
## The following object is masked from 'package:devtools':
## 
##     lint
## 
## Attaching package: 'packrat'
## The following objects are masked from 'package:devtools':
## 
##     install, install_local

#read PDF files with readtext

## readtext object consisting of 19 documents and 1 docvar.
## # Description: df[,3] [19 x 3]
##   doc_id        text                docvar1  
##   <chr>         <chr>               <chr>    
## 1 Padania1.pdf  "\" ‘When I’m\"..." Padania1 
## 2 Padania10.pdf "\"          \"..." Padania10
## 3 Padania11.pdf "\"          \"..." Padania11
## 4 Padania12.pdf "\" Northern \"..." Padania12
## 5 Padania13.pdf "\"     Popul\"..." Padania13
## 6 Padania14.pdf "\"          \"..." Padania14
## # ... with 13 more rows

#create corpus

## [1] 19
##     docvar1
## 1  Padania1
## 2 Padania10
## 3 Padania11
## 4 Padania12
## 5 Padania13
## 6 Padania14
## Corpus consisting of 19 documents, showing 19 documents:
## 
##           Text Types Tokens Sentences   docvar1
##   Padania1.pdf  1593   5007       171  Padania1
##  Padania10.pdf   386    884        31 Padania10
##  Padania11.pdf   225    367        15 Padania11
##  Padania12.pdf   445    923        28 Padania12
##  Padania13.pdf   503   1093        36 Padania13
##  Padania14.pdf   296    550        18 Padania14
##  Padania15.pdf   564   1262        49 Padania15
##  Padania16.pdf   240    473        14 Padania16
##  Padania17.pdf   580   1340        52 Padania17
##  Padania18.pdf   580   1340        52 Padania18
##  Padania19.pdf   466   1031        37 Padania19
##   Padania2.pdf   166    275        12  Padania2
##   Padania3.pdf   368    836        23  Padania3
##   Padania4.pdf   368    836        23  Padania4
##   Padania5.pdf   353    776        24  Padania5
##   Padania6.pdf   444   1134        37  Padania6
##   Padania7.pdf   208    429        16  Padania7
##   Padania8.pdf   292    557        18  Padania8
##   Padania9.pdf   225    377        11  Padania9
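
The chunk that built this corpus is not shown. A minimal sketch of the same steps, assuming quanteda is installed and using two invented stand-in texts instead of the 19 PDFs (the object names `txt` and `corp` are illustrative, not taken from the original analysis):

```r
library(quanteda)

# Invented stand-in texts. With real files, the text would typically be read
# first with readtext::readtext() and the result passed to corpus().
txt <- c(
  Padania1 = "The League wants more autonomy for the northern regions.",
  Padania2 = "Voters in Lombardy backed the referendum. Turnout was high."
)

corp <- corpus(txt)                        # document names come from names(txt)
docvars(corp, "docvar1") <- names(txt)     # attach a document-level variable

ndoc(corp)            # number of documents
head(docvars(corp))   # first rows of the document variables
summary(corp)         # types, tokens and sentences per document
```

`summary()` on a corpus prints one row per document, which is the shape of the table above.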

#create dfm

## Length  Class   Mode 
##  48127    dfm     S4
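
If `Length` here is the number of cells in the matrix, then 48127 = 19 × 2533, meaning 19 documents by 2,533 distinct features. The sketch below shows that relationship on a two-document toy example. It assumes quanteda is installed, and `txt` and `dfmat` are illustrative names:

```r
library(quanteda)

txt <- c(d1 = "The League won the vote.", d2 = "The vote was close.")

# Tokenize first, then count features per document.
# dfm() lower-cases features by default.
dfmat <- dfm(tokens(txt))

ndoc(dfmat)        # rows: documents
nfeat(dfmat)       # columns: distinct features
prod(dim(dfmat))   # cells = documents x features
```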

#Cleaning up using tokens

##               Length Class  Mode     
## Padania1.pdf  2311   -none- character
## Padania10.pdf  431   -none- character
## Padania11.pdf  188   -none- character
## Padania12.pdf  476   -none- character
## Padania13.pdf  561   -none- character
## Padania14.pdf  322   -none- character
## Padania15.pdf  624   -none- character
## Padania16.pdf  234   -none- character
## Padania17.pdf  680   -none- character
## Padania18.pdf  680   -none- character
## Padania19.pdf  513   -none- character
## Padania2.pdf   152   -none- character
## Padania3.pdf   422   -none- character
## Padania4.pdf   422   -none- character
## Padania5.pdf   387   -none- character
## Padania6.pdf   555   -none- character
## Padania7.pdf   214   -none- character
## Padania8.pdf   284   -none- character
## Padania9.pdf   200   -none- character
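
The keyword-in-context output further down contains stemmed forms such as "econom" and "alway", so the cleaning step presumably included stemming. A hedged sketch of a typical cleaning pipeline follows. It assumes quanteda is installed, and the exact options used in the original analysis are unknown:

```r
library(quanteda)

txt <- c(d1 = "The government warns about budget choices and economic decisions!")

toks <- tokens(txt, remove_punct = TRUE, remove_numbers = TRUE)  # drop punctuation and numbers
toks <- tokens_tolower(toks)                                      # normalise case
toks <- tokens_remove(toks, pattern = stopwords("en"))            # drop English stopwords
toks <- tokens_wordstem(toks)                                     # reduce words to stems

ntoken(toks)    # tokens remaining per document
as.list(toks)   # inspect the cleaned tokens
```

Each step returns a new tokens object, so the order can be changed or individual steps left out.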

#kwic() does not accept a dfm, so it has to be run on the tokens object

| docname | from | to | pre | keyword | post | pattern |
|---|---|---|---|---|---|---|
| Padania9.pdf | 6 | 6 | warns govt budget | choices | ansa english media | choice* |
| Padania9.pdf | 142 | 142 | warns govt budget | choices | rome alway seen | choice* |
| Padania1.pdf | 1619 | 1619 | matter term econom | decision-mak | financ european affair | decision* |
| Padania15.pdf | 225 | 225 | new state area | occupi | forgotten kingdom lottingen | occupi* |
| Padania1.pdf | 408 | 408 | whole berlusconi increas | discredit | gone salvini won | discr* |
| Padania3.pdf | 105 | 105 | auster polici parti | discredit | last year corrupt | discr* |
| Padania4.pdf | 105 | 105 | auster polici parti | discredit | last year corrupt | discr* |
| Padania1.pdf | 791 | 791 | italian whole nation | oppressor | intrud leagu abandon | oppress* |
| Padania17.pdf | 366 | 366 | sardinian stand opposit | oppress | obsess pro-migr eu | oppress* |
| Padania18.pdf | 366 | 366 | sardinian stand opposit | oppress | obsess pro-migr eu | oppress* |
##    [Padania13.pdf, 79:80]  fastest-growing powerful |     political party     | Italy FULL TEXT
##  [Padania13.pdf, 184:185]  fastest-growing powerful |     political party     | Italy tapped
##  [Padania19.pdf, 531:532]               since Oct 1 | independence referendum | Lombardy's boss
##   [Padania5.pdf, 748:749]    allowing Scotland hold | independence referendum | autumn 2014
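
The searches above combine glob patterns such as `choice*` with multi-word phrases. A minimal sketch of both kinds of call, assuming quanteda is installed and using invented sentences (the three-token window is inferred from the results, not confirmed by the source):

```r
library(quanteda)

txt <- c(
  d1 = "The fastest-growing political party made hard budget choices.",
  d2 = "An independence referendum was one choice among many."
)
toks <- tokens(txt, remove_punct = TRUE)

# Glob pattern: "choice*" matches "choice", "choices", and so on.
kw_glob <- kwic(toks, pattern = "choice*", window = 3)

# Multi-word pattern: phrase() makes kwic() match a sequence of tokens.
kw_phrase <- kwic(
  toks,
  pattern = phrase(c("political party", "independence referendum")),
  window = 3
)

# A dfm no longer stores word order, so kwic() works on the tokens object.
kw_glob
kw_phrase
```

A pattern with no matches returns a result with zero rows, which prints as a header only.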

#multi-word expressions
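
One common way to handle multi-word expressions in quanteda is to join them into single tokens before building the dfm. The sketch below assumes that approach and that quanteda is installed. The phrases are taken from the keyword-in-context output, and everything else is illustrative:

```r
library(quanteda)

toks <- tokens("The political party called an independence referendum")

# Join each listed phrase into one token (default concatenator is "_").
toks_comp <- tokens_compound(
  toks,
  pattern = phrase(c("political party", "independence referendum"))
)

as.list(toks_comp)[[1]]
```

After compounding, `political_party` is counted as one feature instead of two separate words.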

#create dfm from first tokenized steps

#word cloud

#co-occurrence

##      leagu     region         eu      parti berlusconi       like     nation 
##      27121      18753      18433      17373      15384      14782      14445 
##      state       vote       call 
##      13341      12892      12391
##  [1] "leagu"      "region"     "eu"         "parti"      "berlusconi"
##  [6] "like"       "nation"     "state"      "vote"       "call"      
## [11] "media"      "lega"       "end"        "ministri"   "said"      
## [16] "one"        "renzi"      "differ"     "facebook"   "morisi"    
## [21] "say"        "union"      "italian"    "european"   "northern"
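
The two outputs above look like the top 10 features of a feature co-occurrence matrix, followed by the names of the top 25. A hedged sketch of how such a result is commonly produced, assuming quanteda is installed and document-level co-occurrence (the context setting of the original analysis is not shown):

```r
library(quanteda)

toks <- tokens(c(
  d1 = "league region vote",
  d2 = "league region",
  d3 = "league vote"
))

# Count how often features occur together in the same document.
fcmat <- fcm(toks, context = "document")

topfeatures(fcmat, 2)                          # most frequently co-occurring features
feat <- names(topfeatures(fcmat, 2))           # keep only their names
fcm_sub <- fcm_select(fcmat, pattern = feat)   # reduce the matrix to those features

dim(fcm_sub)
```

The reduced matrix is the usual input for a co-occurrence network plot.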