10/30 in class activity

Wikipedia Analysis

Loading Packages:

library(tidyverse)
── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
✔ dplyr     1.1.4     ✔ readr     2.1.5
✔ forcats   1.0.0     ✔ stringr   1.5.1
✔ ggplot2   3.5.2     ✔ tibble    3.3.0
✔ lubridate 1.9.4     ✔ tidyr     1.3.1
✔ purrr     1.1.0     
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ dplyr::filter() masks stats::filter()
✖ dplyr::lag()    masks stats::lag()
ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors

Loading Data: this code loads in the 10 random pages, via a csv file that was created

wiki_pages <- read_csv("https://myxavier-my.sharepoint.com/:x:/g/personal/krahs_xavier_edu/ESCoUkVhKpRInnqKdK64aQEB8S5VeYH4PP1yAOCnL7sqgg?download=1")
New names:
Rows: 10 Columns: 2
── Column specification
──────────────────────────────────────────────────────── Delimiter: "," chr
(1): wiki_pages dbl (1): ...1
ℹ Use `spec()` to retrieve the full column specification for this data. ℹ
Specify the column types or set `show_col_types = FALSE` to quiet this message.
• `` -> `...1`
wiki_pages %>% nchar() %>% hist()