class: center, middle, inverse, title-slide # Veri Okuma ## Veri Aktarma ### Dr. Atalay Kabasakal ### Kasim, 2021 --- --- ## foreign paketi - Foreign paketi farkli yapilardaki verileri okumamizi saglayan bir pakettir. -- ```r install.packages("foreign") ``` -- ```r library(foreign) # library komutu paketleri yuklememizi saglar ``` --- # read.spss() fonksiyonu -- ```r spss <- read.spss("SPSS1.sav",to.data.frame=T) str(spss) #datanin yapisini gosterir ``` ``` ## 'data.frame': 24 obs. of 4 variables: ## $ Condition : num 1 1 1 1 1 1 1 1 2 2 ... ## $ Score : num 3 4 2 1 1 2 1 1 5 4 ... ## $ condition2: num 1 1 1 1 1 1 2 2 2 2 ... ## $ score2 : num 10 6 8 9 7 7 14 12 8 10 ... ## - attr(*, "codepage")= int 1252 ``` -- - Olusturalan veri seti ile islem yapilmasi -- ```r summary(spss) ``` ``` ## Condition Score condition2 score2 ## Min. :1 Min. :1.00 Min. :1.000 Min. : 6.00 ## 1st Qu.:1 1st Qu.:2.00 1st Qu.:1.000 1st Qu.: 8.00 ## Median :2 Median :3.50 Median :2.000 Median :10.00 ## Mean :2 Mean :3.25 Mean :1.867 Mean :10.13 ## 3rd Qu.:3 3rd Qu.:4.00 3rd Qu.:2.500 3rd Qu.:12.00 ## Max. :3 Max. :5.00 Max. :3.000 Max. :16.00 ## NA's :9 NA's :9 ``` -- --- # read.spss() fonksiyonu -- - Olusturalan veri seti ile islem yapilmasi -- ```r colMeans(spss) # 3. ve 4. degiskenlere ait hesaplamalar yok ``` ``` ## Condition Score condition2 score2 ## 2.00 3.25 NA NA ``` -- ```r spss[,3] # cunku kayip veri var ``` ``` ## [1] 1 1 1 1 1 1 2 2 2 2 2 3 3 3 3 NA NA NA NA NA NA NA NA NA ``` -- ```r colMeans(spss, na.rm=TRUE) # na.rm=TRUE kayip verileri yok sayar (remove NA=TRUE) ``` ``` ## Condition Score condition2 score2 ## 2.000000 3.250000 1.866667 10.133333 ``` -- --- ## haven() paketi foreign paketinde **read.spss()** fonksiyonunun karşılığıImportDataset menüsünde yer alan haven paketindeki **read_sav()** fonksiyonudur. ```r factor <- haven::read_sav("factor.sav") str(factor) ``` ``` ## tibble [50 x 3] (S3: tbl_df/tbl/data.frame) ## $ id : num [1:50] 1 2 3 4 5 6 7 8 9 10 ... ## ..- attr(*, "format.spss")= chr "F6.2" ## $ bolge: num [1:50] 1 1 1 1 1 1 1 1 1 1 ... ## ..- attr(*, "format.spss")= chr "F6.2" ## $ puan : num [1:50] 9 8 6 8 10 4 6 5 7 7 ... ## ..- attr(*, "format.spss")= chr "F6.2" ``` degiskeni faktore donusturme ```r factor$bolge <- factor(factor$bolge) ``` --- ## haven() paketi - Faktorlerin seviyelerini gorme -- ```r levels(factor$bolge) ``` ``` ## [1] "1" "2" "3" "4" "5" ``` - Seviyelere yeniden isim verme -- ```r levels(factor$bolge) <- c("Akdeniz","Ege","Marmara","IcAnadolu","Karadeniz") levels(factor$bolge) ``` ``` ## [1] "Akdeniz" "Ege" "Marmara" "IcAnadolu" "Karadeniz" ``` --- ## by() fonksiyonu **by()** fonksiyonu ile her bir bölgenin ortalamasini bulma ```r by(factor$puan,factor$bolge,mean) ``` ``` ## factor$bolge: Akdeniz ## [1] 7 ## ------------------------------------------------------------ ## factor$bolge: Ege ## [1] 6.9 ## ------------------------------------------------------------ ## factor$bolge: Marmara ## [1] 11 ## ------------------------------------------------------------ ## factor$bolge: IcAnadolu ## [1] 13.4 ## ------------------------------------------------------------ ## factor$bolge: Karadeniz ## [1] 12 ``` --- ## haven() paketi Faktorler fonksiyon kullanirken cok ise yaramakta ve siklikla kullanilmaktadir. ```r boxplot(factor$puan~factor$bolge) ``` <img src="R_11_veriaktarma_files/figure-html/unnamed-chunk-13-1.png" width="100%" /> ```r # boxplot(puan~bolge, data=factor) ```