The data we are using to practice working with SNPs in VCF files come from a paper in Molecular Ecology by Jennifer Walsh called “Subspecies delineation amid phenotypic, geographic and genetic discordance in a songbird.” The goal of the paper is to compare traditional ways of classifying different populations and subspecies of Ammodramus sparrows against morphological and genetic data. Before we can do this analysis, though, we need to prepare the data.
Missing data are common in SNP datasets, especially in samples from wild animals where the amount of DNA collected may be small. We can fill in (impute) missing data through methods such as mean imputation. However, if there are many NAs, we may want to remove the samples (in this case birds) for which we’d have to impute many - if not most - of their SNPs. When most of a sample’s data would be imputed rather than observed, we are better off removing the sample.
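As a minimal sketch of mean imputation, assume the genotypes have already been converted to numeric scores (0, 1, 2), with birds in rows and SNPs in columns. The toy matrix `snps` and its values are made up for illustration and are not taken from the sparrow data.

```r
# Toy genotype matrix (hypothetical values): 4 birds (rows) x 3 SNPs (columns)
snps <- matrix(c( 0,  1, NA,
                  2, NA,  1,
                  1,  1,  0,
                 NA,  2,  2),
               nrow = 4, byrow = TRUE,
               dimnames = list(paste0("bird", 1:4), paste0("snp", 1:3)))

# Replace each NA with the mean of the observed values in its column
snps_imputed <- snps
for (j in 1:ncol(snps_imputed)) {
  col_mean <- mean(snps_imputed[, j], na.rm = TRUE)
  snps_imputed[is.na(snps_imputed[, j]), j] <- col_mean
}

snps_imputed
```

Each NA is filled with the average of its own column (SNP), so a bird with many NAs ends up looking like the "average bird" at those positions. This is why heavily imputed samples add little information.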
There are no strict rules for many of the decisions you have to make when doing data analysis, especially when it comes to preparation and cleaning steps like removing or imputing NAs.
In this study, the authors state:
“we removed any individuals that were missing data at more than 50% of the SNPs.”
This is an arbitrary decision - they could have chosen 60% or 25%. It is therefore called a researcher degree of freedom because they are free to choose what to do, and someone else may have made a different choice.
This type of decision is very common in data analysis, and needs to be documented and justified. Posting the raw data, as the authors have, allows other researchers to explore the consequences of a different choice.
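To see how the choice of cutoff can change which samples are kept, here is a small sketch using made-up data (the matrix `snps` is hypothetical, not the authors' data). It computes the proportion of missing SNPs for each bird and counts how many birds would be kept under three different cutoffs.

```r
# Toy data (hypothetical): 5 birds x 4 SNPs, NA = missing genotype
snps <- matrix(c( 0,  1,  2,  1,
                 NA, NA, NA,  1,
                  1, NA,  0,  2,
                 NA, NA,  1,  0,
                  2,  1,  1, NA),
               nrow = 5, byrow = TRUE,
               dimnames = list(paste0("bird", 1:5), NULL))

# Proportion of SNPs missing for each bird (row)
prop_missing <- rowMeans(is.na(snps))
prop_missing

# Number of birds kept under different cutoffs;
# a bird is removed if MORE than the cutoff proportion is missing
cutoffs <- c(0.25, 0.50, 0.60)
n_kept <- sapply(cutoffs, function(cutoff) sum(prop_missing <= cutoff))
names(n_kept) <- cutoffs
n_kept
```

In this toy example the 25% cutoff keeps 3 birds, while the 50% and 60% cutoffs each keep 4, so a different but equally defensible choice would change the dataset.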
In this exercise we will show how you can remove rows of data when necessary. We’ll use the handy functions which() and is.na() and a common programming tool called a for() loop.
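Before working with the real data, here is a rough sketch of how these three tools might fit together on a toy matrix. The object names (`snps`, `n_NA`, `i_remove`, `snps_clean`) and the values are assumptions for illustration; the code used on the sparrow data may differ.

```r
# Toy data (hypothetical): 4 birds x 4 SNPs
snps <- matrix(c( 0,  1,  2,  1,
                 NA, NA, NA,  1,
                  1, NA,  0,  2,
                 NA, NA, NA, NA),
               nrow = 4, byrow = TRUE)

# Count the NAs in each row with a for() loop
n_NA <- rep(0, nrow(snps))
for (i in 1:nrow(snps)) {
  n_NA[i] <- sum(is.na(snps[i, ]))
}

# which() gives the row numbers missing more than 50% of their SNPs
i_remove <- which(n_NA > 0.5 * ncol(snps))

# Drop those rows; the length check avoids snps[-integer(0), ],
# which would return zero rows instead of all of them
if (length(i_remove) > 0) {
  snps_clean <- snps[-i_remove, , drop = FALSE]
} else {
  snps_clean <- snps
}
dim(snps_clean)
```

Here is.na() flags the missing values, the for() loop tallies them one row at a time, and which() converts the TRUE/FALSE comparison into row numbers that can be used for negative indexing.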
Note that we are removing individual samples, which are represented by rows in the data. Elsewhere in our protocol we remove columns, for example when every sample has the same genotype in a column (an invariant SNP).
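For contrast with row removal, the column case can be sketched as follows. This is only an illustration on made-up data; the names `snps`, `n_distinct`, `invariant`, and `snps_var` are hypothetical, and the protocol's own column-removal step may be written differently.

```r
# Toy data (hypothetical): 4 birds x 3 SNPs; columns 1 and 3 never vary
snps <- matrix(c(0,  1, 2,
                 0,  2, 2,
                 0, NA, 2,
                 0,  1, 2),
               nrow = 4, byrow = TRUE)

# Number of distinct non-missing genotypes in each column
n_distinct <- apply(snps, 2, function(x) length(unique(x[!is.na(x)])))

# Columns with at most one distinct value carry no information
# (this also catches columns that are entirely NA)
invariant <- which(n_distinct <= 1)

# Drop invariant columns, guarding against an empty index
if (length(invariant) > 0) {
  snps_var <- snps[, -invariant, drop = FALSE]
} else {
  snps_var <- snps
}
dim(snps_var)
```

The indexing mirrors the row case: the row position before the comma is left blank, and the negative index goes after the comma to remove columns.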
Load the vcfR package with library():
library(vcfR)
##
## ***** *** vcfR *** *****
## This is vcfR 1.13.0
## browseVignettes('vcfR') # Documentation
## citation('vcfR') # Citation
## ***** ***** ***** *****
Make sure that your working directory is set to the location of the
file all_loci.vcf.
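When a folder holds many files, a full directory listing can be hard to scan. One possible shortcut, sketched here with base R functions only, is to ask directly whether the file is present and to list only the files ending in .vcf. The variable name `vcf_file` is an assumption for illustration.

```r
# File name taken from the text; adjust if your copy is named differently
vcf_file <- "all_loci.vcf"

# TRUE if the file is in the current working directory
file.exists(vcf_file)

# List only the VCF files rather than everything in the folder
list.files(pattern = "\\.vcf$")

# If the file is not found, point R at the right folder first, e.g.
# setwd("path/to/folder")   # hypothetical path; replace with your own
```

If file.exists() returns FALSE, the working directory is probably not the folder where the file was saved.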
getwd()
## [1] "/Users/krishna/Downloads"
list.files()
## [1] " Post lab 3 B.pdf"
## [2] "[16133722 - HUMOR] The Princess Bride and the parodic impulse_ The seduction of Cinderella (1).pdf"
## [3] "[16133722 - HUMOR] The Princess Bride and the parodic impulse_ The seduction of Cinderella (2).pdf"
## [4] "[16133722 - HUMOR] The Princess Bride and the parodic impulse_ The seduction of Cinderella (3).pdf"
## [5] "[16133722 - HUMOR] The Princess Bride and the parodic impulse_ The seduction of Cinderella.pdf"
## [6] "{7C8D6B82-D416-40FB-BA89-80783DECEA3B}.png.jpg"
## [7] "~$iman CH 4 & Conclusion.doc"
## [8] "~$Lab Meeting 2 Daphnia Excel sheet 3.25.21.xlsx"
## [9] "~$ligon and science final .docx"
## [10] "~$liogn and science paper.docx"
## [11] "~$TRODUCTION TO AFRICA -- FINAL PAPER OUTLINES (2).docx"
## [12] "~$TRODUCTION TO AFRICA -- FINAL PAPER OUTLINES.docx"
## [13] "~$TRODUCTION TO AFRICA -- LECTURE NOTES - 2-5 (2).docx"
## [14] "~$wilding Discussion Questions-4.docx"
## [15] "0110 F20 Recitation Worksheet 2 Send (1).pdf"
## [16] "0110 F20 Recitation Worksheet 2 Send.pdf"
## [17] "0110 F20 420 PM Syllabus Updated with Canvas Compliance (1).pdf"
## [18] "0110 F20 420 PM Syllabus Updated with Canvas Compliance (2).pdf"
## [19] "0110 F20 420 PM Syllabus Updated with Canvas Compliance (3).pdf"
## [20] "0110 F20 420 PM Syllabus Updated with Canvas Compliance (4).pdf"
## [21] "0110 F20 420 PM Syllabus Updated with Canvas Compliance (5).pdf"
## [22] "0110 F20 420 PM Syllabus Updated with Canvas Compliance (6).pdf"
## [23] "0110 F20 420 PM Syllabus Updated with Canvas Compliance (7).pdf"
## [24] "0110 F20 420 PM Syllabus Updated with Canvas Compliance (8).pdf"
## [25] "0110 F20 420 PM Syllabus Updated with Canvas Compliance.pdf"
## [26] "0110 F20 Openstax Practice Exercises.pdf"
## [27] "0110 F20 Openstax Practice Questions Chapter 6 Electronic Structure and Periodic Properties of Elements.pdf"
## [28] "0110 F20 Outline of Possible Final Exam Material.pdf"
## [29] "0110 F20 Recitation Skills Inventory for Students Send.pdf"
## [30] "0110 F20 Recitation Worksheet 1 For Students (1).pdf"
## [31] "0110 F20 Recitation Worksheet 1 For Students (2).pdf"
## [32] "0110 F20 Recitation Worksheet 1 For Students (3).pdf"
## [33] "0110 F20 Recitation Worksheet 1 For Students.pdf"
## [34] "0110 F20 Recitation Worksheet 2 Key Send.pdf"
## [35] "0110 F20 Rules for Drawing Lewis Structures Post on Canvas.pdf"
## [36] "0110 F20 Schedule Post.pdf"
## [37] "0110 F20 Schedule Updated Post (1).pdf"
## [38] "0110 F20 Schedule Updated Post (10).pdf"
## [39] "0110 F20 Schedule Updated Post (11).pdf"
## [40] "0110 F20 Schedule Updated Post (12).pdf"
## [41] "0110 F20 Schedule Updated Post (13).pdf"
## [42] "0110 F20 Schedule Updated Post (14).pdf"
## [43] "0110 F20 Schedule Updated Post (15).pdf"
## [44] "0110 F20 Schedule Updated Post (16).pdf"
## [45] "0110 F20 Schedule Updated Post (17).pdf"
## [46] "0110 F20 Schedule Updated Post (2).pdf"
## [47] "0110 F20 Schedule Updated Post (3).pdf"
## [48] "0110 F20 Schedule Updated Post (4).pdf"
## [49] "0110 F20 Schedule Updated Post (5).pdf"
## [50] "0110 F20 Schedule Updated Post (6).pdf"
## [51] "0110 F20 Schedule Updated Post (7).pdf"
## [52] "0110 F20 Schedule Updated Post (8).pdf"
## [53] "0110 F20 Schedule Updated Post (9).pdf"
## [54] "0110 F20 Schedule Updated Post.pdf"
## [55] "0120 SP21 Notes for Students February 15 Post.pdf"
## [56] "0120 SP21 Openstax Chapter 11 Lecture Notes for Students February 5.pdf"
## [57] "0120 SP21 Recitation Worksheet 1 Chapter 10 for Students.pdf"
## [58] "0120_SP21_Recitation_Worksheet_Gradescope_Template_for_Week_of_February_14.pdf"
## [59] "0120_SP21_Recitation_Worksheet_March_29.pdf"
## [60] "0120+SP21+Recitation+Worksheet+1+Chapter+10+for+Students.pdf"
## [61] "1 0110 F20 Open Stax Chapter 1 Objectives.pdf"
## [62] "1.1 FINAL.pdf"
## [63] "10.1007_s10646-013-1061-1.ris"
## [64] "1189983.vcf"
## [65] "125938668.pdf"
## [66] "13_FABRIZIO PREGADIO_The_Way_of_the_Golden_Elixir.pdf"
## [67] "14_vivienne Lo_Food and Medicine in Traditional China.pdf"
## [68] "1540_cluster_analysis.pdf"
## [69] "1597512097.031946.jpg"
## [70] "1651137.acsm"
## [71] "2211.HAA.0010-1.Schedule.Final.docx"
## [72] "2211.HAA.0010.Handout-Power&Authority-Benin.pdf"
## [73] "2211.Handout-Descent.Bosch (1).pdf"
## [74] "2211.Handout-Descent.Bosch (2).pdf"
## [75] "2211.Handout-Descent.Bosch .pdf"
## [76] "2211.Handout-Hongzhi.Velazquez (1).docx"
## [77] "2211.Handout-Hongzhi.Velazquez.docx"
## [78] "2211.Handout-Li Cheng.Sotatsu.Hokusai.pdf"
## [79] "2211.Handout-Mosque.Cordoba.pdf"
## [80] "2211.Handout-Parthenon-1.pdf"
## [81] "2211.Handout-Rome.Forbidden.City.docx"
## [82] "2211.Handout.Power&Authority-Trajan.Bayeux (1).pdf"
## [83] "2211.Handout.Power&Authority-Trajan.Bayeux.pdf"
## [84] "2211.Smithson.Christo & Jeanne-Claude (1).pdf"
## [85] "2211.Smithson.Christo & Jeanne-Claude (2).pdf"
## [86] "2211.Smithson.Christo & Jeanne-Claude.pdf"
## [87] "2221_Exam3.pdf"
## [88] "3_The Meaning of “Culture” _ The New Yorker.pdf"
## [89] "3.Moai.Figures.and.VVM (1).pdf"
## [90] "3.Moai.Figures.and.VVM (2).pdf"
## [91] "3.Moai.Figures.and.VVM.pdf"
## [92] "5_Sun, Anna_Confucianism as a World Religion 2013.pdf"
## [93] "6.pdf"
## [94] "64875390710__ADFCE9EB-FF11-43DF-9002-B9A963EFA982.heic"
## [95] "824f8f2a5d6dbc6ad05b8beb5c3f3a30.torrent"
## [96] "A Pictorial Tour of Garfield Community Farm (1).pdf"
## [97] "A Pictorial Tour of Garfield Community Farm (2).pdf"
## [98] "A Pictorial Tour of Garfield Community Farm.pdf"
## [99] "A Pocket Style Manual 6th Editon by Hacker, Diana Sommers, Nancy (z-lib.org).pdf"
## [100] "A704A49F-15C5-4CCC-B85A-026D3DD079C6.jpg"
## [101] "Academic Foundations assignment.pdf"
## [102] "ACFrOgBpCSeTqGyNLw04rSSauwQg1tZ2TeQXf9JBPwhiZmA9EXu-aj8jdHkkVnu_2kOxNcB9WmOqufxFiBLctXdNWV6QmDYXz_kneq0QZgDAEkK89sruDsoBhXeqil630sHH19A7IZxzyAFlvzwg.pdf"
## [103] "AcroRdrDC_2100120145_MUI.dmg"
## [104] "AcroRdrDC_2100120145_MUI.pkg"
## [105] "Activity 2 In-Lab Workshop Problems, Part 1 and Part 2 (for printing if desired).pdf"
## [106] "Activity 2 Watson and Crick original paper.pdf"
## [107] "Africa_Fourth_Edition_----_(2._Legacies_of_the_Past_Themes_in_African_History).pdf"
## [108] "alcoholedu-for-college-part-two.pdf"
## [109] "all_loci-1.vcf"
## [110] "all_loci.vcf"
## [111] "ALL.chr10_GRCh38.genotypes.20170504.vcf.gz"
## [112] "allomtery_3_scatterplot3d (1).Rmd"
## [113] "Analyze_Course-wide_Community_data-1.pptx"
## [114] "anki-2.1.32-mac.dmg"
## [115] "annotated-Beetleguese%20essay.docx.pdf"
## [116] "annotated-Libraynth%20.pdf"
## [117] "annotated-Reflections%20Religion%20and%20Science (1).pdf"
## [118] "annotated-Reflections%20Religion%20and%20Science.pdf"
## [119] "Answer Key_Chem 0310_Study guide_FInal exam_Fall 2021.pdf"
## [120] "Anthro Mini Presentation (1).pdf"
## [121] "Anthro Mini Presentation.pdf"
## [122] "AP Chapter 14 Jeopardy.ppt"
## [123] "Appa.jpg"
## [124] "Aristotle-1.Physics (1).pdf"
## [125] "Aristotle-1.Physics (2).pdf"
## [126] "Aristotle-1.Physics (3).pdf"
## [127] "Aristotle-1.Physics.pdf"
## [128] "Art Final - Google Docs (1).pdf"
## [129] "Art Final - Google Docs (2).pdf"
## [130] "Art Final - Google Docs (3).pdf"
## [131] "Art Final - Google Docs (4).pdf"
## [132] "Art Final - Google Docs.pdf"
## [133] "Art final part 2 - Google Docs.pdf"
## [134] "Assistant Coordinator Final Schedule.xlsx"
## [135] "Auto Ethnographic Assignment #1 – Language.pdf"
## [136] "Avatar.jpg"
## [137] "Back to the Basics of Ecology-Carson (1).pptx"
## [138] "Back to the Basics of Ecology-Carson (2).pptx"
## [139] "Back to the Basics of Ecology-Carson.pptx"
## [140] "Bakar_Science_and_Religion_Christian_and_Muslim_Perspectives.AlGhazali and Averroes (1).pdf"
## [141] "Bakar_Science_and_Religion_Christian_and_Muslim_Perspectives.AlGhazali and Averroes (2).pdf"
## [142] "Bakar_Science_and_Religion_Christian_and_Muslim_Perspectives.AlGhazali and Averroes.pdf"
## [143] "Beetleguese essay.docx"
## [144] "Beetleguese essay.pdf"
## [145] "Bibliography_679d7d3b-9152-4b12-a5d9-29a809ab19a7.docx"
## [146] "Bio lab week 12 (1).pdf"
## [147] "Bio lab week 12.docx"
## [148] "Bio lab week 12.pdf"
## [149] "bio office hours.docx"
## [150] "BIOSC 0150 (2211) - Course Schedule_FINAL (1).pdf"
## [151] "BIOSC 0150 (2211) - Course Schedule_FINAL.pdf"
## [152] "BIOSC 0150 (2211) - Quiz Wrapper.docx"
## [153] "BIOSC 0150 (2211) - Syllabus_FINAL (1).pdf"
## [154] "BIOSC 0150 (2211) - Syllabus_FINAL.pdf"
## [155] "BIOSC 0150 Grade Calculator (final).xlsx"
## [156] "BIOSC 0150 Module 1_ Biomolecules (1).apkg"
## [157] "BIOSC 0150 Module 1_ Biomolecules (2).apkg"
## [158] "BIOSC 0150 Module 1_ Biomolecules (3).apkg"
## [159] "BIOSC 0150 Module 1_ Biomolecules (4).apkg"
## [160] "BIOSC 0150 Module 1_ Biomolecules (5).apkg"
## [161] "BIOSC 0150 Module 1_ Biomolecules (6).apkg"
## [162] "BIOSC 0150 Module 1_ Biomolecules (7).apkg"
## [163] "BIOSC 0150 Module 1_ Biomolecules.apkg"
## [164] "bird_snps_remove_NAs.Rmd"
## [165] "Black_Bodies_White_Gazes_The_Continuing_Significance_of_Race_in_America.pdf"
## [166] "Blog #3 (1).docx"
## [167] "Blog #3.docx"
## [168] "Blog Entry 2 (1).docx"
## [169] "Blog Entry 2.docx"
## [170] "Blog number 1 Bio lab (1).docx"
## [171] "Blog number 1 Bio lab (2).docx"
## [172] "Blog number 1 Bio lab.docx"
## [173] "Body Fat.mpx"
## [174] "brockstring_SPSS_04October_JG.sav"
## [175] "Brodd_Ch 1_Religion.pdf"
## [176] "Bullshit.pdf"
## [177] "c0110_expt1_intro.pdf"
## [178] "c0110_expt1_online (1).pdf"
## [179] "c0110_expt1_online.pdf"
## [180] "c0110_expt11_intro.pdf"
## [181] "c0110_expt11_online_fillable_pdf (1).pdf"
## [182] "c0110_expt11_online_fillable_pdf.pdf"
## [183] "c0110_expt2_intro.pdf"
## [184] "c0110_expt2_online (1) (1).pdf"
## [185] "c0110_expt2_online (1) (2).pdf"
## [186] "c0110_expt2_online (1).pdf"
## [187] "c0110_expt2_online.pdf"
## [188] "c0110_expt3_intro.pdf"
## [189] "c0110_expt3_online (1).pdf"
## [190] "c0110_expt3_online (2).pdf"
## [191] "c0110_expt3_online (3).pdf"
## [192] "c0110_expt3_online.pdf"
## [193] "c0110_expt4_intro (1).pdf"
## [194] "c0110_expt4_intro (2).pdf"
## [195] "c0110_expt4_intro (3).pdf"
## [196] "c0110_expt4_intro.pdf"
## [197] "c0110_expt4_online (1).pdf"
## [198] "c0110_expt4_online.pdf"
## [199] "c0110_expt5_intro (1).pdf"
## [200] "c0110_expt5_intro.pdf"
## [201] "c0110_expt5_online_fillable (1).pdf"
## [202] "c0110_expt5_online_fillable.pdf"
## [203] "c0110_expt6_intro (1).pdf"
## [204] "c0110_expt6_intro.pdf"
## [205] "c0110_expt6_online_fillable (1).pdf"
## [206] "c0110_expt6_online_fillable (2) (1).pdf"
## [207] "c0110_expt6_online_fillable (2).pdf"
## [208] "c0110_expt6_online_fillable.pdf"
## [209] "c0110_expt7_online_fillable_pdf.pdf"
## [210] "c0110_expt7Iintro.pdf"
## [211] "c0110_expt8_online_fillable_pdf.pdf"
## [212] "c0110_expt9_online_fillable_pdf.pdf"
## [213] "c0120_expt1.pdf"
## [214] "c0120_expt10_online_fillable.pdf"
## [215] "c0120_expt2_chromatograms.pdf"
## [216] "c0120_expt2_intro.pdf"
## [217] "c0120_expt3_online_fillable.pdf"
## [218] "c0120_expt8_online_fillable.pdf"
## [219] "c0120_expt9_online_fillable.pdf"
## [220] "CAC.png"
## [221] "Calendar.pdf"
## [222] "Campbell pp 19-31.pdf"
## [223] "Campbell pp 210-211.pdf"
## [224] "Carson and Root 2000 copy.pdf"
## [225] "Carson et al. post release APIS2008(2) (1).pdf"
## [226] "Carson et al. post release APIS2008(2) (2).pdf"
## [227] "Carson et al. post release APIS2008(2).pdf"
## [228] "center_function (1).R"
## [229] "center_function.R"
## [230] "Ch 13 Study Guide.pdf"
## [231] "Ch 14 Study Guide.pdf"
## [232] "Ch 7 lecture 1.pdf"
## [233] "Ch4 Getting wasted .doc"
## [234] "Ch5 getting wasted.doc"
## [235] "Ch6 getting wasted.doc"
## [236] "Chan paper (1).pdf"
## [237] "Chan paper (2).pdf"
## [238] "Chan paper (3).pdf"
## [239] "Chan paper (4).pdf"
## [240] "Chan paper (5).pdf"
## [241] "Chan paper (6).pdf"
## [242] "Chan paper.pdf"
## [243] "Chapter 21 Study Guide.pdf"
## [244] "Chapter 22 Study Guide.pdf"
## [245] "Chapter 55 study guide.pdf"
## [246] "ChartReviewInformation_Clinica (1).pdf"
## [247] "ChartReviewInformation_Clinica (2).pdf"
## [248] "ChartReviewInformation_Clinica.pdf"
## [249] "Chem 0310_Fall 2021_Chapter 3.pdf"
## [250] "Chem 0310_Feel Good Exam_Fall 2021.docx"
## [251] "Chem 0310_Study guide_FInal exam_Fall 2021.pdf"
## [252] "Chem 0310_Take Home Exam_Fall 2021 (1).pdf"
## [253] "Chem 0310_Take Home Exam_Fall 2021.pdf"
## [254] "Chem 2 Lab 1 (1).pdf"
## [255] "Chem 2 Lab 1 (2).pdf"
## [256] "Chem 2 Lab 1.pdf"
## [257] "Chem 2 Lab 2.pdf"
## [258] "Chem+0310_Feel+Good+Exam_Fall+2021.pdf"
## [259] "Chem+0310_Take+Home+Exam_Fall+2021-1.pdf"
## [260] "Chem+0310_Take+Home+Exam_Fall+2021.pdf"
## [261] "CHEM0310_Problem Set 2_Fall 2021.pdf"
## [262] "CHEM0310_Problem Set 3_Fall 2021 (1).pdf"
## [263] "CHEM0310_Problem Set 3_Fall 2021.pdf"
## [264] "CHEM0310_Problem Set 7_Fall 2021.pdf"
## [265] "CHEM0310_Problem Set 9_Fall 2021.pdf"
## [266] "Chemically Active Extraction (Slides).pdf"
## [267] "Chemistry2e textbook.pdf"
## [268] "CLARK - PHYS 0212-0219 - Lecture 01 - Changing Motion.pdf"
## [269] "ClinicalOutcomesFollowingRotat_2020-10-16_1156.pdf"
## [270] "ClinicalOutcomesFollowingRotat_2020-10-28_1331.pdf"
## [271] "ClinicalOutcomesFollowingRotat_2020-10-28_1343.pdf"
## [272] "ClinicalOutcomesFollowingRotat_2020-10-28_1344.pdf"
## [273] "ClinicalOutcomesFollowingRotat_2020-10-28_1348.pdf"
## [274] "ClinicalOutcomesFollowingRotat_2020-10-30_1842.pdf"
## [275] "ClinicalOutcomesFollowingRotat_2020-11-30_1230.pdf"
## [276] "ClinicalOutcomesFollowingRotat_2020-12-02_1503.pdf"
## [277] "ClinicalOutcomesFollowingRotat_DataDictionary_2020-10-16.csv"
## [278] "ClinicalOutcomesFollowingRotat_DataDictionary_2020-10-28 (1).csv"
## [279] "ClinicalOutcomesFollowingRotat_DataDictionary_2020-10-28.csv"
## [280] "ClinicalOutcomesFollowingRotat_DataDictionary_2020-10-30 (1).csv"
## [281] "ClinicalOutcomesFollowingRotat_DataDictionary_2020-10-30.csv"
## [282] "cluster_analysis_portfolio (1).Rmd"
## [283] "cluster_analysis_portfolio.Rmd"
## [284] "cluster_analysis_with_Higgs_aa (1).R"
## [285] "cluster_analysis_with_Higgs_aa.R"
## [286] "cobi.12738 (1).pdf"
## [287] "cobi.12738.pdf"
## [288] "code_checkpoint_vcfR.html"
## [289] "code_checkpoint_vcfR.Rmd"
## [290] "CODE_CHECKPOINT-first_rstudio_script (1).R"
## [291] "CODE_CHECKPOINT-first_rstudio_script (2).R"
## [292] "CODE_CHECKPOINT-first_rstudio_script.R"
## [293] "codon chart (1).png"
## [294] "codon chart.png"
## [295] "Collab_Writing_PEER_REVIEW_WORKSHEET.doc.pdf"
## [296] "CollegeVsHSWriting.pdf"
## [297] "Community slides.pptx"
## [298] "Comp bio shit "
## [299] "Competition-I&II 2014.pptx"
## [300] "Coogan Genesis-1.pdf"
## [301] "Copy of Chapter 13 Textbook Notes.pdf"
## [302] "Corrigan--Writing-About-Film Ch 1 (1).pdf"
## [303] "Corrigan--Writing-About-Film Ch 1 (2).pdf"
## [304] "Corrigan--Writing-About-Film Ch 1.pdf"
## [305] "Corrigan--Writing-About-Film Ch 2.pdf"
## [306] "corriganchapter3-1 (1).pdf"
## [307] "corriganchapter3-1 (2).pdf"
## [308] "corriganchapter3-1 (3).pdf"
## [309] "corriganchapter3-1 (4).pdf"
## [310] "corriganchapter3-1.pdf"
## [311] "Cover Letter Revision Seminar in comp film.docx"
## [312] "Cover Letter Revision Seminar in comp film.pdf"
## [313] "Covid test conformation.pdf"
## [314] "Criminal Justice Intro(1).docx"
## [315] "Cultural_anthropology.pdf"
## [316] "Daphnia thing.xlsx"
## [317] "Daphnia.pptx"
## [318] "data_structures-DFs (1).pdf"
## [319] "data_structures-DFs (2).pdf"
## [320] "data_structures-DFs (3).pdf"
## [321] "data_structures-DFs.pdf"
## [322] "dawkins_science discredits religion.pdf"
## [323] "DBF61B98-02E0-46E1-AF26-E65EF06BC312.jpg"
## [324] "Dennis_social_darwinism_scientific_racism.pdf"
## [325] "Development Theories Primer (1) (1).doc"
## [326] "Development Theories Primer (1).doc"
## [327] "Devils Gardens.pdf"
## [328] "Diagloues .pdf"
## [329] "Diet (1).mpx"
## [330] "Diet.mpx"
## [331] "Discord (1).dmg"
## [332] "Done Teme.pdf"
## [333] "Donlan et al. 2005(4).pdf"
## [334] "download.pdf"
## [335] "eckl,+JNS_23_Amupanda (1).pdf"
## [336] "eckl,+JNS_23_Amupanda.pdf"
## [337] "Ecology - The Economy of Nature 8e [Rick Relyea].pdf"
## [338] "Ecology note card.pdf"
## [339] "Enns evolution of adam ch. 3-1.pdf"
## [340] "ENUMA ELISH.pdf"
## [341] "environmental racism.pdf"
## [342] "Estes_etal_2011 (1).pdf"
## [343] "Evolution and Adaptation, Life Histories-Carson-6.pptx"
## [344] "Exam 1_ Krishna Patel_files"
## [345] "Exam 1_ Krishna Patel.htm"
## [346] "Exam 1_ Krishna Patel.mhtml"
## [347] "Exam 3note cars.pdf"
## [348] "Exam 4 Review Sheet -1.pdf"
## [349] "Exam 4 Review Sheet .pdf"
## [350] "Exam I Practice Questions.pdf"
## [351] "Exam+Ch.+1-6.pdf"
## [352] "Exam+Ch.+7-11.pdf"
## [353] "Exam3_2204.pdf"
## [354] "Exam3am_2191.pdf"
## [355] "Example Exam 3 Questions.pdf"
## [356] "Example Exam Questions (1).pdf"
## [357] "Example Exam Questions (2).pdf"
## [358] "Example Exam Questions.pdf"
## [359] "Exp 1 - GC2.pdf"
## [360] "Experiment 10 Chem 1 Lab.pdf"
## [361] "Experiment 12.pdf"
## [362] "Expermient 11.pdf"
## [363] "Fantasy Brewer pg 193 (1).pdf"
## [364] "Fantasy Brewer pg 193 (2).pdf"
## [365] "Fantasy Brewer pg 193.pdf"
## [366] "feature_engineering (1).Rmd"
## [367] "feature_engineering_intro_2_functions-part2.Rmd"
## [368] "feature_engineering.Rmd"
## [369] "feb28.pdf"
## [370] "fiji-macosx.zip"
## [371] "file (1).pdf"
## [372] "file.pdf"
## [373] "Film Terms Sheets.pdf"
## [374] "final exam review burlew.pptx"
## [375] "Final_Exam_Formula_Sheet (1).pdf"
## [376] "Final_Exam_Formula_Sheet (2).pdf"
## [377] "Final_Exam_Formula_Sheet.pdf"
## [378] "Flaming Snowball - Gabrielle & Lisa (1).doc"
## [379] "Flaming Snowball - Gabrielle & Lisa.doc"
## [380] "Flex Pitt Provisions.pdf"
## [381] "Flow Chart.pdf"
## [382] "Forest Dynamics talk (1).pptx"
## [383] "Forest Dynamics talk (2).pptx"
## [384] "Forest Dynamics talk .pptx"
## [385] "free jazz.pdf"
## [386] "Freshman year Pitt"
## [387] "Freud Dreams pp 34-44 (1).pdf"
## [388] "Freud Dreams pp 34-44 (2).pdf"
## [389] "Freud Dreams pp 34-44.pdf"
## [390] "Freud Uncanny (1).pdf"
## [391] "Freud Uncanny.pdf"
## [392] "Furhter direction sldie.pptx"
## [393] "Fusion.pdf"
## [394] "Galileo Galilei_Letter to Grand Duchess.pdf"
## [395] "Galileo Goes to Jail_ And Other Myths about Science and Religion.pdf"
## [396] "Galileo_Starry Messenger.pdf"
## [397] "Garfield Farms Essay.docx"
## [398] "GCB lecture 11-18-2021 (1).pptx"
## [399] "GCB lecture 11-18-2021 (2).pptx"
## [400] "GCB lecture 11-18-2021 (3).pptx"
## [401] "GCB lecture 11-18-2021.pptx"
## [402] "Genesis Rabbah.pdf"
## [403] "Getting Wasted Soc Ch.2-3.doc"
## [404] "Getting Wasted Soc Ch1.doc"
## [405] "Giberson Chap 3 (1).pdf"
## [406] "Giberson Chap 3.pdf"
## [407] "glassware_specs.pdf"
## [408] "Global Citizenship.png"
## [409] "Goldberg_Genetics_7e_CH09_SMSG_pdf (1).pdf"
## [410] "Goldberg_Genetics_7e_CH09_SMSG_pdf.pdf"
## [411] "Goldberg_Genetics_7e_CH12_SMSG_pdf.pdf"
## [412] "Gould.Two Separate Domains.pdf"
## [413] "Group Assignment (1).pdf"
## [414] "Group Assignment.pdf"
## [415] "guest lecture seed banks (1).pptx"
## [416] "guest lecture seed banks.pptx"
## [417] "HAA 0010 FINAL PROJECT COMPARATIVE ANALYSIS RUBRIC (1).docx"
## [418] "HAA 0010 FINAL PROJECT COMPARATIVE ANALYSIS RUBRIC.docx"
## [419] "Handout #1_ What is Art_"
## [420] "Handout 1 (Accompanying Slides).pdf"
## [421] "Handout 10 (Accompanying Slides) (1).pdf"
## [422] "Handout 10 (Accompanying Slides).pdf"
## [423] "Handout 11 (Accompanying Slides) (1).pdf"
## [424] "Handout 11 (Accompanying Slides).pdf"
## [425] "Handout 12 (Accompanying Slides).pdf"
## [426] "Handout 13 (Accompanying Slides).pdf"
## [427] "Handout 14 (Accompanying Slides) (1).pdf"
## [428] "Handout 14 (Accompanying Slides) (2).pdf"
## [429] "Handout 14 (Accompanying Slides).pdf"
## [430] "Handout 15 (Accompanying Slides) (1).pdf"
## [431] "Handout 15 (Accompanying Slides) (2).pdf"
## [432] "Handout 15 (Accompanying Slides).pdf"
## [433] "Handout 16 (Accompanying Slides).pdf"
## [434] "Handout 17 (Accompanying Slides).pdf"
## [435] "Handout 18 (Accompanying Slides).pdf"
## [436] "Handout 19 (Accompanying Slides).pdf"
## [437] "Handout 2(Accompanying Slides).pdf.pdf"
## [438] "Handout 20 (Accompanying Slides).pdf"
## [439] "Handout 3 (Accompanying Slides).pdf"
## [440] "Handout 4 (Accompanying Sldes).pdf"
## [441] "Handout 5 (Accompanying Slides) (1).pdf"
## [442] "Handout 5 (Accompanying Slides).pdf"
## [443] "Handout 6 (Accompanying Slides).pdf"
## [444] "Handout 7 (Accompanying Slides) (1).pdf"
## [445] "Handout 7 (Accompanying Slides) (2).pdf"
## [446] "Handout 7 (Accompanying Slides) (3).pdf"
## [447] "Handout 7 (Accompanying Slides).pdf"
## [448] "Handout 8 (Accompanying Slides) (1).pdf"
## [449] "Handout 8 (Accompanying Slides).pdf"
## [450] "Handout 9 (Accompanying Slides) (1).pdf"
## [451] "Handout 9 (Accompanying Slides).pdf"
## [452] "Hatfull Spencer paper (1).pdf"
## [453] "Hatfull Spencer paper.pdf"
## [454] "Hegazy, Habiel, and Fawzy. 2009 (1).pdf"
## [455] "Hegazy, Habiel, and Fawzy. 2009 (2).pdf"
## [456] "Hegazy, Habiel, and Fawzy. 2009.pdf"
## [457] "HighSchool resume (1).docx"
## [458] "HighSchool resume.docx"
## [459] "Hirsch_etal_2014 (1).pdf"
## [460] "Hirsch_etal_2014 (2).pdf"
## [461] "Hirsch_etal_2014.pdf"
## [462] "Hist.docx"
## [463] "Hist.pdf"
## [464] "HOJ F22 Week 11 hard bop.pdf"
## [465] "HOJ F22 Week 12 free jazz.pdf"
## [466] "HOJ F22 Week 12 racial politics_free jazz 1.pdf"
## [467] "HOJ F22 Week 14 fusion 1.pdf"
## [468] "HOJ F22 Week 14 fusion 2.pdf"
## [469] "HOJ F22 Week 4 Blues Legacies 1.pdf"
## [470] "HOJ F22 Week 4 New Orleans 1.pdf"
## [471] "HOJ F22 Week 5 Chicago.pdf"
## [472] "HOJ F22 Week 6 Kansas City.pdf"
## [473] "Homework 1 Chapter 1,2,3 (1).pdf"
## [474] "Homework 1 Chapter 1,2,3 (2) (1).pdf"
## [475] "Homework 1 Chapter 1,2,3 (2) (2).pdf"
## [476] "Homework 1 Chapter 1,2,3 (2).pdf"
## [477] "Homework 1 Chapter 1,2,3.pdf"
## [478] "Homework week 8 bio lab.pptx"
## [479] "How to do Mendel type problems (1).pptx"
## [480] "How to do Mendel type problems.pptx"
## [481] "HW-4-5 (1).pdf"
## [482] "HW-4-5.pdf"
## [483] "HW-6.pdf"
## [484] "HW-7-8.pdf"
## [485] "HW+11.pdf"
## [486] "HW+16.pdf"
## [487] "hw16_1_Q3.jpg"
## [488] "id.pdf"
## [489] "IMG_0115.PNG"
## [490] "IMG_0135.PNG"
## [491] "IMG_0237.PNG"
## [492] "IMG_0289 2.HEIC"
## [493] "IMG_0289.HEIC"
## [494] "IMG_0342.HEIC"
## [495] "IMG_0452.HEIC"
## [496] "IMG_0488.JPG"
## [497] "IMG_0762.mov"
## [498] "IMG_0766.mov"
## [499] "IMG_0767.mov"
## [500] "IMG_0769.mov"
## [501] "IMG_0771.mov"
## [502] "IMG_0D9ABA74D56F-1.heic"
## [503] "IMG_2293.HEIC"
## [504] "IMG_2380 copy.jpg"
## [505] "IMG_2380.HEIC"
## [506] "IMG_2393 copy.jpg"
## [507] "IMG_2583.HEIC"
## [508] "IMG_2583.png"
## [509] "IMG_2C431C072954-1.jpeg"
## [510] "IMG_2C431C072954-1.png"
## [511] "IMG_3455.heic"
## [512] "IMG_4440 copy.jpg"
## [513] "IMG_5436 copy.pdf"
## [514] "IMG_6142.heic"
## [515] "IMG_7509 2.HEIC"
## [516] "IMG_8511.heic"
## [517] "IMG_AB9A219EFA7A-1.jpeg"
## [518] "IMG_AB9A219EFA7A-1.png"
## [519] "IMG_B68DD95C6828-1.jpeg"
## [520] "IMG_FDB33BB32CC7-1.jpeg"
## [521] "IMG_FDB33BB32CC7-1.png"
## [522] "individual_notebook_population_spreadsheet_sp21 (1).xlsx"
## [523] "individual_notebook_population_spreadsheet_sp21.xlsx"
## [524] "Install Spotify.app"
## [525] "InstallBackupAndSync.dmg"
## [526] "InstallLDBPackage64c-2-0-8-01.zip"
## [527] "Intro to Africa Final.pdf"
## [528] "Intro to Africa Unit 2 notes.docx"
## [529] "Introduction slides-1-1 (1).pptx"
## [530] "Introduction slides-1-1 (2).pptx"
## [531] "Introduction slides-1-1.pptx"
## [532] "Introduction slides-1.pptx"
## [533] "INTRODUCTION TO AFRICA - FILMS {SAHARA, SAVANNA, FOREST} (1).docx"
## [534] "INTRODUCTION TO AFRICA - FILMS {SAHARA, SAVANNA, FOREST}.docx"
## [535] "INTRODUCTION TO AFRICA - TEST GUIDELINES - 2 (1).docx"
## [536] "INTRODUCTION TO AFRICA - TEST GUIDELINES - 2-1.docx"
## [537] "INTRODUCTION TO AFRICA - TEST GUIDELINES - 2.docx"
## [538] "INTRODUCTION TO AFRICA - TEST GUIDELINES (1).docx"
## [539] "INTRODUCTION TO AFRICA - TEST GUIDELINES.docx"
## [540] "INTRODUCTION TO AFRICA -- FINAL PAPER OUTLINES (1).docx"
## [541] "INTRODUCTION TO AFRICA -- FINAL PAPER OUTLINES (2).docx"
## [542] "INTRODUCTION TO AFRICA -- FINAL PAPER OUTLINES.docx"
## [543] "INTRODUCTION TO AFRICA -- LECTURE NOTES - 2 (1).docx"
## [544] "INTRODUCTION TO AFRICA -- LECTURE NOTES - 2-0a8505b7-1bc9-4f6d-99f9-e13dc3ef2ee6.docx"
## [545] "INTRODUCTION TO AFRICA -- LECTURE NOTES - 2-11df78e7-773b-4443-96ba-d7a319003b76.docx"
## [546] "INTRODUCTION TO AFRICA -- LECTURE NOTES - 2-2.docx"
## [547] "INTRODUCTION TO AFRICA -- LECTURE NOTES - 2-3.docx"
## [548] "INTRODUCTION TO AFRICA -- LECTURE NOTES - 2-5 (1).docx"
## [549] "INTRODUCTION TO AFRICA -- LECTURE NOTES - 2-5 (2).docx"
## [550] "INTRODUCTION TO AFRICA -- LECTURE NOTES - 2-5.docx"
## [551] "INTRODUCTION TO AFRICA -- LECTURE NOTES - 2-6 (1).docx"
## [552] "INTRODUCTION TO AFRICA -- LECTURE NOTES - 2-6.docx"
## [553] "INTRODUCTION TO AFRICA -- LECTURE NOTES - 2-7.docx"
## [554] "INTRODUCTION TO AFRICA -- LECTURE NOTES - 2-ed609f97-4368-4e02-94c0-4972369c04e8.docx"
## [555] "INTRODUCTION TO AFRICA -- LECTURE NOTES - 2.docx"
## [556] "INTRODUCTION TO AFRICA.docx"
## [557] "Invasive Species-Carson-Final-2 (1).ppt"
## [558] "Invasive Species-Carson-Final-2 (2).ppt"
## [559] "Invasive Species-Carson-Final-2 (3).ppt"
## [560] "Invasive Species-Carson-Final-2.ppt"
## [561] "IWA PART 1 MIDTERM ESSAY (1).docx"
## [562] "IWA PART 1 MIDTERM ESSAY (2).docx"
## [563] "IWA PART 1 MIDTERM ESSAY.docx"
## [564] "IWA PART 2 ART in Quarantine (1).docx"
## [565] "IWA PART 2 ART in Quarantine.docx"
## [566] "Jade.Axe"
## [567] "JC #1 - Hegazy, Kabiel, and Fawzy (2009) (1).pptx"
## [568] "JC #1 - Hegazy, Kabiel, and Fawzy (2009).pptx"
## [569] "JC #2 - Paolacci, Jansen, and Harrison (2018).pptx"
## [570] "Jeffrey H. Reiman_ Paul Leighton - The Rich Get Richer and the Poor Get Prison_ Ideology, Class, and Criminal Justice (2016, Routledge) - libgen.lc 2.pdf"
## [571] "John Creasy_HandMade with Soil (1).pdf"
## [572] "John Creasy_HandMade with Soil.pdf"
## [573] "John Esposito Chemistry 0120 Recitation and Office Hours Links (1).pdf"
## [574] "John Esposito Chemistry 0120 Recitation and Office Hours Links.pdf"
## [575] "Jounral Bio 1 thing (1).pdf"
## [576] "Jounral Bio 1 thing (2).pdf"
## [577] "Jounral Bio 1 thing.pdf"
## [578] "Jounral CLub #2.pdf"
## [579] "Jounral CLub #2.pptx"
## [580] "JUNIOR DRIVER'S LICENSE.pdf"
## [581] "Key to practice questions.pdf"
## [582] "KeyExam3MW.doc"
## [583] "Khafre.Qin.APAP.reading.pdf"
## [584] "KindleForMac-1.30.59055.dmg"
## [585] "Krishna Final Revision.pdf"
## [586] "Krishna Paper 1 (1).pdf"
## [587] "Krishna Paper 1 (2).pdf"
## [588] "Krishna Paper 1.pdf"
## [589] "Krishna Paper Three (1).pdf"
## [590] "Krishna Paper Three (2).pdf"
## [591] "Krishna Paper Three (3).pdf"
## [592] "Krishna Paper Three.pdf"
## [593] "Krishna Paper Two (1).pdf"
## [594] "Krishna Paper Two (2).pdf"
## [595] "Krishna Paper Two (3).pdf"
## [596] "Krishna Paper Two Revision.pdf"
## [597] "Krishna Paper Two.pdf"
## [598] "KrishnaPatel_Electronic Fine Slip.pdf"
## [599] "Kurten&Carson_2015_BioSci(4) copy (1).pdf"
## [600] "Kurten&Carson_2015_BioSci(4) copy.pdf"
## [601] "L1.3 Biomolecules Overview Slides and Study Guide.pdf"
## [602] "Lab 01 Report - Changing Motion.docx"
## [603] "Lab 03 Report - Combining Forces.docx"
## [604] "Lab 05 Report - Two Dimensional Motion (Projectile Motion) (1).docx"
## [605] "Lab 05 Report - Two Dimensional Motion (Projectile Motion).docx"
## [606] "Lab 06 Report - Conservation of Energy.docx"
## [607] "Lab 07 Report - Voltage in Simple DC Circuits and Ohms Law.docx"
## [608] "Lab 1 (1).pdf"
## [609] "Lab 1 .pdf"
## [610] "Lab 10 .pdf"
## [611] "Lab 10 Prelab.pdf"
## [612] "lab 11.pdf"
## [613] "Lab 4 (1).pdf"
## [614] "Lab 4 gen cehm 1 (1).pdf"
## [615] "Lab 4 gen cehm 1.pdf"
## [616] "Lab 4.pdf"
## [617] "Lab 5 pt.2.pdf"
## [618] "Lab 5 .pdf"
## [619] "lab 5'.pdf"
## [620] "Lab 6 pl.pdf"
## [621] "Lab 6.pdf"
## [622] "Lab 8 .pdf"
## [623] "Lab 8 Prelan .pdf"
## [624] "Lab 8 try 2.pdf"
## [625] "Lab 9 pre lab .pdf"
## [626] "lab 9.pdf"
## [627] "Lab Meeting 2 Daphnia Excel sheet 3.25.21 (1).xlsx"
## [628] "Lab Meeting 2 Daphnia Excel sheet 3.25.21.xlsx"
## [629] "Lab Meeting_Community Experiment.docx"
## [630] "labsyllabus_0120_2214.pdf"
## [631] "least_squares_intro-2.pdf"
## [632] "Lecture 1.2- Water and Bonding Powerpoint and Study Guide.pdf"
## [633] "Lecture 1.4- Carbohydrates Powerpoint and Study Guide.pdf"
## [634] "Lecture 1.5- Lipids Powerpoint and Study Guide.pdf"
## [635] "Lecture 1.7- RNA Structure and Function Powerpoint and Study Guide.pdf"
## [636] "Lecture 1.9- Translation Powerpoint and Study Guide.pdf"
## [637] "Lecture 7 Practice Problem (Solutions)_20200905_0001.pdf"
## [638] "Lecture B Beyond Mendel Fall 2022 no signaling epistasis no pedigree.pptx"
## [639] "Lecture B Beyond Mendel Fall 2022 pre no signaling epistasis no pedigree.pdf"
## [640] "Lecture C Chromosome theory 2022 for Quiz 1.pptx"
## [641] "Lecture F Mutations Fall 2022.pptx"
## [642] "Lecture G1 Genetic code 2022 2.pptx"
## [643] "Lecture G1 Genetic code 2022 pre.pdf"
## [644] "Lecture G2 Trancription 2022 pre.pdf"
## [645] "Lecture G3 Translation 2022 pre.pdf"
## [646] "Lecture G4 Mutations 2 2022 pre (1).pdf"
## [647] "Lecture G4 Mutations 2 2022 pre (2).pdf"
## [648] "Lecture G4 Mutations 2 2022 pre (3).pdf"
## [649] "Lecture G4 Mutations 2 2022 pre.pdf"
## [650] "Lecture G4 Mutations 2 2022.pptx"
## [651] "Lecture H Molecular Genetic Techniques 2022 pre (1).pdf"
## [652] "Lecture H Molecular Genetic Techniques 2022 pre (2).pdf"
## [653] "Lecture H Molecular Genetic Techniques 2022 pre (3).pdf"
## [654] "Lecture H Molecular Genetic Techniques 2022 pre (4).pdf"
## [655] "Lecture H Molecular Genetic Techniques 2022 pre.pdf"
## [656] "Lecture I Molecular Analysis of Genomes 2022 (1).pptx"
## [657] "Lecture I Molecular Analysis of Genomes 2022 pre.pdf"
## [658] "Lecture I Molecular Analysis of Genomes 2022.pptx"
## [659] "Lecture J Chromosomes 2022 2.pptx"
## [660] "Lecture J Chromosomes 2022 pre.pdf"
## [661] "Lecture K Chromosome abnormalities 2022 pre.pdf"
## [662] "Lecture K Chromosome abnormalities 2022.pptx"
## [663] "Lecture L Bacterial genetics 2022 pre.pdf"
## [664] "Lecture L,M,N Bacterial genetics 2022.pptx"
## [665] "Lecture Predation and Herbivory Part II-1-9 (1).ppt"
## [666] "Lecture Predation and Herbivory Part II-1-9.ppt"
## [667] "lecture-introd2RStudio-with_scripts.pdf"
## [668] "Lecture02 (1).pdf"
## [669] "Lecture02 (2).pdf"
## [670] "Lecture02.pdf"
## [671] "Lecture04.pdf"
## [672] "Lecture05.pdf"
## [673] "Lecture06 (1).pdf"
## [674] "Lecture06.pdf"
## [675] "Lecture08.pdf"
## [676] "Letter of Interst Dr. Kontos.docx"
## [677] "Letter to Class of 2025 - Google Docs.pdf"
## [678] "Lib assignment .pdf"
## [679] "Libraynth (1).pdf"
## [680] "Libraynth (2).pdf"
## [681] "Libraynth (3).pdf"
## [682] "Libraynth .pdf"
## [683] "line_of_best_fit_example-tibet_allele_freq (1).pdf"
## [684] "line_of_best_fit_example-tibet_allele_freq.pdf"
## [685] "Love on the Sahel essay .pdf"
## [686] "M Gene reg bacteria 2022 pre.pdf"
## [687] "M Gene reg bacteria Fall 2021 pre.pdf"
## [688] "M Gene reg bacteria Fall 2022.pptx"
## [689] "M3.Khafre.Qin (1).pdf"
## [690] "M3.Khafre.Qin.pdf"
## [691] "Mapping the Global Muslim Population.pdf"
## [692] "meeting-91999866091.ics"
## [693] "meeting-93219672601 (1).ics"
## [694] "meeting-93219672601 (2).ics"
## [695] "meeting-93219672601 (3).ics"
## [696] "meeting-93219672601.ics"
## [697] "Meiners and Pickett 2011(3).pdf"
## [698] "Melies Brewer pp 370-371 (1).pdf"
## [699] "Melies Brewer pp 370-371 (2).pdf"
## [700] "Melies Brewer pp 370-371.pdf"
## [701] "Micro_NEW Major_Tracker 2.pdf"
## [702] "Micro_NEW Major_Tracker.pdf"
## [703] "Microsoft Word - Sharpley Tourism Notes iMac.docx"
## [704] "Microsoft Word - Sharpley Tourism Notes iMac.docx.pdf"
## [705] "Mid term Art 1 (1).docx"
## [706] "Mid term Art 1.docx"
## [707] "Mid term Pics (1).docx"
## [708] "Mid term Pics (2).docx"
## [709] "Mid term Pics.docx"
## [710] "midterm exam review - Anneliese.pdf"
## [711] "Midterm Review Material (1).pdf"
## [712] "Midterm Review Material (2).pdf"
## [713] "Midterm Review Material (3).pdf"
## [714] "Midterm Review Material Key (1).pdf"
## [715] "Midterm Review Material Key.pdf"
## [716] "Midterm Review Material.pdf"
## [717] "Midterm review sheet fall 2022.docx"
## [718] "MIles and Henry_Gender.pdf"
## [719] "Muslim_Astronomers_who_Influenced_Copern.pdf"
## [720] "Mutualism Lecture-Final-2-1-2 (1).pptx"
## [721] "Mutualism Lecture-Final-2-1-2.pptx"
## [722] "my_SNP"
## [723] "N Gene reg eukaryotes 2022 for quiz 6.pptx"
## [724] "N Gene reg eukaryotes 2022 pre.pdf"
## [725] "N Gene reg eukaryotes Fall 2021 for Quiz 7.pptx"
## [726] "Nancy Pearcey-1 (1).pdf"
## [727] "Nancy Pearcey-1.pdf"
## [728] "Nasser Abufarha_ Neil L. Whitehead_ Jo Ellen Fair_ Leigh A.Payne - The Making of a Human Bomb_ An Ethnography of Palestinian Resistance-Duke University Press Books (2009).pdf"
## [729] "NMR1-3_ChemicalShift_V2-Slides (1).pdf"
## [730] "NMR1-3_ChemicalShift_V2-Slides.pdf"
## [731] "NMR2-2_Solvents_V2-Slides (1).pdf"
## [732] "NMR2-2_Solvents_V2-Slides.pdf"
## [733] "NMR2-3_ComplexSplitting_V2-Slides.pdf"
## [734] "NMR2-4_LongRangeCoupling_V2-Slides.pdf"
## [735] "Note card 2 - Google Docs.pdf"
## [736] "Note card 2 pt2 (1).pdf"
## [737] "Note card 2 pt2 (2).pdf"
## [738] "Note card 2 pt2.odt"
## [739] "Note card 2 pt2.pdf"
## [740] "Note Feb 10, 2022 2.pdf"
## [741] "Note Feb 10, 2022 3.pdf"
## [742] "Note Feb 10, 2022 4.pdf"
## [743] "Note Feb 10, 2022.pdf"
## [744] "Note Feb 17, 2022.pdf"
## [745] "Note Feb 24, 2022.pdf"
## [746] "Nuttle et al. 2013 (1).pdf"
## [747] "Nuttle et al. 2013 (2).pdf"
## [748] "Nuttle et al. 2013 (3).pdf"
## [749] "Nuttle et al. 2013.pdf"
## [750] "O Manipulating eukaryotic genomes 2022 pre.pdf"
## [751] "O Manipulating eukaryotic genomes 2022.pptx"
## [752] "Organic Chemistry Structure and Function by K. Peter C. Vollhardt (z-lib.org).pdf"
## [753] "Packet1PERadiologyMonitoringFo.pdf"
## [754] "paley essay (1).pdf"
## [755] "paley essay (2).pdf"
## [756] "paley essay.pdf"
## [757] "Pasztory_ Thinking Things.pdf"
## [758] "Patel_Dark_Crystal.mp4"
## [759] "PCA-missing_data.Rmd"
## [760] "periodic-table.pdf"
## [761] "PeriodicTableMuted-56a12d823df78cf772682aaa.png"
## [762] "Permaculture handout (1).pdf"
## [763] "Permaculture handout (2).pdf"
## [764] "Permaculture handout.pdf"
## [765] "phage (1).fasta"
## [766] "phage (2).fasta"
## [767] "phage.fasta"
## [768] "Pham16216Report.pdf"
## [769] "Pham49320Report.pdf"
## [770] "Philo ON CREATION.pdf"
## [771] "PHYS 0111 Chapter 13 Fall 2022 (1).pdf"
## [772] "PHYS 0111 Chapter 13 Fall 2022 (2).pdf"
## [773] "PHYS 0111 Chapter 13 Fall 2022.pdf"
## [774] "PHYS 0111 Chapter 14 Fall 2022 (1).pdf"
## [775] "PHYS 0111 Chapter 14 Fall 2022 (2).pdf"
## [776] "PHYS 0111 Chapter 14 Fall 2022.pdf"
## [777] "PHYS 0111 Chapter 15 Fall 2022 (1).pdf"
## [778] "PHYS 0111 Chapter 15 Fall 2022 (2).pdf"
## [779] "PHYS 0111 Chapter 15 Fall 2022 (3).pdf"
## [780] "PHYS 0111 Chapter 15 Fall 2022.pdf"
## [781] "PHYS 0111 Chapter 18 Fall 2022.pdf"
## [782] "PHYS 0111 Chapter 19 Fall 2022.pdf"
## [783] "PHYS 0111 Chapter 20 Fall 2022.pdf"
## [784] "PHYS 0111 Chapter 21 Fall 2022.pdf"
## [785] "PHYS 0111 Chapter 22 Fall 2022 (1).pdf"
## [786] "PHYS 0111 Chapter 22 Fall 2022.pdf"
## [787] "PHYS 0111 Chapter 23 Fall 2022.pdf"
## [788] "PHYS 0111 Chapter 24 Fall 2022 (1).pdf"
## [789] "PHYS 0111 Chapter 24 Fall 2022.pdf"
## [790] "PHYS 0111 Chapter 25 Fall 2022 (1).pdf"
## [791] "PHYS 0111 Chapter 25 Fall 2022.pdf"
## [792] "PHYS 0111 Chapter 27 Fall 2022 (1).pdf"
## [793] "PHYS 0111 Chapter 27 Fall 2022.pdf"
## [794] "PHYS 0111 Equation Sheet Fall 2022 (1).pdf"
## [795] "PHYS 0111 Equation Sheet Fall 2022 (2).pdf"
## [796] "PHYS 0111 Equation Sheet Fall 2022 (3).pdf"
## [797] "PHYS 0111 Equation Sheet Fall 2022 (4).pdf"
## [798] "PHYS 0111 Equation Sheet Fall 2022 (5).pdf"
## [799] "PHYS 0111 Equation Sheet Fall 2022.pdf"
## [800] "PHYS2201-notes-ch2.pdf"
## [801] "Picture1.png"
## [802] "Pitt Ecology-Nov-2021.pptx"
## [803] "Plagiarism – The Writing Center • University of North Carolina at Chapel Hill.pdf"
## [804] "Plant Love and Plant Detectives.pptx"
## [805] "Plue et al. 2019.13201-1.pdf"
## [806] "pollutant websearch.docx"
## [807] "pond_site_analysis_template_to_fill_sp21 (1).pptx"
## [808] "pond_site_analysis_template_to_fill_sp21.pptx"
## [809] "Population Genetics Invasion Lecture Nov. 15 (1).pdf"
## [810] "Population Genetics Invasion Lecture Nov. 15.pdf"
## [811] "Portfolio Working direct"
## [812] "portfolio_ggpubr_intro-2 (1).Rmd"
## [813] "portfolio_ggpubr_intro-2 (2).Rmd"
## [814] "portfolio_ggpubr_intro-2 (3).Rmd"
## [815] "portfolio_ggpubr_intro-2 (4) (Autosaved).rmd"
## [816] "portfolio_ggpubr_intro-2 (4).Rmd"
## [817] "portfolio_ggpubr_intro-2 (5).Rmd"
## [818] "portfolio_ggpubr_intro-2 (6).Rmd"
## [819] "portfolio_ggpubr_intro-2.Rmd"
## [820] "portfolio_ggpubr_log_transformation.Rmd"
## [821] "portfolio-01-dataframe.ipynb"
## [822] "Position (m) copy.jpg"
## [823] "Post lab 3.pdf"
## [824] "Post lab 4.pdf"
## [825] "Post lab 5.pdf"
## [826] "Post lab 6.pdf"
## [827] "Post lab 7,8.pdf"
## [828] "Post lab 9.pdf"
## [829] "Pre lab 7 .pdf"
## [830] "PRE-COLONIAL AFRICAN HISTORY AND MISCONCEPTION OF AFRICA (1).pdf"
## [831] "PRE-COLONIAL AFRICAN HISTORY AND MISCONCEPTION OF AFRICA (2).pdf"
## [832] "PRE-COLONIAL AFRICAN HISTORY AND MISCONCEPTION OF AFRICA (3).pdf"
## [833] "PRE-COLONIAL AFRICAN HISTORY AND MISCONCEPTION OF AFRICA.pdf"
## [834] "Prelab Activity 3 .pdf"
## [835] "Prelab for lab 4.pdf"
## [836] "Presentation planning DWS sp21_post.docx"
## [837] "Presentation1.pptx"
## [838] "Preziosi&Farago_ Art is Not What You Think It Is (1).pdf"
## [839] "Preziosi&Farago_ Art is Not What You Think It Is.pdf"
## [840] "Princess Bride pt.2 (1).docx"
## [841] "Princess Bride pt.2 (2).docx"
## [842] "Princess Bride pt.2 (2).ps"
## [843] "Princess Bride pt.2 (3).docx"
## [844] "Princess Bride pt.2.docx"
## [845] "Princess Bride pt.2.pdf"
## [846] "Problems 5 Spring 2022 updated.pdf"
## [847] "Propp Morphology pp 24-59 (1).pdf"
## [848] "Propp Morphology pp 24-59.pdf"
## [849] "ProtonVPN.dmg"
## [850] "psf_forest_community_dynamics.pptx"
## [851] "Q Genetics of Cancer 2022.pptx"
## [852] "Quiz+1 (2).pdf"
## [853] "Quiz+1.pdf"
## [854] "Quiz+2 (2).pdf"
## [855] "Quiz+2.pdf"
## [856] "Quiz+3 (1).pdf"
## [857] "Quiz+3 (2).pdf"
## [858] "Quiz+3.pdf"
## [859] "Quiz+4 (2).pdf"
## [860] "Quiz+6.pdf"
## [861] "Quiz+7.pdf"
## [862] "Quiz+8.pdf"
## [863] "quizlog1.pdf"
## [864] "R_data_structures_vectors_intro.pdf"
## [865] "R4.3.pptx"
## [866] "reading list 9-2019"
## [867] "reading list 9-2019.zip"
## [868] "Reading List Add ons 2021"
## [869] "Reading List Add ons 2021.zip"
## [870] "Recitation 1 2021 Spring 2022.docx"
## [871] "Recitation 1.3 Q1 Table.png"
## [872] "Recitation 13 (1).pptx"
## [873] "Recitation 13 (2).pptx"
## [874] "Recitation 13.pptx"
## [875] "Recitation 6 2022.docx"
## [876] "Recitation 7 Spring 2022.docx"
## [877] "Recitation 8 Spring 2022.docx"
## [878] "Recitation 9.pptx"
## [879] "Recitation week 4.pdf"
## [880] "Recitation_Week_12_copy.pdf"
## [881] "Recitation12.pptx"
## [882] "Recitiation 29th.pdf"
## [883] "Reflection #2 (1).pdf"
## [884] "Reflection #2 (2).pdf"
## [885] "Reflection #2 (3).pdf"
## [886] "Reflection #2.pdf"
## [887] "Reflections Religion and Science (1).pdf"
## [888] "Reflections Religion and Science (2).pdf"
## [889] "Reflections Religion and Science (3).pdf"
## [890] "Reflections Religion and Science (4).pdf"
## [891] "Reflections Religion and Science (5).pdf"
## [892] "Reflections Religion and Science (6).pdf"
## [893] "Reflections Religion and Science (7).pdf"
## [894] "Reflections Religion and Science.pdf"
## [895] "Reiman CH 2(1) (2).docx"
## [896] "Reiman CH 3(2) (1).docx"
## [897] "Reiman chapter 1.docx"
## [898] "Reiman Intro (2) (1).docx"
## [899] "Reiman Intro (2).docx"
## [900] "Relfection #3 (1).docx"
## [901] "Relfection #3.docx"
## [902] "Religon and science final .docx"
## [903] "Reliogn and science paper.docx"
## [904] "removing_fixed_alleles.html"
## [905] "removing_fixed_alleles.Rmd"
## [906] "ResponseSummary.pdf"
## [907] "Resume 2.docx"
## [908] "Resume.docx"
## [909] "Review Questions Exam 3-1.pdf"
## [910] "Review Questions Exam 4.pdf"
## [911] "Rewilding Discussion Questions-4.docx"
## [912] "Ripple&Beschta_2012_BioCons-1.pdf"
## [913] "Robert J. Debry Essay.pdf"
## [914] "Ronald L. Numbers - Galileo Goes to Jail and Other Myths about Science and Religion-Harvard University Press (2009).pdf"
## [915] "rsconnect"
## [916] "RStudio-2022.07.1-554.dmg"
## [917] "RStudio-2022.07.2-576 (1).dmg"
## [918] "RStudio-2022.07.2-576.dmg"
## [919] "salt in duckweed (1) (1).pptx"
## [920] "salt in duckweed (1).pptx"
## [921] "salt in duckweed.pptx"
## [922] "sampling distribution for mean.png"
## [923] "Savage.K.Monument.Wars.on.Lin (1).pdf"
## [924] "Savage.K.Monument.Wars.on.Lin.pdf"
## [925] "Savanna Hoemcoming .pdf"
## [926] "schedule.ics"
## [927] "Schedule.pdf"
## [928] "Schnitzer and Carson 2010 (1).pdf"
## [929] "Schnitzer and Carson 2010.pdf"
## [930] "Schnitzer and Carson Study Questions-1 (1).docx"
## [931] "Schnitzer and Carson Study Questions-1.docx"
## [932] "Scholarly Literature Search_Duckweed Sp21_3.7.2021.docx"
## [933] "SCIENCE AND RELIGION Syllabus Spring 2021-1.pdf"
## [934] "Science communication .docx"
## [935] "SCIGRESS_V3.3.3.dmg"
## [936] "SCIGRESS_V3.4.3 (1).dmg"
## [937] "SCIGRESS_V3.4.3.dmg"
## [938] "Screen Shot 2020-08-25 at 9.57.06 PM.png"
## [939] "Screen Shot 2022-09-04 at 6.40.11 PM.png"
## [940] "Screen Shot 2022-10-06 at 8.12.34 PM.png"
## [941] "Screen Shot 2022-10-11 at 9.58.45 PM copy (1).jpg"
## [942] "Screen Shot 2022-10-11 at 9.58.45 PM copy.jpg"
## [943] "Screen Shot 2022-10-17 at 3.16.22 PM (1).png"
## [944] "Screen Shot 2022-10-17 at 3.16.22 PM.png"
## [945] "Screen Shot 2022-10-19 at 3.37.01 PM (1).png"
## [946] "Screen Shot 2022-10-19 at 3.37.01 PM.png"
## [947] "Screen Shot 2022-10-31 at 3.25.36 PM.png"
## [948] "Screen Shot 2022-12-01 at 4.37.11 PM.jpg"
## [949] "Screen_Shot_2021-01-27_at_1.04.23_PM.png"
## [950] "Screen_Shot_2021-02-03_at_1.34.47_PM.png"
## [951] "Screen_Shot_2021-02-03_at_12.47.59_PM (1).png"
## [952] "Screen_Shot_2021-02-03_at_12.47.59_PM.png"
## [953] "Screen_Shot_2021-02-03_at_12.49.40_PM.png"
## [954] "Screen_Shot_2021-02-10_at_2.55.15_PM.png"
## [955] "Screen_Shot_2021-02-10_at_3.01.41_PM.png"
## [956] "Screen_Shot_2021-04-14_at_1.00.15_PM.png"
## [957] "Sharpley Tourism Notes iMac.pdf"
## [958] "ShoulderBraceStudy.pdf"
## [959] "Simmons - Odd Girl Out (1).pdf"
## [960] "Simmons - Odd Girl Out.pdf"
## [961] "Slavery and emancipation lecture Handout.pdf"
## [962] "Slavery and emancipation lecture PP.pptx"
## [963] "SOC 0005 Globalization Notes 2020.docx"
## [964] "SOC 0005- A Growing Trend of Leaving America.doc"
## [965] "Soc of Tourism - Approaches, Issues & Findings (Cohen).pdf"
## [966] "Societies 2211 (1).docx"
## [967] "Societies 2211.docx"
## [968] "Sociology of Tourism Cohen .docx"
## [969] "Sociology of Tourism Notes (3).docx"
## [970] "Sociology of Tourism Notes II (1).docx"
## [971] "Socities Exam 3notes.docx"
## [972] "Socitites CH4 Remian.doc"
## [973] "SolsticeClientMac_V2A0AE303CDPCF6CI8ECC0BD6.app"
## [974] "SolsticeClientMac_V2A0AE303CDPCF6CI8ECC0BD6.zip"
## [975] "Solution to Chapter 2 End Problems.pdf"
## [976] "Special Notes on Welfare(1).docx"
## [977] "SpotifyInstaller (1).zip"
## [978] "SpotifyInstaller.zip"
## [979] "Spring 2021 Tution poayment .pdf"
## [980] "SR Online Extra Credit Opportunities.docx"
## [981] "ST_1000__Practice_Midterm_2_S22.pdf"
## [982] "STAT_1000___HW_2_S_22.pdf"
## [983] "STAT_1000___S22___1085.pdf"
## [984] "STAT_1000__Extra_Credit_S22.pdf"
## [985] "STAT_1000__HW_1_S22.pdf"
## [986] "STAT_1000__HW_4_S22.pdf"
## [987] "STAT_1000__HW_5_S22%281%29.pdf"
## [988] "STAT_1000__HW_6_S22.pdf"
## [989] "STAT_1000__HW_7_S22.pdf"
## [990] "STAT_1000__HW_8_S22.pdf"
## [991] "STAT_1000__More_on_Hypothesis_Testing_SP___Slides.pdf"
## [992] "STAT_1000__Practice_Problem__Solutions_.pdf"
## [993] "STAT_1000__Practice_Problems__Final__F21.pdf"
## [994] "STAT_1000__Quiz_10.pdf"
## [995] "STAT_1000__Quiz_11_S22.pdf"
## [996] "STAT_1000__Quiz_12_S22.pdf"
## [997] "STAT_1000__Quiz_9.pdf"
## [998] "STAT_1000_HW_3_S_22.pdf"
## [999] "Stats quiz 5.pdf"
## [1000] "steam.dmg"
## [1001] "Study Guide to Private Life of Plants - Social Struggle.pdf"
## [1002] "Study questions for Carson et al 2008.docx"
## [1003] "Study Questions for Hirsch et al. 2014FINAL-2.docx"
## [1004] "submission_99715463.pdf"
## [1005] "summary_stats.pdf"
## [1006] "Survivorship in the Natural World - Honors College Homework 3.docx"
## [1007] "Sutton et al. 2021 (1).pdf"
## [1008] "Sutton et al. 2021 (2).pdf"
## [1009] "Sutton et al. 2021.pdf"
## [1010] "Syllabus PHYS 0111 10429 Fall 2022.pdf"
## [1011] "Teamwork semester evaluation.docx"
## [1012] "Technology Preparation_Duckweed Survivor_klw1.13.21.docx"
## [1013] "TEME.pdf"
## [1014] "Templete.pdf"
## [1015] "The princess bride essay (1).pdf"
## [1016] "The princess bride essay final draft.pdf"
## [1017] "The princess bride essay.pdf"
## [1018] "Timaeus-1 (1).pdf"
## [1019] "Timaeus-1 (2).pdf"
## [1020] "Timaeus-1.pdf"
## [1021] "Time series plots from vectors (1).pdf"
## [1022] "Tropical Ecology - Top down effects-Panama-.pptx"
## [1023] "Unconfirmed 145395.crdownload"
## [1024] "Unconfirmed 174225.crdownload"
## [1025] "Unconfirmed 283790.crdownload"
## [1026] "Unconfirmed 398081.crdownload"
## [1027] "Unconfirmed 446809.crdownload"
## [1028] "Untitled document (1).pdf"
## [1029] "Untitled document (2).pdf"
## [1030] "Untitled document.pdf"
## [1031] "Untitled.docx"
## [1032] "Untitled.Rmd"
## [1033] "Vander Ven CH 1.doc"
## [1034] "Vander Ven CH 2 -3.doc"
## [1035] "Vander Ven CH 4.doc"
## [1036] "Vander Ven CH 5.doc"
## [1037] "Vander Ven CH 6.doc"
## [1038] "Vander Ven Lecture Notes Final (5).docx"
## [1039] "vcfR_test.vcf"
## [1040] "vcfR_test.vcf.gz"
## [1041] "vegan_PCA_amino_acids-STUDENT_files"
## [1042] "vegan_PCA_amino_acids-STUDENT.Rmd"
## [1043] "VidyoConnect"
## [1044] "VidyoConnectInstaller-macosx-TAG_VCOND_20_2_0_13382[p=https&h=epiccal.video.upmc.com&x=1&f=RzpJUENPOklQQ0k6TW9kOlBDOlB1YkM6Q0RSOkVQOkNQOlJQSTpCQTpOREM6Q1BSOk9BOjIyMDpQUjpTUjI6U1I=&r=BLHNysomvy].dmg"
## [1045] "VRRENEWAL.pdf"
## [1046] "Wednesday Community Master File_day14_sp21.xlsx"
## [1047] "Wee 6 academic foundations (1).pdf"
## [1048] "Wee 6 academic foundations.pdf"
## [1049] "Week 11 - Recitation (1).pdf"
## [1050] "Week 11 - Recitation.pdf"
## [1051] "Week 11 Recitation Background.pdf"
## [1052] "Week 12 (Nov 15) class slides-1 (1).pptx"
## [1053] "Week 12 (Nov 15) class slides-1.pptx"
## [1054] "Week 12 (Nov 15) full slides-1.pptx"
## [1055] "Week 12 Recitation (1).pdf"
## [1056] "Week 12 Recitation.pdf"
## [1057] "Week 14 (Nov 29) full slides.pptx"
## [1058] "Week 3 Recitation.pdf"
## [1059] "Week 3 Sept 13 class slides.pptx"
## [1060] "Week 4 Recitation.pdf"
## [1061] "Week 4 Sept 20 BLAST class slides.pptx"
## [1062] "Week 4 Sept 20 BLAST full slides.pptx"
## [1063] "Week 5 Sept 27 class slides.pptx"
## [1064] "Week 5 Study Guide & PPT Slides (1).pdf"
## [1065] "Week 5 Study Guide & PPT Slides.pdf"
## [1066] "Week 6 Oct 4 class slides.pptx"
## [1067] "Week 6 Oct 4 full slides.pptx"
## [1068] "Week 6 Study Guide SV2 (1).pdf"
## [1069] "Week 6 Study Guide SV2 (2).pdf"
## [1070] "Week 6 Study Guide SV2.pdf"
## [1071] "Week 7 Recitation.pdf"
## [1072] "Week 7 Study Guide SV.pdf"
## [1073] "Week 8 Oct 18 full slides (1).pptx"
## [1074] "Week 8 Oct 18 full slides (2).pptx"
## [1075] "Week 8 Oct 18 full slides.pptx"
## [1076] "Week 8 Population Experiment Part 2 - Guidance.pptx"
## [1077] "Week 9 (Oct 25) full slides (1).pptx"
## [1078] "Week 9 (Oct 25) full slides.pptx"
## [1079] "Welcome to FALL Semester 2211 (1).docx"
## [1080] "Welcome to FALL Semester 2211 (2).docx"
## [1081] "Welcome to FALL Semester 2211 (3).docx"
## [1082] "Welcome to FALL Semester 2211.docx"
## [1083] "What is computational biology_ (1) (1).pdf"
## [1084] "What is computational biology_ (1).pdf"
## [1085] "Why Difficult Movies Are More, Um, Difficult - The New York Times (1).pdf"
## [1086] "Why Difficult Movies Are More, Um, Difficult - The New York Times (2).pdf"
## [1087] "Why Difficult Movies Are More, Um, Difficult - The New York Times.pdf"
## [1088] "working version.xlsx"
## [1089] "Yancey Martin and Hummer - Fraternities and Rape on Campus.pdf"
## [1090] "your-ap-score-report-unofficial-copy.pdf"
## [1091] "Zallek Ecology Guest Lecture 11.16.21 (1) (1).pdf"
## [1092] "Zallek Ecology Guest Lecture 11.16.21 (1).pdf"
## [1093] "Zoom.pkg"
## [1094] "ztable.pdf"
list.files(pattern = "vcf")
## [1] "1189983.vcf"
## [2] "all_loci-1.vcf"
## [3] "all_loci.vcf"
## [4] "ALL.chr10_GRCh38.genotypes.20170504.vcf.gz"
## [5] "code_checkpoint_vcfR.html"
## [6] "code_checkpoint_vcfR.Rmd"
## [7] "vcfR_test.vcf"
## [8] "vcfR_test.vcf.gz"
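Note that pattern is interpreted as a regular expression, so "vcf" also matches code_checkpoint_vcfR.html and code_checkpoint_vcfR.Rmd. The sketch below applies a stricter pattern to a copy of the file names printed above. The vector toy_files exists only for illustration; the same pattern could presumably be passed straight to list.files().

```r
# File names copied from the list.files(pattern = "vcf") output above
toy_files <- c("1189983.vcf",
               "all_loci-1.vcf",
               "all_loci.vcf",
               "ALL.chr10_GRCh38.genotypes.20170504.vcf.gz",
               "code_checkpoint_vcfR.html",
               "code_checkpoint_vcfR.Rmd",
               "vcfR_test.vcf",
               "vcfR_test.vcf.gz")

# Keep only names that END in .vcf or .vcf.gz
# \\. matches a literal dot, (\\.gz)? makes the .gz part optional,
# and $ anchors the match to the end of the name
vcf_only <- toy_files[grepl("\\.vcf(\\.gz)?$", toy_files)]
vcf_only
```

The .html and .Rmd files drop out because "vcf" is no longer allowed to match in the middle of a name.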
Load the all_loci.vcf file into an R data object with
vcfR::read.vcfR().
bird_snps <- vcfR::read.vcfR("all_loci.vcf")
## Scanning file to determine attributes.
## File attributes:
## meta lines: 8
## header_line: 9
## variant count: 1929
## column count: 81
##
Meta line 8 read in.
## All meta lines processed.
## gt matrix initialized.
## Character matrix gt created.
## Character matrix gt rows: 1929
## Character matrix gt cols: 81
## skip: 0
## nrows: 1929
## row_num: 0
##
Processed variant 1000
Processed variant: 1929
## All variants processed
cat("Note - if this didn't work you may not have your working directory set")
## Note - if this didn't work you may not have your working directory set
Use vcfR::extract.gt() to get the genotype scores.
bird_snps_num <- vcfR::extract.gt(bird_snps,
element = "GT",
IDtoRowNames = FALSE,
as.numeric = TRUE,
convertNA = TRUE,
return.alleles = FALSE)
Transpose the data with t() so that it has the proper
orientation.
# add t()
bird_snps_num_t <- t(bird_snps_num)
Convert the matrix to a dataframe.
# add data.frame()
bird_snps_num_df <- data.frame(bird_snps_num_t)
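To see what the transpose step does to the orientation, here is a small sketch on a made-up matrix. The object toy_gt and its SNP and sample names are hypothetical and are not taken from the bird data; the matrix only mimics the layout of SNPs in rows and samples in columns.

```r
# Hypothetical toy genotype matrix: 3 SNPs (rows) x 2 samples (columns)
toy_gt <- matrix(c(0, 1, NA,
                   2, 0, 1),
                 nrow = 3, ncol = 2,
                 dimnames = list(c("snp1", "snp2", "snp3"),
                                 c("bird_A", "bird_B")))

# Transpose so that samples become rows and SNPs become columns
toy_gt_t <- t(toy_gt)

# Convert to a dataframe; the sample IDs are kept as row names
toy_gt_df <- data.frame(toy_gt_t)

dim(toy_gt)    # SNPs x samples
dim(toy_gt_t)  # samples x SNPs
```

After t(), each row is one bird, which is the orientation needed to remove individual samples.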
In order to deal with NAs you must first locate them.
In this paper, the authors state that they removed from their analysis any individual (row) that had missing values (NAs) for >50% of the SNPs. First we need to find those individuals.
NAs can be detected in R using is.na().
Let’s take a look at how many NAs are in the first row of the data.
We’ll use bracket notation of [1, ] to
look at the row.
# Add is.na() and select the first row
## using [1, ]
NAs_row_01 <- is.na(bird_snps_num_t[1, ])
is.na() returns a logical vector of
TRUE and FALSE values.
Look at the output of is.na() with
head().
# call head() on the vector NAs_row_01
head(NAs_row_01)
## [1] FALSE FALSE FALSE FALSE FALSE FALSE
In this vector, TRUE means “yes, there was an NA in this position”, and FALSE means “No, no NA there.”
The length of the vector is the length of the entire row we put into
is.na(), meaning we have a TRUE or FALSE answer for every
single value in the row.
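A tiny made-up vector makes this one-answer-per-value behavior easy to see. The names toy_row and toy_NAs below are hypothetical and only serve this sketch.

```r
# Hypothetical row of genotype scores with two missing values
toy_row <- c(0, NA, 1, 2, NA)

# is.na() answers TRUE/FALSE for every position in the vector
toy_NAs <- is.na(toy_row)

toy_NAs         # one logical value per element of toy_row
class(toy_NAs)  # the result is a logical vector
```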
We can check this with length() and logical
comparisons.
First, the length of our vector of TRUE/FALSE responses.
# Call length() on NAs_row_01
N_NAs <- length(NAs_row_01)
N_NAs
## [1] 1929
Now, the length of our original row
# Call length() on the first row of bird_snps_num_t
## use bracket notation of [1, ] to get the
## first row
length_row <- length(bird_snps_num_t[1,])
Now check that they are identical using ==
# Use a logical comparison with ==
## to confirm they are the same length
N_NAs == length_row
## [1] TRUE
We can work directly with a vector of TRUE and
FALSE values, but I find it’s easiest to first convert this
logical vector into a vector of index
values (indices) that tell us exactly where
the NAs are in the row.
We can get these indices with which(... == TRUE),
because in the vector TRUE is saying “Yes, it’s TRUE there
was an NA there.”
# Add which()
which(NAs_row_01 == TRUE)
## [1] 664 665 666 667 668 669 693 744 983 984 985 986 987 988 989
## [16] 990 1158 1159 1470 1471 1537 1901 1902 1925 1926 1927 1928 1929
This gives us a vector of index values. We’ll save the vector for later use.
# Assign the output to an object called
## i_NA_row_01
i_NA_row_01 <- which(NAs_row_01 == TRUE)
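As a side note, the == TRUE comparison is optional, since which() already reports the positions of TRUE values in a logical vector. A minimal sketch on a made-up logical vector (toy_NAs is a hypothetical name):

```r
# Hypothetical logical vector, as is.na() would return it
toy_NAs <- c(FALSE, TRUE, FALSE, FALSE, TRUE)

# Both calls return the positions where the vector is TRUE
i_long  <- which(toy_NAs == TRUE)
i_short <- which(toy_NAs)

i_long
identical(i_long, i_short)
```

Writing out == TRUE does no harm and can make the intent easier to read while learning.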
We can confirm that these parts of row 1 of our dataframe contain NAs
using the vector we made, i_NA_row_01, and bracket notation.
Let’s look at the columns that have NAs in row 1, and also see what’s in rows
2 and 3 at those same columns.
bird_snps_num_t[c(1:3), i_NA_row_01]
## [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,10] [,11]
## sample_ACAAA_Nel3 NA NA NA NA NA NA NA NA NA NA NA
## sample_ACAGTG_Nel5 0 0 0 0 0 0 0 1 0 0 0
## sample_AGCAT_Nel8 0 0 0 0 0 0 0 1 NA NA NA
## [,12] [,13] [,14] [,15] [,16] [,17] [,18] [,19] [,20] [,21]
## sample_ACAAA_Nel3 NA NA NA NA NA NA NA NA NA NA
## sample_ACAGTG_Nel5 0 0 0 0 0 NA NA 1 0 NA
## sample_AGCAT_Nel8 NA NA NA NA NA 0 1 1 0 0
## [,22] [,23] [,24] [,25] [,26] [,27] [,28]
## sample_ACAAA_Nel3 NA NA NA NA NA NA NA
## sample_ACAGTG_Nel5 0 0 0 0 0 0 0
## sample_AGCAT_Nel8 0 0 NA NA NA NA NA
Here’s a function that will look for NAs in a single row, column, or vector, and tell us the index values.
find_NAs <- function(x){
NAs_TF <- is.na(x)
i_NA <- which(NAs_TF == TRUE)
N_NA <- length(i_NA)
cat("Results:",N_NA, "NAs present\n.")
return(i_NA)
}
Let’s test it on the first row of our data.
# run find_NAs() on the first row of
## bird_snps_num_t, using bracket notation [1,]
find_NAs(bird_snps_num_t[1,])
## Results: 28 NAs present
## .
## [1] 664 665 666 667 668 669 693 744 983 984 985 986 987 988 989
## [16] 990 1158 1159 1470 1471 1537 1901 1902 1925 1926 1927 1928 1929
For our workflow, we’re going to want to find the NAs in each row of data, count them up, and if >50% of the SNPs are NA, then remove the entire row.
We could find the NAs in each row like this. First, find the NAs in row 1 and save it to a vector:
i_NAs01 <- find_NAs(bird_snps_num_t[1,])
## Results: 28 NAs present
## .
And figure out how many NAs there are with length()
length(i_NAs01)
## [1] 28
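If only the count is needed and not the positions, the same number can be obtained without which(). This is a minimal sketch on a made-up vector; toy_row and count_NAs() are hypothetical names and are not part of the workflow above.

```r
# Hypothetical row with three missing values
toy_row <- c(0, NA, 1, 2, NA, NA)

# Hypothetical helper: count NAs directly
count_NAs <- function(x) {
  # sum() treats TRUE as 1 and FALSE as 0
  sum(is.na(x))
}

n_direct <- count_NAs(toy_row)

# Same answer via the index-based route used above
n_by_index <- length(which(is.na(toy_row) == TRUE))

n_direct == n_by_index
```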
Then continue this many times:
i_NAs02 <- find_NAs(bird_snps_num_t[2,])
## Results: 20 NAs present
## .
i_NAs03 <- find_NAs(bird_snps_num_t[3,])
## Results: 28 NAs present
## .
i_NAs04 <- find_NAs(bird_snps_num_t[4,])
## Results: 24 NAs present
## .
by making a new vector for each row and updating the row number in the brackets. This would take a lot of time and be very prone to errors.
This process of working on many rows is most easily done with a common programming approach called a for loop. The name at first doesn’t make sense; what it should be called is a “do something a bunch of times” loop.
It is called a “for” loop because you tell it something along the lines of: “FOR every row in this dataframe, do this …” In our case we’ll work on rows, but it can also work on columns, or on the elements of any vector or list.
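Before applying this to the SNP data, here is about the smallest loop that follows the same pattern: make a storage vector, then fill it in one position at a time. The names toy_values and toy_doubled are hypothetical.

```r
# Hypothetical input values
toy_values <- c(10, 20, 30)

# Storage vector of zeros, one slot per input value
toy_doubled <- rep(x = 0, times = length(toy_values))

# FOR every position i in toy_values, do this ...
for (i in seq_along(toy_values)) {
  # compute something for element i and store it in slot i
  toy_doubled[i] <- toy_values[i] * 2
}

toy_doubled
```

The index i takes the values 1, 2, 3 in turn, so each pass through the loop fills exactly one slot.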
I’ll write a for() loop that will go through each row of
our SNP data and determine if >50% of the values are NA.
I’ll use the find_NAs() function we just made.
To do this I’m going to want to have a few things:
N_rows: The total number of rows of data in our dataframe
N_NA: A vector to hold how many NAs are in each row
N_SNPs: The total number of columns, so we can determine if >50% of the columns (SNPs) are NAs.
We can get the number of rows with nrow():
# call nrow() on bird_snps_num_t
N_rows <- nrow(bird_snps_num_t)
We can make a vector to store how many NAs are in each row like this,
where rep() repeats 0 for us as many times as we want.
N_NA <- rep(x = 0, times = N_rows)
I can get the number of SNPs with ncol()
# call ncol() on bird_snps_num_t
N_SNPs <- ncol(bird_snps_num_t)
The percentage of SNPs that are NA can be found as
length(i_NAs01)/N_SNPs*100
## [1] 1.451529
If we were doing this by hand we’d have to fill in the vector of the
number of NAs (N_NA) like this:
# Number of NAs in row 1
i_NAs01 <- find_NAs(bird_snps_num_t[1,])
## Results: 28 NAs present
## .
N_NA[1] <- length(i_NAs01)
# Number of NAs in row 2:
i_NAs02 <- find_NAs(bird_snps_num_t[2,])
## Results: 20 NAs present
## .
N_NA[2] <- length(i_NAs02)
# Number of NAs in row 3:.
# ... etc
That would not be fun. So we automate the process with a
for() loop. I won’t explain right now how the whole thing works,
but take a look to get the general sense of what it is.
I’ll repeat the previous preparation code so it’s all in one place.
# N_rows
# number of rows (individuals)
N_rows <- nrow(bird_snps_num_t)
# N_NA
# vector to hold output (number of NAs)
N_NA <- rep(x = 0, times = N_rows)
# N_SNPs
# total number of columns (SNPs)
N_SNPs <- ncol(bird_snps_num_t)
# the for() loop
for(i in 1:N_rows){
# for each row, find the location of
## NAs with find_NAs()
i_NA <- find_NAs(bird_snps_num_t[i,])
# then determine how many NAs
## with length()
N_NA_i <- length(i_NA)
# then save the output to
## our storage vector
N_NA[i] <- N_NA_i
}
## Results: 28 NAs present
## .Results: 20 NAs present
## .Results: 28 NAs present
## .Results: 24 NAs present
## .Results: 23 NAs present
## .Results: 63 NAs present
## .Results: 51 NAs present
## .Results: 38 NAs present
## .Results: 34 NAs present
## .Results: 24 NAs present
## .Results: 48 NAs present
## .Results: 21 NAs present
## .Results: 42 NAs present
## .Results: 78 NAs present
## .Results: 45 NAs present
## .Results: 21 NAs present
## .Results: 42 NAs present
## .Results: 34 NAs present
## .Results: 66 NAs present
## .Results: 54 NAs present
## .Results: 59 NAs present
## .Results: 52 NAs present
## .Results: 47 NAs present
## .Results: 31 NAs present
## .Results: 63 NAs present
## .Results: 40 NAs present
## .Results: 40 NAs present
## .Results: 22 NAs present
## .Results: 60 NAs present
## .Results: 48 NAs present
## .Results: 961 NAs present
## .Results: 478 NAs present
## .Results: 59 NAs present
## .Results: 26 NAs present
## .Results: 285 NAs present
## .Results: 409 NAs present
## .Results: 1140 NAs present
## .Results: 600 NAs present
## .Results: 1905 NAs present
## .Results: 25 NAs present
## .Results: 1247 NAs present
## .Results: 23 NAs present
## .Results: 750 NAs present
## .Results: 179 NAs present
## .Results: 433 NAs present
## .Results: 123 NAs present
## .Results: 65 NAs present
## .Results: 49 NAs present
## .Results: 192 NAs present
## .Results: 433 NAs present
## .Results: 66 NAs present
## .Results: 597 NAs present
## .Results: 1891 NAs present
## .Results: 207 NAs present
## .Results: 41 NAs present
## .Results: 268 NAs present
## .Results: 43 NAs present
## .Results: 110 NAs present
## .Results: 130 NAs present
## .Results: 90 NAs present
## .Results: 271 NAs present
## .Results: 92 NAs present
## .Results: 103 NAs present
## .Results: 175 NAs present
## .Results: 31 NAs present
## .Results: 66 NAs present
## .Results: 64 NAs present
## .Results: 400 NAs present
## .Results: 192 NAs present
## .Results: 251 NAs present
## .Results: 69 NAs present
## .Results: 58 NAs present
## .
My vector N_NA now holds the number of NAs in each row
of the dataframe.
head(N_NA)
## [1] 28 20 28 24 23 63
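For comparison, the same per-row counts can be obtained without a loop, because is.na() works on a whole matrix and rowSums() adds up the TRUE values in each row. This is a sketch on a made-up matrix (toy_snps and the bird names are hypothetical), not a replacement for the loop above, which is there to practice for().

```r
# Hypothetical 3 birds x 3 SNPs matrix with some missing values
toy_snps <- matrix(c(0,  NA, 1,
                     NA, NA, 2,
                     1,  0,  0),
                   nrow = 3, byrow = TRUE,
                   dimnames = list(c("bird_A", "bird_B", "bird_C"), NULL))

# Loop version, following the pattern used above
toy_N_NA_loop <- rep(x = 0, times = nrow(toy_snps))
for (i in 1:nrow(toy_snps)) {
  toy_N_NA_loop[i] <- length(which(is.na(toy_snps[i, ])))
}

# Vectorized version: count TRUE values per row
toy_N_NA_vec <- rowSums(is.na(toy_snps))

# Percent missing per row in one step
toy_percent_NA <- rowMeans(is.na(toy_snps)) * 100

all(toy_N_NA_loop == toy_N_NA_vec)
```

A side benefit of the vectorized version is that it avoids the printed message that find_NAs() emits on every pass through the loop.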
We can get a sense of how many NAs there are in the rows of the dataset by making a histogram. Most rows have very few, and a few have a lot:
# Call hist() on N_NA
hist(N_NA)
The authors of the bird speciation paper decided to remove any row that was >50% NAs. There are 1929 SNPs, so 50% is about 964 SNPs.
# total number of columns
N_SNPs
## [1] 1929
# 50% of N_SNPs
cutoff50 <- N_SNPs*0.5
I can add a line for the cutoff to the plot with
abline()
# Call hist() on N_NA
## then add a vertical line at the cutoff value
## using abline()
hist(N_NA)
abline(v = cutoff50,
col = 2,
lwd = 2,
lty = 2)
After figuring out how many NAs there are in each row, I can convert this to a percent.
percent_NA <- N_NA/N_SNPs*100
I can plot these percentages and set the cutoff at 50 for 50%
# Call hist() on percent_NA
## then add a vertical line at 50%
## using abline()
hist(percent_NA)
abline(v = 50,
col = 2,
lwd = 2,
lty = 2)
I can determine the index value of each row with >50% NAs using
which()
# Call which() on percent_NA
i_NA_50percent <- which(percent_NA > 50)
I use length() to see how many there are
# call length() on i_NA_50percent
length(i_NA_50percent)
## [1] 4
The index values happen to be:
i_NA_50percent
## [1] 37 39 41 53
There are 4 rows where more than 50% of the columns contain an NA.
In the paper they say they removed 6, and I’m not sure where the discrepancy comes from. In order to get up to 6 birds, I need to decrease the threshold to 38% missing.
which(percent_NA > 38)
## [1] 31 37 39 41 43 53
length(which(percent_NA > 38))
## [1] 6
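Because the cutoff is a researcher degree of freedom, it can be useful to see how the number of removed birds changes with the threshold. The following is a minimal sketch: the NA counts are typed in from the for() loop output above so that the block runs on its own, and the vector thresholds is a hypothetical set of cutoffs chosen for illustration. It is not part of the authors’ workflow.

```r
# NA counts per row, copied from the for() loop output above
N_NA <- c(28, 20, 28, 24, 23, 63, 51, 38, 34, 24,
          48, 21, 42, 78, 45, 21, 42, 34, 66, 54,
          59, 52, 47, 31, 63, 40, 40, 22, 60, 48,
          961, 478, 59, 26, 285, 409, 1140, 600, 1905, 25,
          1247, 23, 750, 179, 433, 123, 65, 49, 192, 433,
          66, 597, 1891, 207, 41, 268, 43, 110, 130, 90,
          271, 92, 103, 175, 31, 66, 64, 400, 192, 251,
          69, 58)

# Total number of SNPs (columns), as reported above
N_SNPs <- 1929
percent_NA <- N_NA / N_SNPs * 100

# Hypothetical cutoffs to compare (percent missing)
thresholds <- c(25, 38, 50, 60)

# For each cutoff, count the rows that exceed it
n_removed <- sapply(thresholds, function(th) sum(percent_NA > th))
names(n_removed) <- paste0(">", thresholds, "%")
n_removed

# Percent missing for row 31, rounded to one decimal place
round(percent_NA[31], 1)
```

Row 31, with 961 of 1929 SNPs missing, sits just under the 50% cutoff, which shows how sensitive the number of removed birds is to the exact threshold.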
I can remove the rows of data with >50% missing using negative indexing.
bird_snps_num_t02 <- bird_snps_num_t[-i_NA_50percent, ]
I always need to check to make sure the previous and current data make sense.
dim(bird_snps_num_t)
## [1] 72 1929
dim(bird_snps_num_t02)
## [1] 68 1929
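One caveat with negative indexing is worth a sketch: if which() finds no rows above the cutoff, the index vector is empty, and indexing with a negated empty vector returns zero rows rather than all of them. That is not a problem here, since 4 rows were found, but a logical index avoids the issue in general. The objects below are hypothetical toy data.

```r
# Hypothetical toy data: 4 samples x 2 SNPs, none above the cutoff
toy_snps <- matrix(1:8, nrow = 4)
toy_percent_NA <- c(0, 10, 20, 30)

# which() finds no rows, so the index vector is empty
i_drop <- which(toy_percent_NA > 50)

# negative indexing with an empty vector keeps NO rows
dropped_by_index <- toy_snps[-i_drop, , drop = FALSE]
dim(dropped_by_index)

# a logical index keeps every row that passes the filter
keep <- toy_percent_NA <= 50
kept_by_logical <- toy_snps[keep, , drop = FALSE]
dim(kept_by_logical)
```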
In our workflow, information about the samples, like their population of origin, is embedded in the row names of the dataframe. (In contrast, VCF files from the 1000 Genomes Project come with a separate file containing this information.)
It’s going to become necessary in subsequent assignments to access this information, for example to color-code plots. It takes a little bit of code to get the information out of these row names, so I’m not going to dig into it here except to say I’m using regular expressions, a pattern-matching syntax for text, to edit the row names.
First, let’s look at the row names.
# call row.names() on bird_snps_num_t
row_names <- row.names(bird_snps_num_t)
# call head() on row_names
head(row_names)
## [1] "sample_ACAAA_Nel3" "sample_ACAGTG_Nel5" "sample_AGCAT_Nel8"
## [4] "sample_ATGAAAC_Nel10" "sample_ATGAAAC_Nel15" "sample_CGATGT_Nel4"
The individual samples are called things like “Nel3” and “Cau10”. The numbers are ID numbers of the individual birds that DNA was collected from, and the letters are the populations.
I can use regular expressions (in this case a function called
gsub()) to remove the stuff like “sample_ACAAA_” before the
things I want. First I’ll remove the “sample_” (don’t worry about the
exact details of how the function works).
# use gsub() to replace "sample_" with nothing in row_names
row_names02 <- gsub("sample_","",row_names)
# look at the output using head()
head(row_names02)
## [1] "ACAAA_Nel3" "ACAGTG_Nel5" "AGCAT_Nel8" "ATGAAAC_Nel10"
## [5] "ATGAAAC_Nel15" "CGATGT_Nel4"
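For readers who do want a peek at how gsub() works, here is a small sketch on made-up strings (the vector x is hypothetical). The call is gsub(pattern, replacement, x), where the pattern is a regular expression and every match in each string is replaced.

```r
# hypothetical example strings, made up for illustration
x <- c("sample_AAA_Nel1", "sample_CCC_Cau2")

# replace every occurrence of the literal text "sample_" with nothing
no_prefix <- gsub("sample_", "", x)
no_prefix

# "^" anchors the pattern to the start of the string,
# so only a leading "sample_" would be removed
no_prefix_anchored <- gsub("^sample_", "", x)
no_prefix_anchored

# "[0-9]" matches any single digit and "+" means one or more of them
no_digits <- gsub("[0-9]+", "", x)
no_digits
```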
Now I’ll get rid of the As, Cs, Ts and Gs, along with the underscore that follows them (I’m not yet sure what those letters actually are). This gives me a unique combination of a population code and number for each sample. (Again, we won’t worry about the code within gsub().)
# clean up the character data:
# remove the leading run of A/C/G/T letters and the underscore after it
sample_id <- gsub("^[ACGT]+_", "", row_names02)
# look at the output
head(sample_id)
## [1] "Nel3"  "Nel5"  "Nel8"  "Nel10" "Nel15" "Nel4"
Now I want a vector with just the population code, so I’ll use gsub() to get rid of the numbers. (Again, don’t worry about the details of what’s in gsub().)
# use gsub() to remove the digits from sample_id
pop_id <- gsub("[0-9]+",
"",
sample_id)
The function table() summarizes the output for me.
# call table() on pop_id
table(pop_id)
## pop_id
## Alt Cau Div Nel Sub
##  15  13  15  15  14
As noted above, the authors say they removed 6 rows because of NAs, but I only got 4. It’s actually pretty common for things reported in a paper to diverge from what you’re able to replicate from the data: somewhere, something minor got misreported or left out of a file. But I do always like to try to figure out what’s going on.
The authors state:
“We obtained blood samples from 75 Ammodramus sparrows … 15 from each of five putative subspecies. … Due to missing data, we removed six individuals (four of which were from [the subspecies] subvirgatus ), resulting in the analysis of 69 individuals (11–15 individuals per population).”
Now that I’ve extracted the information on the samples I can see what samples they provided in their data.
Again, I can summarize the vector of population names with
table().
length(pop_id)
## [1] 72
table(pop_id)
## pop_id
## Alt Cau Div Nel Sub
##  15  13  15  15  14
Right away I can see I have only 72 samples versus their 75, so 3 are missing. There are also only 13 in the “Cau” category and 14 in “Sub”, rather than 15 each. So the .vcf file they provided is either missing a few birds, or something is happening when vcfR loads the file to kick out some rows, perhaps due to data quality. I could open up the .vcf file in a text editor to check this out if I wanted.
When locating rows with NAs I created a vector of the rows with >50% NAs:
i_NA_50percent
## [1] 37 39 41 53
I can compare this to my vector of sample IDs using brackets:
sample_id[i_NA_50percent]
## [1] "Sub3"  "Sub8"  "Sub4"  "Cau10"
In the paper they say of the six samples they removed, “four of which were from [the subspecies] subvirgatus”. I have 3 samples called “Sub”, which are probably 3 of those 4 samples, and one “Cau.”
In total, I’m missing 1 Sub with >50% missing data, 1 Cau with >50% missing data, and 1 Cau with <50% missing data. I’m not sure what’s going on, but I doubt this will impact the analysis.
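The bookkeeping above can be sketched in a few lines. The vector flagged holds the population-and-number part of the four flagged sample IDs, and observed holds the per-population totals obtained by summing the table() output by population code. The object names are hypothetical, and the design value of 15 per subspecies comes from the quoted passage of the paper.

```r
# population-and-number part of the four flagged sample IDs
flagged <- c("Sub3", "Sub8", "Sub4", "Cau10")

# strip the digits to leave just the population code
flagged_pop <- gsub("[0-9]+", "", flagged)

# count flagged samples per population
flagged_counts <- table(flagged_pop)
flagged_counts

# samples present in the data per population, versus 15 sampled per subspecies
observed <- c(Alt = 15, Cau = 13, Div = 15, Nel = 15, Sub = 14)
absent_from_file <- 15 - observed
absent_from_file
```

This gives 3 Sub and 1 Cau among the flagged rows, and 2 Cau and 1 Sub absent from the file, which matches the 3 samples that separate 72 from 75.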