Figure 1. A female ruffed grouse flaring her neck feathers in a defensive display (Stedman, 2015).
This study aims to examine the mosquito communities of New York State in a mostly exploratory fashion. Given the complexity of the dataset and the system it is attempting to represent, follow-up studies will be required to confirm any patterns found here.
However, this study will test the provisional hypotheses that:
Recognizing state-wide trends in ecological relationships, instead of simply trying to eliminate Culex or Culiseta mosquitoes in populated areas, could allow the DEC to adopt a more holistic approach to mosquito control. Possible outcomes include the redistribution of WNV monitoring resources to high-risk forest environments and the implementation of specific and nuanced mosquito-reduction practices. While these practices may include the use of species-level techniques such as releasing sterile males, niche-level practices such as destroying oviposition areas near hiking trails may also provide protection (Weng et al., 2024).
\[\\[0.1in]\]
\[\\[0.1in]\]
Figure 2. The 35 mosquito trapping sites in relation to the DEC ecoregions of New York State.
All tables presented in this report are
reactables. Column widths are resizable and data can be
sorted alphabetically by column through clicking on the column
header.
Table 1. Mosquito trapping data named Mosquito.csv.
Table 2. CDC West Nile Virus Reports by county named
County.csv.
All 35 sites were sampled for both years but six changed forest type from 2023 to 2024. The reasons for these changes are unknown.
Adirondack Ecological Center went from
Coniferous/Northern Hardwoods to Coniferous.
Cranberry Mountain went from Mixed Oak to Northern
Hardwoods.
Manguap Valley went from Northern Hardwoods to Mixed
Oak.
Sargent Pond and Titusville Mountain
went from Northern Hardwoods to Coniferous/Northern Hardwoods.
Tug Hill went from Coniferous to Northern
Hardwoods.
Wilcox Lake went from Coniferous/Northern Hardwoods to
Northern Hardwoods.
Figure 3. Choropleth maps of presumed viremic donors (a) and reported symptomatic human cases (b) in New York State by DEC ecoregion. Data used encompass CDC reports from 1999-2023.
specnumber function
reporting each of the 25 species at every site across both of the
sampling years. The trapping effort was sufficient to detect all likely
mosquito species, as neither the specpool nor
estimateR functions predicted any undetected species at any
site at each year. No notably dominant or rare species were found at any
site and no site saw more than 100 individuals of a species trapped per
night. Mosquito totals and species richness did not vary significantly
between 2023 and 2024, but slightly more mosquitoes (+603) were
collected the second year.\[\\[0.03in]\] Table 3. Names, count data, and bionomic information for the 25 mosquito species observed in this study.
reactable of WNV data, mosquito collection
data, and mosquito names and characteristics.specnumber) and alpha
diversity (diversity) for each site. If differences are
found, move to ecoregion, WNV risk level, and forest type.specpool and estimateR.adonis2 to measure mosquito community dissimilarity
among sites. If differences are found, move to ecoregion, WNV risk
level, and forest type.rda and determine the amount of variation
explained by each axis using eigenvalues (eigenvals).metaMDS to clarify the role of significant species.envfit to plot species with a significant effect
(p<0.001) on community dissimilarity.RVAideMemoire.dispersiontest.MASS and compare
to other models using AIC and chi squared.\[\\[0.25in]\]
1. Alpha diversity will be highest in sites with low/medium WNV risk and in ecotones.
While alpha diversity (as measured by the Shannon Weiner index) varied significantly by WNV risk category (p < 0.01), no significant differences were found among forest types. The analysis indicates that low and medium WNV risk sites had similar Shannon Weiner indices that were significantly higher than no-risk sites (Figure 4). However, the high-risk sites were not significantly different from any other risk level. This finding may be due to the small sample size of high-risk sites (n=3), or the effect of increased mosquito diversity on WNV ecology. While the relationship between mosquito diversity and WNV risk is initially positive, there is a tipping point after which the correlation becomes negative (Chaves et al., 2011).
Figure 4. Sampling sites grouped by WNV risk and plotted by average Shannon Weiner index.
\[\\[0.01in]\]
2. If mosquito communities vary in composition, differences will be greatest among WNV risk categories and least among ecoregions.
As with the first hypothesis, the WNV risk categories showed significant differences in community composition (p = 0.001), but no significant differences were found among forest types or ecoregions. While environmental variables such as forest type and geographic location were expected to have a greater impact, the site selection process may have masked such differences. All sites were selected because they had similar characteristics: early successional forest with a known ruffed grouse population. Although more research would be required to verify this theory: the selection pressures of colonizing a recently disrupted area may drive the site’s vegetation to adopt strategies more similar to other early successional forests than to comparable established forests. If this pressure exists, it is also possible that vegetative features used by mosquitoes, such as stumps, flowers, or muddy tree roots, are less varied among the sampling sites than among comparable mature forests. Future directions for study should include sampling in mature forests, as well as a more detailed survey of plant life. Including species such as reeds, herbaceous flowering plants, and algae may also provide insight into mosquito communities, as these resources are used in different ways by different species.
3. If differences in mosquito communities are found, no single genus or species will drive the separation.
This hypothesis was proven correct, as four species were found to be significant drivers of variation among sites with differing WNV risk levels: Aedes vexans (Ae.vex), Anopheles punctipennis (An.pun), Culiseta melanura (Cs.mel), and Ochlerotatus trivittatus (Oc.triv) (Figures 5,6). These species belong to four different genera and display varied lifestyles, supporting the notion that no single species or “type” of mosquito is responsible for WNV risk. The speculative roles of each species are described further in the “Mosquitoes” tab.
Figure 5. Principal Component Analysis of sampling sites by WNV risk level. Points represent sites while arrows represent mosquito species. The length and direction of the arrows indicates the amount and direction of site variation each species is responsible for. The axes are labeled with the percentage of site variation they can account for.
Figure 6. Non-metric Multidimensional Scaling analysis of differences among WNV risk levels and the species that drive such variation. Risk levels are represented by ellipses, although not enough high-risk sites were sampled to generate an ellipsis. Only the most significant mosquito species (p=0.001), were included.
\[\\[0.01in]\]
4. As mosquito prevalence increases, WNV risk level will also increase.
Sites with no WNV risk were found to have a lower total than low-risk sites, but no other significant differences were found (Figure 7). This result is difficult to interpret. The finding seems to indicate that there is some threshold of mosquito richness that must be met for WNV to present a danger, but medium/high risk sites did not differ from either low or no risk sites. Looking at the most significant mosquito species may offer some insight. The suspected bridge vectors also showed lower totals in no-risk sites than low-risk sites (Figure 8). This repeated pattern may indicate that a certain density of bridge vectors is required for spillover events to become possible, but complex interactions among hosts, mosquitoes, and the environment determine the likelihood of such events. Fully describing these interactions is outside the scope of this study.
Figure 7. Average total mosquito counts per site per day, grouped by WNV risk level.
Figure 8. Average total Ochlerotatus trivittatus counts (a.) and Aedes vexans (b.) per site per day, grouped by WNV risk level.
\[\\[0.01in]\]
Major caveats:
Many of the methods used in this study displayed red flags of poor explanatory power. The NMDS and PCA used to visualize differences among mosquito communities were in accordance with each other, but fit the data worse than is typically accepted. The PCA created one axis that explained 22.81% of variation and another that only explained 9.3%, leaving the vast majority of variation unaccounted for. Similarly, the NMDS model had a stress level of 0.2444585, which is above the usually-accepted threshold of 0.2. While factors such as sample size and the complexity of WNV ecology could cause these signs to appear in representative models, more complex models that account for random effects, such as Generalized Linear Mixed Models (GLMMs), should be conducted as follow-up. A GLMM could also account for the totals of individual mosquito species. As the negative binomial model did not fit the species count data well, the associations between species populations and WNV risk levels should be double-checked. This study did not involve GLMMs as this work was mostly exploratory and the need for more sophisticated models was not anticipated.
In addition, the mosquito community data have been interpreted with the assumption that the four-day sampling windows in 2023 and 2024 were representative of mosquito diversity and richness for the entire time season However, the striking uniformity of mosquito species richness among sites, regions, and years was unexpected and may indicate that the sampling window was not representative. These mosquito species do not hatch and mature simultaneously; their varied reproductive and overwintering strategies lead to staggered population peaks from late spring to early fall. Regular sampling throughout spring-fall would likely reveal greater spatiotemporal differences.
The four mosquito species identified as significant drivers of WNV risk variation can be divided into three categories: bridge vectors, main vectors, and unknowns.
Bridge Vectors
\[\\[0.01in]\]
Main Vectors
\[\\[0.01in]\]
Unknowns
\[\\[0.01in]\]
Control
| Package | Version | Citation |
|---|---|---|
| AER | 1.2.14 | @AER |
| base | 4.4.3 | @base |
| emmeans | 1.11.0 | @emmeans |
| ggrepel | 0.9.6 | @ggrepel |
| ggsignif | 0.6.4 | @ggsignif |
| ggthemes | 5.1.0 | @ggthemes |
| knitr | 1.50 | @knitr2014; @knitr2015; @knitr2025 |
| maps | 3.4.2.1 | @maps |
| MASS | 7.3.64 | @MASS |
| patchwork | 1.3.0 | @patchwork |
| reactable | 0.4.4 | @reactable |
| rmarkdown | 2.29 | @rmarkdown2018; @rmarkdown2020; @rmarkdown2024 |
| rsconnect | 1.3.4 | @rsconnect |
| RVAideMemoire | 0.9.83.7 | @RVAideMemoire |
| tidyverse | 2.0.0 | @tidyverse |
| vegan | 2.6.10 | @vegan |
| viridis | 0.6.5 | @viridis |
Citation table generated by grateful:
Rodriguez-Sanchez F, Jackson C (2024). grateful: Facilitate citation of R packages. https://pakillo.github.io/grateful/.
Apart from help pages and vignettes for the packages used in this report, the following websites were consulted:
Bartlett, J. (2021, April 2). Deviance goodness of fit test for
Poisson regression. The Stats Geek. https://thestatsgeek.com/2014/04/26/deviance-goodness-of-fit-test-for-poisson-regression/
Examples. (n.d.). https://glin.github.io/reactable/articles/examples.html#theming
How to calculate and visualize Z scores in R. (2023, May 15). https://www.codingthepast.com/2023/05/15/Z-score-in-R.html
Lam. (2020, November 24). Non-parametric: differences in multiple
groups. https://lamurian.rbind.io/note/biostat-ukrida/09-nonpar-meandiff-2/
Negative Binomial Regression | R Data Analysis Examples. (n.d.). https://stats.oarc.ucla.edu/r/dae/negative-binomial-regression/
R: What is data-masking and why do I need {{? (n.d.). https://search.r-project.org/CRAN/refmans/rlang/html/topic-data-mask.html
RPubs - vegan cheatsheet. (n.d.). https://www.rpubs.com/an-bui/vegan-cheat-sheet
Stackoverflow and the subreddits r/rstats and r/statistics were also used.
No AI was used in the creation of this report.
Andreadis, T. G., Armstrong, P. M., Anderson, J. F., & Main, A. J. (2014). Spatial-Temporal Analysis of Cache Valley Virus (Bunyaviridae: Orthobunyavirus ) Infection in Anopheline and Culicine Mosquitoes (Diptera: Culicidae) in the Northeastern United States, 1997–2012. Vector-Borne and Zoonotic Diseases, 14(10), 763–773. https://doi.org/10.1089/vbz.2014.1669
Armstrong, P. M., & Andreadis, T. G. (2022). Ecology and Epidemiology of Eastern Equine Encephalitis Virus in the Northeastern United States: An Historical Perspective. Journal of Medical Entomology, 59(1), 1–13. https://doi.org/10.1093/jme/tjab077
Baum, S. G. (2008). Zoonoses-with friends like this, who needs enemies? Transactions of the American Clinical and Climatological Association, 119, 39–51; discussion 51-52.
Bialosuknia, S. M., Dupuis Ii, A. P., Zink, S. D., Koetzner, C. A., Maffei, J. G., Owen, J. C., Landwerlen, H., Kramer, L. D., & Ciota, A. T. (2022). Adaptive evolution of West Nile virus facilitated increased transmissibility and prevalence in New York State. Emerging Microbes & Infections, 11(1), 988–999. https://doi.org/10.1080/22221751.2022.2056521
Briegel, H., Waltert, A., & Kuhn, R. (2001). Reproductive Physiology of Aedes ( Aedimorphus ) vexans (Diptera: Culicidae) in Relation to Flight Potential. Journal of Medical Entomology, 38(4), 557–565. https://doi.org/10.1603/0022-2585-38.4.557
Burkett-Cadena, N. D. (2013). Mosquitoes of the southeastern United States. University of Alabama Press. Cahill, M. E., Yao, Y., Nock, D., Armstrong, P. M., Andreadis, T. G., Diuk-Wasser, M. A., & Montgomery, R. R. (2017). West Nile Virus Seroprevalence, Connecticut, USA, 2000–2014. Emerging Infectious Diseases, 23(4), 708–710. https://doi.org/10.3201/eid2304.161669
Carson, P. J., Borchardt, S. M., Custer, B., Prince, H. E., Dunn-Williams, J., Winkelman, V., Tobler, L., Biggerstaff, B. J., Lanciotti, R., Petersen, L. R., & Busch, M. P. (2012). Neuroinvasive Disease and West Nile Virus Infection, North Dakota, USA, 1999–2008. Emerging Infectious Diseases, 18(4), 684–686. https://doi.org/10.3201/eid1804.111313
CDC. (2025, January 14). Current Year Data (2024). West Nile Virus. https://www.cdc.gov/west-nile-virus/data-maps/current-year-data.html
Chapagain, B. P., & Poudyal, N. C. (2020). Economic benefit of wildlife reintroduction: A case of elk hunting in Tennessee, USA. Journal of Environmental Management, 269, 110808. https://doi.org/10.1016/j.jenvman.2020.110808
Chaves, L. F., Hamer, G. L., Walker, E. D., Brown, W. M., Ruiz, M. O., & Kitron, U. D. (2011). Climatic variability and landscape heterogeneity impact urban mosquito diversity and vector abundance and infection. Ecosphere, 2(6), art70. https://doi.org/10.1890/ES11-00088.1
Darsie, R. F., & Ward, R. A. (with Chang, C. C., & Litwak, T.). (2005). Identification and geographical distribution of the mosquitos of North America, north of Mexico. University Press of Florida.
Downs, C., & Stewart, K. (2014). A primer in ecoimmunology and immunology for wildlife research and management. California Fish and Game, 100(3), 371–395.
Duryea, R. D. (1990). Aedes trivittatus in New Jersey. Proceedings of the New Jersey Mosquito Control Association, 73–78. https://vectorbio.rutgers.edu/outreach/species/sp12.htm
Dusfour, I., & Chaney, S. C. (2021). Mosquito control. In M. Hall & D. Tamïr, Mosquitopia (1st ed., pp. 213–233). Routledge. https://doi.org/10.4324/9781003056034-19
Eidson, M. (2001). Dead Bird Surveillance as an Early Warning System for West Nile Virus. Emerging Infectious Diseases, 7(4), 631–635. https://doi.org/10.3201/eid0704.010405
Eritja, R., Palmer, J. R. B., Roiz, D., Sanpera-Calbet, I., & Bartumeus, F. (2017). Direct Evidence of Adult Aedes albopictus Dispersal by Car. Scientific Reports, 7(1), 14399. https://doi.org/10.1038/s41598-017-12652-5
Findlater, A., & Bogoch, I. I. (2018). Human Mobility and the Global Spread of Infectious Diseases: A Focus on Air Travel. Trends in Parasitology, 34(9), 772–783. https://doi.org/10.1016/j.pt.2018.07.004
George, T. L., Harrigan, R. J., LaManna, J. A., DeSante, D. F., Saracco, J. F., & Smith, T. B. (2015). Persistent impacts of West Nile virus on North American bird populations. Proceedings of the National Academy of Sciences, 112(46), 14290–14294. https://doi.org/10.1073/pnas.1507747112
Gorris, M. E., Bartlow, A. W., Temple, S. D., Romero-Alvarez, D., Shutt, D. P., Fair, J. M., Kaufeld, K. A., Del Valle, S. Y., & Manore, C. A. (2021). Updated distribution maps of predominant Culex mosquitoes across the Americas. Parasites & Vectors, 14(1), 547. https://doi.org/10.1186/s13071-021-05051-3
Heidecke, J., Lavarello Schettini, A., & Rocklöv, J. (2023). West Nile virus eco-epidemiology and climate change. PLOS Climate, 2(5), e0000129. https://doi.org/10.1371/journal.pclm.0000129
Hill, A. P., Prince, P., Snaddon, J. L., Doncaster, C. P., & Rogers, A. (2019). AudioMoth: A low-cost acoustic device for monitoring biodiversity and the environment. HardwareX, 6, e00073. https://doi.org/10.1016/j.ohx.2019.e00073
Howard, J. J., White, D. J., & Muller, S. L. (1989). Mark-Recapture Studies on the Culiseta (Diptera: Culicidae) Vectors of Eastern Equine Encephalitis Virus. Journal of Medical Entomology, 26(3), 190–199. https://doi.org/10.1093/jmedent/26.3.190
Jensen, T., Dritz, D. A., Fritz, G. N., Washino, R. K., & Reeves, W. C. (1998). Lake Vera revisited: Parity and survival rates of Anopheles punctipennis at the site of a malaria outbreak in the Sierra Nevada foothills of California. The American Journal of Tropical Medicine and Hygiene, 59(4), 591–594. https://doi.org/10.4269/ajtmh.1998.59.591
Komar, N., Langevin, S., Hinten, S., Nemeth, N., Edwards, E., Hettler, D., Davis, B., Bowen, R., & Bunning, M. (2003). Experimental infection of North American birds with the New York 1999 strain of West Nile virus. Emerging Infectious Diseases, 9(3), 311–322. https://doi.org/10.3201/eid0903.020628
Martínez-de La Puente, J., Ferraguti, M., Ruiz, S., Roiz, D., Llorente, F., Pérez-Ramírez, E., Jiménez-Clavero, M. Á., Soriguer, R., & Figuerola, J. (2018). Mosquito community influences West Nile virus seroprevalence in wild birds: Implications for the risk of spillover into human populations. Scientific Reports, 8(1), 2599. https://doi.org/10.1038/s41598-018-20825-z
May, F. J., Davis, C. T., Tesh, R. B., & Barrett, A. D. T. (2011). Phylogeography of West Nile Virus: From the Cradle of Evolution in Africa to Eurasia, Australia, and the Americas. Journal of Virology, 85(6), 2964–2974. https://doi.org/10.1128/JVI.01963-10
McMillan, J. R., Armstrong, P. M., & Andreadis, T. G. (2020). Patterns of mosquito and arbovirus community composition and ecological indexes of arboviral risk in the northeast United States. PLOS Neglected Tropical Diseases, 14(2), e0008066. https://doi.org/10.1371/journal.pntd.0008066
Molaei, G., Andreadis, T. G., Armstrong, P. M., & Diuk-Wasser, M. (2008). Host-Feeding Patterns of Potential Mosquito Vectors in Connecticut, USA: Molecular Analysis of Bloodmeals from 23 Species of Aedes, Anopheles, Culex, Coquillettidia, Psorophora, and Uranotaenia. Journal of Medical Entomology, 45(6), 1143–1151. https://doi.org/10.1093/jmedent/45.6.1143
Molaei, G., Andreadis, T. G., Armstrong, P. M., Thomas, M. C., Deschamps, T., Cuebas-Incle, E., Montgomery, W., Osborne, M., Smole, S., Matton, P., Andrews, W., Best, C., Cornine, F., Bidlack, E., & Texeira, T. (2013). Vector-Host Interactions and Epizootiology of Eastern Equine Encephalitis Virus in Massachusetts. Vector-Borne and Zoonotic Diseases, 13(5), 312–323. https://doi.org/10.1089/vbz.2012.1099
Molaei, G., Farajollahi, A., Scott, J. J., Gaugler, R., & Andreadis, T. G. (2009). Human Bloodfeeding by the Recently Introduced Mosquito, Aedes japonicus japonicus, and Public Health Implications. Journal of the American Mosquito Control Association, 25(2), 210–214. https://doi.org/10.2987/09-0012.1
Nemeth, N. M., Bosco-Lauth, A. M., Williams, L. M., Bowen, R. A., & Brown, J. D. (2017). West Nile Virus Infection in Ruffed Grouse ( Bonasa umbellus ): Experimental Infection and Protective Effects of Vaccination. Veterinary Pathology, 54(6), 901–911. https://doi.org/10.1177/0300985817717770
Nemeth, N. M., Williams, L. M., Bosco-Lauth, A. M., Oesterle, P. T., Helwig, M., Bowen, R. A., & Brown, J. D. (2021). WEST NILE VIRUS INFECTION IN RUFFED GROUSE (BONASA UMBELLUS) IN PENNSYLVANIA, USA: A MULTI-YEAR COMPARISON OF STATEWIDE SEROSURVEYS AND VECTOR INDICES. Journal of Wildlife Diseases, 57(1). https://doi.org/10.7589/JWD-D-19-00016
NYSDEC. (n.d.). Young Forest Initiative On Wildlife Management Areas. Retrieved April 6, 2025, from https://dec.ny.gov/places-to-go/wildlife-management-areas/young-forest-initiative
O’Hara, R. B., & Kotze, D. J. (2010). Do not log‐transform count data. Methods in Ecology and Evolution, 1(2), 118–122. https://doi.org/10.1111/j.2041-210X.2010.00021.x
O’Malley, C. (1990). Aedes vexans (Meigan): An Old Foe. Proceedings of the New Jersey Mosquito Control Association, 90–95. https://vectorbio.rutgers.edu/outreach/species/sp13.htm
Owen, J. C., Nakamura, A., Coon, C. A., & Martin, L. B. (2012). The effect of exogenous corticosterone on West Nile virus infection in Northern Cardinals (Cardinalis cardinalis). Veterinary Research, 43(1), 34. https://doi.org/10.1186/1297-9716-43-34
Porter, W. F., & Jarzyna, M. A. (2013). Effects of landscape‐scale forest change on the range contraction of ruffed grouse in New York State, USA. Wildlife Society Bulletin, 37(1), 198–208. https://doi.org/10.1002/wsb.225
Reeves, L. E., Gillett-Kaufman, J. L., Kawahara, A. Y., & Kaufman, P. E. (2018). Barcoding blood meals: New vertebrate-specific primer sets for assigning taxonomic identities to host DNA from mosquito blood meals. PLOS Neglected Tropical Diseases, 12(8), e0006767. https://doi.org/10.1371/journal.pntd.0006767
Riccetti, N., Fasano, A., Ferraccioli, F., Gomez-Ramirez, J., & Stilianakis, N. I. (2022). Host selection and forage ratio in West Nile virus–transmitting Culex mosquitoes: Challenges and knowledge gaps. PLOS Neglected Tropical Diseases, 16(10), e0010819. https://doi.org/10.1371/journal.pntd.0010819
Riley, S. J., Decker, D. J., Enck, J. W., Curtis, P. D., Lauber, T. B., & Brown, T. L. (2003). Deer populations up, hunter populations down: Implications of interdependence of deer and hunter population dynamics on management. Écoscience, 10(4), 455–461. https://doi.org/10.1080/11956860.2003.11682793
Roche, B., Rohani, P., Dobson, A. P., & Guégan, J.-F. (2013). The Impact of Community Organization on Vector-Borne Pathogens. The American Naturalist, 181(1), 1–11. https://doi.org/10.1086/668591
Ronca, S. E., Murray, K. O., & Nolan, M. S. (2019). Cumulative Incidence of West Nile Virus Infection, Continental United States, 1999-2016. Emerging Infectious Diseases, 25(2), 325–327. https://doi.org/10.3201/eid2502.180765
Ronca, S. E., Ruff, J. C., & Murray, K. O. (2021). A 20-year historical review of West Nile virus since its initial emergence in North America: Has West Nile virus become a neglected tropical disease? PLOS Neglected Tropical Diseases, 15(5), e0009190. https://doi.org/10.1371/journal.pntd.0009190
Roy, C. L., Carstensen, M., LaSharr, K., Humpal, C., Dick, T., Kunkel, M., & Nemeth, N. M. (2022). WEST NILE VIRUS EXPOSURE AND INFECTION AMONG HUNTER-HARVESTED RUFFED GROUSE (BONASA UMBELLUS) COHORTS IN A STABLE POPULATION. Journal of Wildlife Diseases, 58(1). https://doi.org/10.7589/JWD-D-21-00018
Ruffed Grouse Hunting—NYSDEC. (n.d.). Retrieved January 29, 2025, from https://dec.ny.gov/things-to-do/hunting/small-game/ruffed-grouse
Santini, M., Haberle, S., Židovec-Lepej, S., Savić, V., Kusulja, M., Papić, N., Višković, K., Župetić, I., Savini, G., Barbić, L., Tabain, I., Kutleša, M., Krajinović, V., Potočnik-Hunjadi, T., Dvorski, E., Butigan, T., Kolaric-Sviben, G., Stevanović, V., Gorenec, L., … Vilibić-Čavlek, T. (2022). Severe West Nile Virus Neuroinvasive Disease: Clinical Characteristics, Short- and Long-Term Outcomes. Pathogens, 11(1), 52. https://doi.org/10.3390/pathogens11010052
Scuderi, B., Olds, E., & Southwick, R. (2024). Hunting in America: An Economic Force for Conservation. Sportsmen’s Alliance Foundation.
Servello, F. A., & Kirkpatrick, R. L. (1987). Regional Variation in the Nutritional Ecology of Ruffed Grouse. The Journal of Wildlife Management, 51(4), 749. https://doi.org/10.2307/3801737
Shahhosseini, N., Frederick, C., Racine, T., Kobinger, G. P., & Wong, G. (2021). Modeling host-feeding preference and molecular systematics of mosquitoes in different ecological niches in Canada. Acta Tropica, 213, 105734. https://doi.org/10.1016/j.actatropica.2020.105734
Shepard, J. J., & Armstrong, P. M. (2023). Jamestown Canyon virus comes into view: Understanding the threat from an underrecognized arbovirus. Journal of Medical Entomology, 60(6), 1242–1251. https://doi.org/10.1093/jme/tjad069
Smithburn, K. C., Hughes, T. P., Burke, A. W., & Paul, J. H. (1940). A Neurotropic Virus Isolated from the Blood of a Native of Uganda. The American Journal of Tropical Medicine, s1-20(4), 471–492. https://doi.org/10.4269/ajtmh.1940.s1-20.471
Stauffer, G. E., Miller, D. A. W., Williams, L. M., & Brown, J. (2018). Ruffed grouse population declines after introduction of West Nile virus. The Journal of Wildlife Management, 82(1), 165–172. https://doi.org/10.1002/jwmg.21347
Stedman, L. (2015). Ruffed Grouse (https://www.flickr.com/photos/49208525@N08/18645551408) [Photography]. Flickr.
Tiawsirisup, S., Kinley, J. R., Tucker, B. J., Evans, R. B., Rowley, W. A., & Platt, K. B. (2008). Vector Competence of Aedes vexans (Diptera: Culicidae) for West Nile Virus and Potential as an Enzootic Vector. Journal of Medical Entomology, 45(3), 452–457. https://doi.org/10.1093/jmedent/45.3.452
Tiawsirisup, S., Platt, K. B., Evans, R. B., & Rowley, W. A. (2004). Susceptibility of Ochlerotatus trivittatus (Coq.), Aedes albopictus (Skuse), and Culex pipiens (L.) to West Nile virus infection. Vector Borne and Zoonotic Diseases (Larchmont, N.Y.), 4(3), 190–197. https://doi.org/10.1089/vbz.2004.4.190
Trani, M., Brooks, R., Schmidt, T., Rudis, V., & Gabbard, C. (2001). Patterns and trends of early successional forests in the Eastern United States. Wildlife Society Bulletin, 29. https://doi.org/10.2307/3784166
Turell, M. J., Dohm, D. J., Sardelis, M. R., O’guinn, M. L., Andreadis, T. G., & Blow, J. A. (2005). An Update on the Potential of North American Mosquitoes (Diptera: Culicidae) to Transmit West Nile Virus. Journal of Medical Entomology, 42(1), 57–62. https://doi.org/10.1093/jmedent/42.1.57
Vaidyanathan, R., Edman, J. D., Cooper, L. A., & Scott, T. W. (1997). Vector Competence of Mosquitoes (Diptera: Culicidae) from Massachusetts for a Sympatric Isolate of Eastern Equine Encephalomyelitis Virus. Journal of Medical Entomology, 34(3), 346–352. https://doi.org/10.1093/jmedent/34.3.346
Weng, S.-C., Masri, R. A., & Akbari, O. S. (2024). Advances and challenges in synthetic biology for mosquito control. Trends in Parasitology, 40(1), 75–88. https://doi.org/10.1016/j.pt.2023.11.001
#Tidyverse
library(rlang)
library(dplyr)
library(knitr)
#Figure Design
library(ggplot2)
library(viridis)
library(maps)
library(ggthemes)
library(ggsignif)
library(ggrepel)
library(patchwork)
#Tables
library(reactable)
#Community Ecology
library(vegan)
#Statistics
library(MASS)
library(AER)
library(RVAideMemoire)
library(emmeans)
#Bibliography management
library(grateful)moz_data$PerYear <- paste(moz_data$Site, moz_data$Year)
.rowNamesDF(moz_data, make.names=FALSE) <- moz_data$ID
moz_data[,1] <- NULL
moz_data = merge(moz_data, County[,c("Site","WNV")], by="Site")
moz_data$WNV <- factor(moz_data$WNV , levels=c("None", "Low", "Medium", "High"))
moz_data<- moz_data %>%
relocate(WNV, .before = Ae.vex)
moz_data<-na.omit(moz_data)Smush is a custom function for condensing count
data into totals for a given category.dplyr would look at column names within the dataset, and
treat the columns like a normal object. However, making a function in
base R conflicts with dplyr’s style of data masking,
confusing dplyr about where to find the column and how to
use it. Making a Smush function requires {{}} to guide
dplyr to the column names and tell it to hang onto them
until receiving the full instructions. In this case, Smush
was used to generate a table of species counts per site per year to fit
the expected format of subsequent vegan functions.vegan funtions. I attempted
for over a month to make a version of Smush that could
handle multiple column names, but could not get it to work.SpecMoz<-Smush(moz_data, moz_data$PerYear)
SpecMoz <- as.data.frame(SpecMoz, row.names = NULL, optional = FALSE)
.rowNamesDF(SpecMoz, make.names=FALSE) <- SpecMoz$`moz_data$PerYear`
SpecMoz[,1] <- NULL
site_data <-(
moz_data %>%
group_by(moz_data$PerYear,moz_data$Site,moz_data$Forest.Type,
moz_data$Year, moz_data$Region, moz_data$WNV) %>%
summarise_if(is.integer, sum) %>%
distinct())
site_data <- as.data.frame(site_data, row.names = NULL, optional = FALSE)
.rowNamesDF(site_data, make.names=FALSE) <- site_data$`moz_data$PerYear`
site_data[,1] <- NULL%in% and wanted to
practice.NYcounty<-subset(map_data("county"), region == "new york")
NYcounty$ecoregion = vector(mode='character', length=length(NYcounty$subregion))
NYcounty$ecoregion[NYcounty$subregion %in% c('nassau', 'suffolk')] <- '1'
NYcounty$ecoregion[NYcounty$subregion %in% c('bronx', 'kings', 'new york',
'queens', 'richmond')] <- '2'
NYcounty$ecoregion[NYcounty$subregion %in% c('dutchess','orange','putnam','rockland',
'sullivan','ulster','westchester')] <- '3'
NYcounty$ecoregion[NYcounty$subregion %in% c('albany','columbia','delaware','greene',
'montgomery','otsego','rensselaer','schenectady',
'schoharie')] <- '4'
NYcounty$ecoregion[NYcounty$subregion %in% c('clinton','essex','franklin','fulton',
'hamilton','saratoga','warren',
'washington')] <- '5'
NYcounty$ecoregion[NYcounty$subregion %in% c('herkimer','jefferson',
'lewis','oneida','st lawrence')] <- '6'
NYcounty$ecoregion[NYcounty$subregion %in% c('broome','cayuga','chenango','cortland',
'madison','onondaga','oswego','tioga',
'tompkins')] <- '7'
NYcounty$ecoregion[NYcounty$subregion %in% c('chemung','genesee','livingston','monroe',
'ontario','orleans','schuyler',
'seneca','steuben','wayne','yates')] <- '8'
NYcounty$ecoregion[NYcounty$subregion %in% c('allegany','cattaraugus','chautauqua',
'erie','niagara','wyoming')] <- '9'
NYmap<-ggplot(data = NYcounty, aes(x=long, y = lat, group = group)) +
coord_quickmap() +
theme_void()NYcol<-c("#88CCEE","#CC6677","#DDCC77","#117733","#AA4499","#44AA99","#882255")
NYFIG<-NYmap +
theme_void() +
geom_polygon(data = NYcounty, aes(x=long, y = lat, fill = ecoregion), color = "white") +
scale_fill_manual(values =c("dimgray","lightgray",NYcol), name= "DEC Ecoregion",
labels = c("1 - Long Island", "2 - New York City", "3 - Lower Hudson Valley",
"4 - Capital Region/Northern Catskills", "5 - Eastern Adirondacks/Lake Champlain",
"6 - Western Adirondacks/Eastern Lake Ontario","7 - Central New York",
"8 - Western Finger Lakes", "9 - Western New York"))+
theme(legend.position = "right")+
geom_polygon(color = "black", fill = NA) +
geom_point(data=County, aes(x=Longitude, y=Latitude),
color="black", fill="white", pch=21, size=3, inherit.aes=FALSE)Smush was used to remove these duplicates.deduped.County <- distinct(County, county, .keep_all = TRUE)
deduped.County<-Smush(deduped.County, ecoregion)
deduped.County$ecoregion<-as.character(deduped.County$ecoregion)
Virus <- inner_join(NYcounty, deduped.County, by = "ecoregion")
VirusMap<-ggplot(data = Virus, aes(x=long, y = lat, group = group)) +
coord_quickmap() +
theme_void()patchwork.PVDFIG<-VirusMap +
theme_void() +
geom_polygon(data = Virus, aes(x=long, y = lat), color = "white") +
geom_polygon(data = Virus, aes(fill = PVD))+
theme(legend.position = "right")+
scale_fill_viridis(option="rocket", begin = 0.9, end = 0.1, name="Presumed Viremic Donors")+
geom_polygon(color = "black", fill = NA) +
guides(colour = guide_colourbar(order = 1),alpha = guide_legend(order = 2),
size = guide_legend(order = 3))+
ggtitle("a.") +
theme(plot.title.position = "plot", plot.title = element_text(size = 20, face = "bold"))
CASFIG<-VirusMap +
theme_void() +
geom_polygon(data = Virus, aes(x=long, y = lat), color = "white") +
geom_polygon(data = Virus, aes(fill = Total_Case))+
theme(legend.position = "right")+
scale_fill_viridis(option="mako", begin = 0.9, end = 0.1, name="Symptomatic Cases")+
geom_polygon(color = "black", fill = NA) +
guides(colour = guide_colourbar(order = 1),alpha = guide_legend(order = 2),
size = guide_legend(order = 3))+
ggtitle("b.") +
theme(plot.title.position = "plot", plot.title = element_text(size = 20, face = "bold"))reactable.Percent<- function(x){
y<-((x/sum(x)*100))
format(round(y, digits=2))
}
Pretty_Names$Host.Type<-as.factor(Pretty_Names$Host.Type)
Pretty_Names$Abbreviation<-as.factor(Pretty_Names$Abbreviation)
Pretty_Names$Generations.per.Season<-as.factor(Pretty_Names$Generations.per.Season)
Pretty_Names$Short<-as.factor(Pretty_Names$Short)
Pretty_Names$Totals<-colSums(moz_data[,10:34])
Pretty_Names$Percent<-Percent(Pretty_Names$Totals)
Pretty_Names$Totals<-as.numeric(Pretty_Names$Totals)
Pretty_Names$Percent<-as.numeric(Pretty_Names$Percent)
Pretty_Names <-(Pretty_Names %>%
arrange(-Pretty_Names$Percent))
Presentable_Names <-(Pretty_Names %>%
rename("Host Type" = `Host.Type`) %>%
rename("Generations per Season" = `Generations.per.Season`))\[\\[0.03in]\]
Vegan
tutorial “Vegan cheat sheet” by An Bui.specnumber and revealed a uniform count of 25 species. No
further testing of ecoregion, forest type, or WNV risk category was
necessary.## Df Sum Sq Mean Sq F value Pr(>F)
## site_data$`moz_data$Site` 34 2.019e-26 5.938e-28 1 0.499
## Residuals 35 2.078e-26 5.938e-28
diversity) revealed slight differences among sites
and forest types but significant differences among WNV risk
categories.## Df Sum Sq Mean Sq F value Pr(>F)
## site_data$`moz_data$Site` 34 0.0008728 2.567e-05 1.559 0.098 .
## Residuals 35 0.0005762 1.646e-05
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## Df Sum Sq Mean Sq F value Pr(>F)
## site_data$`moz_data$Forest.Type` 5 0.0002142 4.285e-05 2.221 0.0629 .
## Residuals 64 0.0012348 1.929e-05
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## Df Sum Sq Mean Sq F value Pr(>F)
## site_data$`moz_data$WNV` 3 0.0003032 1.011e-04 5.821 0.00136 **
## Residuals 66 0.0011459 1.736e-05
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## Tukey multiple comparisons of means
## 95% family-wise confidence level
##
## Fit: aov(formula = diversity(SpecMoz) ~ site_data$`moz_data$WNV`, data = site_data)
##
## $`site_data$`moz_data$WNV``
## diff lwr upr p adj
## Low-None 0.0041574833 0.0010747993 0.007240167 0.0038465
## Medium-None 0.0045103806 0.0007591592 0.008261602 0.0121216
## High-None 0.0009257985 -0.0039857033 0.005837300 0.9595171
## Medium-Low 0.0003528973 -0.0035883783 0.004294173 0.9953284
## High-Low -0.0032316848 -0.0082898308 0.001826461 0.3402383
## High-Medium -0.0035845821 -0.0090758081 0.001906644 0.3213817
ggplot(SpecMoz, aes(x =site_data$`moz_data$WNV`, y = diversity(SpecMoz), fill = site_data$`moz_data$WNV`)) +
geom_boxplot() +
theme(legend.position = "none", plot.title = element_text(hjust = 0.5), axis.title=element_text(size=14,face="bold"), axis.text=element_text(size=12)) +
labs(x = "Risk Level", y = "Shannon-Weiner Index", title = NULL) +
scale_fill_viridis(option = "turbo",begin = 0.1, end = 0.9, discrete=TRUE) +
Transparent+
geom_signif(annotations ="**", y_position = c(3.214), xmin = c("None"), xmax =c("Low"))+
geom_signif(annotations ="*", y_position = c(3.216), xmin = c("None"), xmax =c("Medium"))specpool and for each site per
year using estimateR.adonis2 was used to assess community
dissimilarity.adonis2 is essentially a perMANOVA test that places
each site’s mosquito community in ordination space based on synthetic
axes that explain the maximum amount of difference among them. These
mosquito communities are then grouped by a supplied variable, then these
groups are compared. If the center of the groups in ordination space are
close together, the communities are not significantly different from one
another based on the supplied variable.adonis2 groups communities by Site to determine if
any differences exist between communities at all.## Permutation test for adonis under reduced model
## Permutation: free
## Number of permutations: 999
##
## adonis2(formula = SpecMoz ~ site_data$`moz_data$Site`, data = site_data)
## Df SumOfSqs R2 F Pr(>F)
## Model 34 0.15524 0.53566 1.1875 0.015 *
## Residual 35 0.13457 0.46434
## Total 69 0.28982 1.00000
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## Permutation test for adonis under reduced model
## Permutation: free
## Number of permutations: 999
##
## adonis2(formula = SpecMoz ~ site_data$`moz_data$WNV`, data = site_data)
## Df SumOfSqs R2 F Pr(>F)
## Model 3 0.024186 0.08345 2.0031 0.001 ***
## Residual 66 0.265632 0.91655
## Total 69 0.289818 1.00000
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Smush to create a pared-down
version of site_data with no mosquito counts and a new version of
SpecMoz that contains mosquito data per site, not per site
and year.## Call: rda(X = SiteDF)
##
## -- Model Summary --
##
## Inertia Rank
## Total 855416
## Unconstrained 855416 25
##
## Inertia is variance
##
## -- Eigenvalues --
##
## Eigenvalues for unconstrained axes:
## PC1 PC2 PC3 PC4 PC5 PC6 PC7 PC8
## 195100 79602 75146 63308 59437 51839 50604 44527
## (Showing 8 of 25 unconstrained eigenvalues)
## Importance of components:
## PC1 PC2 PC3 PC4 PC5
## Eigenvalue 1.951e+05 7.960e+04 7.515e+04 6.331e+04 5.944e+04
## Proportion Explained 2.281e-01 9.306e-02 8.785e-02 7.401e-02 6.948e-02
## Cumulative Proportion 2.281e-01 3.211e-01 4.090e-01 4.830e-01 5.525e-01
## PC6 PC7 PC8 PC9 PC10
## Eigenvalue 5.184e+04 5.060e+04 4.453e+04 3.945e+04 3.253e+04
## Proportion Explained 6.060e-02 5.916e-02 5.205e-02 4.611e-02 3.803e-02
## Cumulative Proportion 6.131e-01 6.722e-01 7.243e-01 7.704e-01 8.084e-01
## PC11 PC12 PC13 PC14 PC15
## Eigenvalue 3.066e+04 2.304e+04 1.910e+04 1.863e+04 1.565e+04
## Proportion Explained 3.585e-02 2.694e-02 2.233e-02 2.177e-02 1.829e-02
## Cumulative Proportion 8.443e-01 8.712e-01 8.935e-01 9.153e-01 9.336e-01
## PC16 PC17 PC18 PC19 PC20
## Eigenvalue 1.370e+04 1.064e+04 8.172e+03 7.403e+03 5.563e+03
## Proportion Explained 1.601e-02 1.244e-02 9.554e-03 8.655e-03 6.503e-03
## Cumulative Proportion 9.496e-01 9.621e-01 9.716e-01 9.803e-01 9.868e-01
## PC21 PC22 PC23 PC24 PC25
## Eigenvalue 3.959e+03 3.151e+03 2.531e+03 1.167e+03 5.145e+02
## Proportion Explained 4.628e-03 3.684e-03 2.959e-03 1.364e-03 6.015e-04
## Cumulative Proportion 9.914e-01 9.951e-01 9.980e-01 9.994e-01 1.000e+00
plot_PCA <- ggplot() +
geom_point(data = PCAscores, size=3, aes(x = PC1, y = PC2, color=WNV, shape=WNV)) +
scale_color_viridis(option = "turbo",begin = 0.1, end = 0.9, discrete=TRUE) +
geom_vline(xintercept = c(0), color = "grey70", linetype = 1) +
geom_hline(yintercept = c(0), color = "grey70", linetype = 1) +
geom_segment(data = PCAvect, aes(x = 0, y = 0, xend = PC1, yend = PC2),
arrow = arrow(length = unit(0.2, "cm"), type = "closed")) +
geom_text_repel(data = PCAvect, max.overlaps = 20,
aes(x = PC1, y = PC2, label = rownames(PCAvect))) +
Transparent +
labs(x = "PC1 (22.81%)",
y = "PC2 (9.3%)",
title = "Principal Components Analysis by WNV Risk") +
guides(color = guide_legend(title = "Risk Level"),
shape= guide_legend(title = "Risk Level"))
plot_PCA##
## Call:
## metaMDS(comm = SiteDF)
##
## global Multidimensional Scaling using monoMDS
##
## Data: wisconsin(sqrt(SiteDF))
## Distance: bray
##
## Dimensions: 2
## Stress: 0.2444585
## Stress type 1, weak ties
## Best solution was repeated 1 time in 20 tries
## The best solution was from try 12 (random start)
## Scaling: centring, PC rotation, halfchange scaling
## Species: expanded scores based on 'wisconsin(sqrt(SiteDF))'
Moz_Stress <- scores(Moz_NMDS, display = "sites") %>%
as.data.frame() %>%
tibble::rownames_to_column("Site") %>%
full_join(SiteforPCA, by = "Site")
plot_nmds <- ggplot(Moz_Stress, aes(x = NMDS1, y = NMDS2, color = WNV, shape = WNV)) +
geom_point(size = 3) +
stat_ellipse(linetype = 2, linewidth = 1) +
scale_color_viridis(option = "turbo",begin = 0.1, end = 0.9, discrete=TRUE) +
Transparent +
labs(x = "NMDS X", y = "NMDS Y", title = NULL) +
guides(color = guide_legend(title = "Risk Level"), shape= guide_legend(title = "Risk Level"))envfit function allows the
species count data to be added to the ordination space using the same
process as the first NMDS. In this case, the model had 999 tries to find
the best placement for the species.envfit function were reduced by a
constant factor (0.65, 0.2), which did not alter the direction, but did
bring them closer to the site ellipses. The code below will produce the
original figure.\[\\[0.03in]\]
Interpret function is a simple
custom function for printing mean and standard deviation in a way that
is easy (for me) to interpret. While eyeballing the means and standard
deviations can reveal some issues, I also used byf.shapiro
from RVAideMemoire to be sure.Interpret<- function(x) {
sprintf("Mean (Std. Dev) = %1.2f (%1.2f)", mean(x), sd(x))
}
with(moz_data, tapply(moz_data$Oc.triv,moz_data$WNV, Interpret))## None Low
## "Mean (Std. Dev) = 37.33 (32.45)" "Mean (Std. Dev) = 47.11 (31.09)"
## Medium High
## "Mean (Std. Dev) = 44.15 (31.39)" "Mean (Std. Dev) = 45.00 (32.30)"
##
## Shapiro-Wilk normality tests
##
## data: moz_data$Oc.triv by moz_data$WNV
##
## W p-value
## None 0.8978 < 2.2e-16 ***
## Low 0.9405 1.112e-10 ***
## Medium 0.9362 2.017e-07 ***
## High 0.9158 1.236e-05 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
“DO NOT LOG TRANSFORM COUNT DATA!”
##
## Call:
## glm(formula = moz_data$Oc.triv ~ moz_data$WNV, family = poisson)
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) 3.619775 0.007471 484.54 <2e-16 ***
## moz_data$WNVLow 0.232788 0.010775 21.60 <2e-16 ***
## moz_data$WNVMedium 0.167878 0.013229 12.69 <2e-16 ***
## moz_data$WNVHigh 0.186888 0.016950 11.03 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for poisson family taken to be 1)
##
## Null deviance: 33421 on 1117 degrees of freedom
## Residual deviance: 32912 on 1114 degrees of freedom
## AIC: 38229
##
## Number of Fisher Scoring iterations: 5
dispersiontest from AER is
next.##
## Overdispersion test
##
## data: TrivFish
## z = 36.714, p-value < 2.2e-16
## alternative hypothesis: true alpha is greater than 0
## sample estimates:
## alpha
## 23.27223
MASS.##
## Call:
## glm.nb(formula = moz_data$Oc.triv ~ moz_data$WNV, init.theta = 0.7888318081,
## link = log)
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) 3.61977 0.05193 69.703 < 2e-16 ***
## moz_data$WNVLow 0.23279 0.07974 2.919 0.00351 **
## moz_data$WNVMedium 0.16788 0.09741 1.723 0.08480 .
## moz_data$WNVHigh 0.18689 0.12702 1.471 0.14120
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for Negative Binomial(0.7888) family taken to be 1)
##
## Null deviance: 1353.5 on 1117 degrees of freedom
## Residual deviance: 1344.1 on 1114 degrees of freedom
## AIC: 10597
##
## Number of Fisher Scoring iterations: 1
##
##
## Theta: 0.7888
## Std. Err.: 0.0334
##
## 2 x log-likelihood: -10587.1310
##
## Call:
## glm.nb(formula = moz_data$Oc.triv ~ 1, init.theta = 0.7823762308,
## link = log)
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) 3.74309 0.03412 109.7 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for Negative Binomial(0.7824) family taken to be 1)
##
## Null deviance: 1344.1 on 1117 degrees of freedom
## Residual deviance: 1344.1 on 1117 degrees of freedom
## AIC: 10601
##
## Number of Fisher Scoring iterations: 1
##
##
## Theta: 0.7824
## Std. Err.: 0.0330
##
## 2 x log-likelihood: -10596.5100
## Likelihood ratio tests of Negative Binomial Models
##
## Response: moz_data$Oc.triv
## Model theta Resid. df 2 x log-lik. Test df LR stat.
## 1 1 0.7823762 1117 -10596.51
## 2 moz_data$WNV 0.7888318 1114 -10587.13 1 vs 2 3 9.379692
## Pr(Chi)
## 1
## 2 0.02464629
## [1] 2.234498e-06
## [1] 2.997222e-06
##
## Call:
## glm.nb(formula = moz_data$Oc.triv ~ moz_data$WNV + moz_data$Forest.Type,
## init.theta = 0.7916109915, link = log)
##
## Coefficients:
## Estimate Std. Error z value
## (Intercept) 3.43638 0.16789 20.468
## moz_data$WNVLow 0.28517 0.08984 3.174
## moz_data$WNVMedium 0.10348 0.10588 0.977
## moz_data$WNVHigh 0.20117 0.12883 1.561
## moz_data$Forest.TypeConiferous_Mixed Oak 0.42156 0.27284 1.545
## moz_data$Forest.TypeConiferous_Northern Hardwoods 0.20955 0.17511 1.197
## moz_data$Forest.TypeMixed Oak 0.06108 0.26194 0.233
## moz_data$Forest.TypeMixed Oak_Northern Hardwoods 0.31067 0.20287 1.531
## moz_data$Forest.TypeNorthern Hardwoods 0.15061 0.15707 0.959
## Pr(>|z|)
## (Intercept) <2e-16 ***
## moz_data$WNVLow 0.0015 **
## moz_data$WNVMedium 0.3284
## moz_data$WNVHigh 0.1184
## moz_data$Forest.TypeConiferous_Mixed Oak 0.1223
## moz_data$Forest.TypeConiferous_Northern Hardwoods 0.2314
## moz_data$Forest.TypeMixed Oak 0.8156
## moz_data$Forest.TypeMixed Oak_Northern Hardwoods 0.1257
## moz_data$Forest.TypeNorthern Hardwoods 0.3376
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for Negative Binomial(0.7916) family taken to be 1)
##
## Null deviance: 1357.6 on 1117 degrees of freedom
## Residual deviance: 1344.1 on 1109 degrees of freedom
## AIC: 10603
##
## Number of Fisher Scoring iterations: 1
##
##
## Theta: 0.7916
## Std. Err.: 0.0335
##
## 2 x log-likelihood: -10583.1280
## [1] 1.358016e-06
TrivBin$WNV<-TrivBin$moz_data$WNV
pairwise1 = emmeans(TrivBin, ~ WNV)
pairs(pairwise1, adjust="tukey")## contrast estimate SE df z.ratio p.value
## None - Low -0.2328 0.0797 Inf -2.919 0.0184
## None - Medium -0.1679 0.0974 Inf -1.723 0.3114
## None - High -0.1869 0.1270 Inf -1.471 0.4549
## Low - Medium 0.0649 0.1020 Inf 0.635 0.9208
## Low - High 0.0459 0.1310 Inf 0.351 0.9852
## Medium - High -0.0190 0.1420 Inf -0.134 0.9991
##
## Results are given on the log (not the response) scale.
## P value adjustment: tukey method for comparing a family of 4 estimates
ggplot(moz_data, aes(x = moz_data$WNV, y = moz_data$Oc.triv, fill = moz_data$WNV)) +
geom_boxplot() +
theme(legend.position = "none", plot.title = element_text(hjust = 0.5), axis.title=element_text(size=14,face="bold"), axis.text=element_text(size=12)) +
labs(x = "Risk Level", y = "Oc. trivittatus Counts", title = NULL) +
scale_fill_viridis(option = "turbo",begin = 0.1, end = 0.9, discrete=TRUE) +
Transparent +
geom_signif(annotations ="*", y_position = c(110), xmin = c("None"), xmax =c("Low"))## contrast estimate SE df z.ratio p.value
## None - Low -0.1974 0.0789 Inf -2.502 0.0596
## None - Medium -0.2308 0.0963 Inf -2.396 0.0778
## None - High -0.1820 0.1260 Inf -1.449 0.4687
## Low - Medium -0.0334 0.1010 Inf -0.330 0.9876
## Low - High 0.0153 0.1290 Inf 0.119 0.9994
## Medium - High 0.0487 0.1410 Inf 0.346 0.9857
##
## Results are given on the log (not the response) scale.
## P value adjustment: tukey method for comparing a family of 4 estimates
## contrast estimate SE df z.ratio p.value
## None - Low -0.21142 0.0783 Inf -2.702 0.0348
## None - Medium -0.21250 0.0956 Inf -2.224 0.1167
## None - High -0.16789 0.1250 Inf -1.347 0.5329
## Low - Medium -0.00108 0.1000 Inf -0.011 1.0000
## Low - High 0.04353 0.1280 Inf 0.339 0.9866
## Medium - High 0.04461 0.1400 Inf 0.320 0.9887
##
## Results are given on the log (not the response) scale.
## P value adjustment: tukey method for comparing a family of 4 estimates
ggplot(moz_data, aes(x = moz_data$WNV, y = moz_data$Ae.vex, fill = moz_data$WNV)) +
geom_boxplot() +
theme(legend.position = "none", plot.title = element_text(hjust = 0.5), axis.title=element_text(size=14,face="bold"), axis.text=element_text(size=12)) +
labs(x = "Risk Level", y = "Ae. vexans Counts", title = NULL) +
scale_fill_viridis(option = "turbo",begin = 0.1, end = 0.9, discrete=TRUE) +
Transparent +
geom_signif(annotations ="*", y_position = c(110), xmin = c("None"), xmax =c("Low"))## contrast estimate SE df z.ratio p.value
## None - Low -0.1793 0.0779 Inf -2.301 0.0977
## None - Medium -0.2205 0.0951 Inf -2.318 0.0939
## None - High 0.0391 0.1240 Inf 0.315 0.9892
## Low - Medium -0.0412 0.0998 Inf -0.413 0.9763
## Low - High 0.2184 0.1280 Inf 1.707 0.3200
## Medium - High 0.2596 0.1390 Inf 1.866 0.2427
##
## Results are given on the log (not the response) scale.
## P value adjustment: tukey method for comparing a family of 4 estimates
moz_data$Totals<-rowSums(moz_data[,10:34])
ggplot(moz_data, aes(moz_data$Totals, fill = moz_data$WNV)) +
facet_grid(moz_data$WNV~., scales = "free_y") +
scale_fill_viridis(option = "turbo",begin = 0.1, end = 0.9, discrete=TRUE) +
geom_histogram(binwidth = 10) +
Transparent +
theme(strip.background =element_rect(fill="transparent"))+
theme(panel.background = element_rect(fill = "transparent", color="black"))+
guides(fill = "none") +
labs(x = "Total Mosquitoes Collected",
y = "Count")## None Low
## "Mean (Std. Dev) = 1204.17 (154.72)" "Mean (Std. Dev) = 1232.99 (152.48)"
## Medium High
## "Mean (Std. Dev) = 1237.51 (145.35)" "Mean (Std. Dev) = 1200.10 (140.51)"
##
## Shapiro-Wilk normality tests
##
## data: moz_data$Totals by moz_data$WNV
##
## W p-value
## None 0.9975 0.6804
## Low 0.9946 0.2573
## Medium 0.9905 0.2399
## High 0.9861 0.4076
## Df Sum Sq Mean Sq F value Pr(>F)
## moz_data$WNV 3 273655 91218 3.986 0.00774 **
## Residuals 1114 25495454 22886
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## Tukey multiple comparisons of means
## 95% family-wise confidence level
##
## Fit: aov(formula = moz_data$Totals ~ moz_data$WNV, data = moz_data)
##
## $`moz_data$WNV`
## diff lwr upr p adj
## Low-None 28.823485 1.50959041 56.13738 0.0339592
## Medium-None 33.339693 -0.02244668 66.70183 0.0502291
## High-None -4.066667 -47.58466592 39.45133 0.9951116
## Medium-Low 4.516208 -30.52389897 39.55632 0.9874217
## High-Low -32.890152 -77.70748077 11.92718 0.2336590
## High-Medium -37.406360 -86.14624614 11.33353 0.1982090
ggplot(moz_data, aes(x = moz_data$WNV, y = moz_data$Totals, fill = moz_data$WNV)) +
geom_boxplot() +
theme(legend.position = "none", plot.title = element_text(hjust = 0.5), axis.title=element_text(size=14,face="bold"), axis.text=element_text(size=12)) +
labs(x = "Risk Level", y = "Mosquito Totals", title = NULL) +
scale_fill_viridis(option = "turbo",begin = 0.1, end = 0.9, discrete=TRUE) +
Transparent +
geom_signif(annotations ="*", y_position = c(1800), xmin = c("None"), xmax =c("Low"))ggplot(moz_data, aes(moz_data$Totals, fill = moz_data$Forest.Type)) +
facet_grid(moz_data$Forest.Type~., scales = "free_y") +
scale_fill_viridis(begin = 0.2, end = 0.9, discrete=TRUE) +
theme(legend.title = element_blank(), strip.background = element_blank(),strip.text = element_blank())+
geom_histogram(binwidth = 10)+
Transparent+
theme(panel.background = element_rect(fill = "transparent", color="black"))+
labs(x = "Total Mosquitoes Collected",
y = "Count")## Coniferous Coniferous_Mixed Oak
## "Mean (Std. Dev) = 1230.58 (156.99)" "Mean (Std. Dev) = 1271.12 (168.87)"
## Coniferous_Northern Hardwoods Mixed Oak
## "Mean (Std. Dev) = 1200.44 (154.51)" "Mean (Std. Dev) = 1183.09 (151.80)"
## Mixed Oak_Northern Hardwoods Northern Hardwoods
## "Mean (Std. Dev) = 1226.39 (136.94)" "Mean (Std. Dev) = 1222.53 (150.95)"
##
## Shapiro-Wilk normality tests
##
## data: moz_data$Totals by moz_data$Forest.Type
##
## W p-value
## Coniferous 0.9861 0.6882
## Coniferous_Mixed Oak 0.9830 0.8816
## Coniferous_Northern Hardwoods 0.9977 0.9781
## Mixed Oak 0.9729 0.5818
## Mixed Oak_Northern Hardwoods 0.9935 0.9288
## Northern Hardwoods 0.9976 0.4864
## Df Sum Sq Mean Sq F value Pr(>F)
## moz_data$Forest.Type 5 237242 47448 2.067 0.0672 .
## Residuals 1112 25531867 22960
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
\[\\[0.03in]\]