Author: Grace Strobbe


1. Initiate the Project


1.1. Dependencies

#This command loads required packages
library(ggplot2)

2. The Cave Molly and its Ancestors

To wrap our heads around some of the basic observations that led Darwin to infer natural selection, we will spend a little bit of time with the cave molly. The cave molly (Poecilia mexicana) is a small species of livebearing fish that occurs in a couple of small caves in Southern Mexico. One of the caves, the Cueva Luna Azufre, has a wetted area of only 39 square meters. Even though the available habitat is really small, there has been an isolated population of cave mollies in this cave for several thousand years. Interestingly, mollies also occur in adjacent surface habitats. In the picture below, you can see a male and a female of the surface form (top two pictures) and the cave form (bottom two pictures) side by side.


3. The Struggle for Existence

The first set of observations that led Darwin to infer the process of natural selection related to the imbalance of organisms’ reproductive power and limitations of resource availability. Quantifying the effective reproductive output and resource availability in nature can be difficult. However, what we can do is to measure proxies for these traits and then use simple mathematical models to test whether our predictions and inferences are valid. Here, we use exponential and logistic population growth models to explore whether there is really a struggle for existence in cave mollies.


3.1. Observation 1: Populations Have a Huge Reproductive Potential

Even large animals with long generation times have an incredible reproductive potential. Cave mollies, like many other cave organisms, have a comparatively low fecundity, and females only give birth to one or two fully developed young at a time. Life history analyses based on female longevity and fecundity have revealed that the average female gives birth to about 3 offspring over her life; not exactly what you would call huge reproductive potential, right? But in reality, it is not the reproductive potential of individuals that counts, but the reproductive potential of populations. To illustrate, we want you to model population growth for a hypothetical population of cave mollies. Specifically, use the code below to simulate and graph the population growth of an initial cave molly population of 2 individuals (the initial colonizers of the cave).

How many generations would it take for the population to grow to a million? Under what circumstances might you see population growth like this? Do you think Darwin’s observation that “species have great potential fertility” holds true for cave mollies?

#Choose an initial population size
N0 = 2

#Choose the average number of offspring
b = 3

#Choose a range of generations you want to estimate population size for
t = 0:15

#Calculate the population size for each generation
N = N0*b^t

#Merge the results of the simulation into a single table
final.results <- as.data.frame(cbind(t,N))

#You can view the results by just calling the data frame
final.results

#Plot the results, make sure you properly label the axes
ggplot(final.results, aes(x=t, y=N)) + 
  geom_point() + 
  xlab("Generation") + 
  ylab("Population size") +
  theme_classic()

The population would reach one million individuals by the twelfth generation (2 × 3^12 = 1,062,882), although growth like this could only occur if the habitat offered enough living space and nutrients for every individual to survive and reproduce. Darwin’s observation most likely holds true for cave mollies: even with only about 3 offspring per female, the simulated population grows from 2 to 486 individuals by the fifth generation.
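The generation count can also be cross-checked by solving the growth equation for t instead of reading it off the table. The following is a minimal sketch under the same model (N = N0*b^t); the names target and t.needed are made up for this example and are not part of the assignment:

```r
#Same parameters as in the simulation above
N0 <- 2
b <- 3

#Population size we want to reach (illustrative name)
target <- 1e6

#Solve N0*b^t >= target for t, then round up to whole generations
t.needed <- ceiling(log(target/N0)/log(b))
t.needed

#Population size in the generation before and in the generation itself
N0*b^(t.needed - 1)
N0*b^t.needed
```

Rounding up is needed because the model only tracks whole generations.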


3.2. Observation 2: Natural Resources are Limited

Exponential growth only occurs in very specific circumstances. In a cave that is only a fraction of the size of a football field, you would obviously never find a cave molly population of a million. The logistic model more accurately describes population growth in nature. Based on our past analyses, we estimate the population growth coefficient (λ) to be around 1.3 and the carrying capacity (K) of the cave around 360 individuals.

How long would it take for the population to reach the carrying capacity if there were two initial colonizers? What do you think determines K for the population of cave mollies in the Cueva Luna Azufre?

#Choose an initial population size
N0 = 2
#Choose population growth rate
lamda = 1.3
#Choose a range of generations you want to estimate population size for
t = 0:15
#Choose a carrying capacity
K = 360
#Calculate the population size for each generation
N = (N0*K)/(N0+(K-N0)*exp(-lamda*t))
#Merge the results of the simulation into a single table
final.results <- as.data.frame(cbind(t,N))
#Use the ggplot function to plot the results, make sure you properly label the axes
ggplot(final.results, aes(x=t, y=N)) + 
  geom_point() + 
  xlab("Generation") + 
  ylab("Population") +
  theme_classic()

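Under this model the population approaches the carrying capacity gradually rather than reaching it in a single step: it comes within about one individual of K (360) by roughly the ninth generation and is effectively at carrying capacity by the fifteenth. K is most likely determined by the amount of resources in the cave, such as nutrients and living space, which limits the number of mollies that are able to survive in the cave environment.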
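Because the logistic curve only approaches K asymptotically, "reaching" the carrying capacity requires a cutoff. The sketch below evaluates the same formula for every generation from 0 to 15 and reports the first generation within 1% of K. The 1% cutoff and the names logistic.check and t.near.K are arbitrary choices for illustration:

```r
#Parameters as in the logistic simulation above
N0 <- 2
lamda <- 1.3
K <- 360

#Evaluate every generation from colonization onward
t <- 0:15
N <- (N0*K)/(N0 + (K - N0)*exp(-lamda*t))

#Tabulate population size and its percentage of K
logistic.check <- data.frame(t = t, N = round(N, 1), percent.of.K = round(100*N/K, 1))
logistic.check

#First generation at which the population is within 1% of K
t.near.K <- min(t[N >= 0.99*K])
t.near.K
```

A stricter or looser cutoff shifts the answer by a generation or two, which is worth keeping in mind when reading a single number off the plot.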


3.3. Where Do All the Missing Offspring Go?

Compare the two models (exponential and logistic) that were run with the same initial parameters. What do the different outcomes mean for individual offspring that are born in any given generation? How might this discrepancy be important in the context of evolution?

In the exponential model, all offspring somehow survive and reproduce, which is unrealistic. In the logistic model, fewer of the offspring are able to survive, which in turn means there is less chance for evolution to occur within the population. The main difference between the two models is the ability for evolution to occur within the population.
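To put a number on the discrepancy, the two models can be evaluated side by side. This is only a sketch: to keep the comparison fair it assumes the unchecked counterpart of the logistic model, N0*exp(lamda*t), rather than the N0*b^t model from Section 3.1, and the name model.gap is made up for this example:

```r
#Shared parameters, taken from the logistic simulation above
N0 <- 2
lamda <- 1.3
K <- 360
t <- 0:15

#Population size expected with and without a carrying capacity
N.exponential <- N0*exp(lamda*t)
N.logistic <- (N0*K)/(N0 + (K - N0)*exp(-lamda*t))

#Individuals predicted by unchecked growth that the cave cannot support
model.gap <- data.frame(t = t,
                        exponential = round(N.exponential),
                        logistic = round(N.logistic),
                        missing = round(N.exponential - N.logistic))
model.gap
```

The missing column is the number of individuals that unchecked growth would predict but that the habitat cannot support in each generation.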


4. Individuals Vary in Their Traits

Another of Darwin’s key observations was just how variable individuals of the same species are. Let’s explore some of that variation in cave mollies. To do that, we first need to load some data into R. These data were collected as part of my dissertation and include the following variables: habitat (cave or surface), sex (male or female), standard length (in mm, from the snout to the caudal fin base), eye diameter (in mm), head length (in mm), head width (in mm), predorsal length (in mm, from the snout to the insertion of the dorsal fin), and gape width (in mm, from one corner of the mouth to the other).

#Use the read.csv function to import a dataset
morph.data <- read.csv("morphological_variation.csv", fileEncoding = 'UTF-8-BOM')
morph.data

4.1. Comparing Body Size Variation Within and Between Populations

A simple way to compare variation within and between populations is to plot a frequency histogram (which represents the raw counts) along with a density plot (which represents the approximated statistical distribution). You can generate a histogram with the geom_histogram() function and designate any trait you may want as the x axis. You can calculate the density with aes(y=..density..) within geom_histogram() and then plot it with geom_density(). Note that when you have two or more groups (in our case we have samples from a cave and a surface population), you can visualize them separately by designating a different color for each group in the aesthetics (fill=Habitat).

When you visualize body size variation in this manner what do you observe? Is there more variation within or between populations?

#Use the ggplot function to graph the histogram (see: http://www.sthda.com/english/wiki/ggplot2-histogram-plot-quick-start-guide-r-software-and-data-visualization)
ggplot(morph.data, aes(x=Standard.length, fill=Habitat)) + 
  geom_histogram(aes(y=..density..), position = "dodge") +
  geom_density(alpha=0.5)+
  xlab("Standard Length") + 
  ylab("Density") +
  theme_classic()
`stat_bin()` using `bins = 30`. Pick better value with
`binwidth`.

The graph shows that body size varies over a similar range in the cave and surface populations. There seems to be more variation within populations than between them.
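The visual impression of within- versus between-population variation can be backed up with a simple variance partition. The sketch below uses simulated values in place of morph.data, so the numbers carry no biological meaning; demo.data, fit.size, and prop.between are names made up for this example:

```r
#Simulated stand-in for morph.data (made-up values, for illustration only)
set.seed(1)
demo.data <- data.frame(
  Habitat = rep(c("Cave", "Surface"), each = 50),
  Standard.length = c(rnorm(50, mean = 30, sd = 5), rnorm(50, mean = 32, sd = 5))
)

#Mean and standard deviation of body size within each habitat
aggregate(Standard.length ~ Habitat, data = demo.data,
          FUN = function(x) c(mean = mean(x), sd = sd(x)))

#Partition the total variation into between- and within-habitat components
fit.size <- aov(Standard.length ~ Habitat, data = demo.data)
ss <- summary(fit.size)[[1]][["Sum Sq"]]

#Proportion of the total sum of squares explained by habitat
prop.between <- ss[1]/sum(ss)
prop.between
```

A small proportion means most of the variation sits within populations. Replacing demo.data with morph.data would apply the same check to the real measurements.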


4.2. Comparing Predorsal Length Variation Within and Between Populations

Let’s also compare a second trait, predorsal length. With the previous graph you hopefully saw how variable overall body size is within populations. If we want to compare other traits, we have to account for that. We want to know whether variation in predorsal length is due to variation in size (small fish have small predorsal lengths) or whether other patterns might be at play. To do so, we can calculate the residual predorsal length from a regression of predorsal length on standard length:

#Calculating regression line
fit1 <- lm(Predorsal.length ~ Standard.length, data = morph.data)

#Extract residuals and create a new variable res.predorsal in the morph.data data frame
morph.data$res.predorsal <- residuals(fit1)
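Why residuals work as a size correction can be checked on made-up numbers. In the sketch below, the simulated data frame demo.data stands in for morph.data, and the slope of 0.6 is an arbitrary assumption. The raw trait is strongly correlated with body size, whereas the residuals are not:

```r
#Simulated stand-in for morph.data (made-up values, for illustration only)
set.seed(2)
demo.data <- data.frame(Standard.length = runif(100, min = 20, max = 50))
demo.data$Predorsal.length <- 0.6*demo.data$Standard.length + rnorm(100, sd = 1)

#Raw predorsal length is strongly correlated with body size
cor(demo.data$Predorsal.length, demo.data$Standard.length)

#Regress the trait on body size and keep the residuals
fit.demo <- lm(Predorsal.length ~ Standard.length, data = demo.data)
demo.data$res.predorsal <- residuals(fit.demo)

#The residuals average zero and are uncorrelated with body size
mean(demo.data$res.predorsal)
cor(demo.data$res.predorsal, demo.data$Standard.length)
```

A positive residual therefore marks a fish whose predorsal length is longer than expected for its body size, and a negative residual marks one that is shorter than expected.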

You can then use the new variable to plot the residual predorsal length, which is corrected for body size:

##Use the ggplot function to graph the histogram and color data based on habitat
ggplot(morph.data, aes(x=res.predorsal, fill=Habitat)) +
  geom_histogram(aes(y=..density..)) +
  geom_density(alpha=0.5)+
  xlab("Relative Predorsal Length") + 
  ylab("Density") +
  theme_classic()
`stat_bin()` using `bins = 30`. Pick better value with
`binwidth`.

When you plot relative predorsal length, what do you observe? How does variation in predorsal length vary within and between populations, and how does it compare to variation in standard length?

When plotting the relative predorsal length, we can observe that the length differs greatly between the cave and surface populations. It appears that where population density is lower, there is more variation, and it becomes more uniform as the density increases, which is similar to standard length.


4.3. Comparing Eye Size Variation Within and Between Populations

Using the same approach as for predorsal variation, compare variation in relative eye diameter:

#Calculate relative eye size as eye diameter divided by standard length
morph.data$relative.eye.size <- morph.data$Eye.diameter/morph.data$Standard.length

##Use the ggplot function to graph the histogram and color data based on habitat
ggplot(morph.data, aes(x=relative.eye.size, fill=Habitat)) + 
  geom_histogram(aes(y=..density..), position = "dodge") +
  geom_density(alpha=0.5) +
  xlab("Relative Eye Diameter") + 
  ylab("Density") +
  theme_classic()
`stat_bin()` using `bins = 30`. Pick better value with
`binwidth`.

What do you observe? How does variation in eye diameter vary within and between populations, and how does it compare to variation in the other traits?

Eye size shows the most variability between populations: the diameter tends to be smaller in the cave population and larger in the surface population. This difference in eye size is largely due to the fact that there is little light in the caves, so eyesight is an unimportant feature for those specific mollies. Therefore, it makes sense for traits other than eye size to be similar between the two populations.


5. Variation in Traits is Heritable

An avid breeder of fancy pigeons, Darwin observed that specific traits are passed from parents to offspring, even though he had no clue how this might actually work (genetics was not really a thing yet). Even without an ability to conduct molecular genetic analyses, we can estimate heritability of traits by comparing the traits of offspring to the traits of the parents.

Let’s load some data that compares parent and offspring traits in cave mollies. To do this, we brought cave mollies into the lab and bred them under standardized conditions. Data represent the average trait values of the mother and father and of all offspring from a specific brood. The easiest way to compare parent and offspring traits is through a scatter plot, which we already used in Exercise 1. If a trait is heritable, we would expect to see a correlation between parent and offspring traits (e.g., parents with small eyes should have offspring with small eyes).
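The expected correlation can also be expressed as a regression slope. In quantitative genetics, the slope of a regression of mean offspring values on mid-parent values is commonly used as an estimate of heritability. The sketch below uses simulated broods, so demo.herit, the column names, and the built-in slope of 0.5 are all assumptions made for illustration:

```r
#Simulated broods (made-up values): offspring resemble their parents only partly
set.seed(3)
true.slope <- 0.5
demo.herit <- data.frame(parent.trait = rnorm(200, mean = 10, sd = 1))
demo.herit$offspring.trait <- 10 + true.slope*(demo.herit$parent.trait - 10) +
  rnorm(200, sd = 0.5)

#Regress offspring values on parental (mid-parent) values
fit.herit <- lm(offspring.trait ~ parent.trait, data = demo.herit)

#The slope serves as the heritability estimate
slope <- coef(fit.herit)[["parent.trait"]]
slope

#Confidence interval for the slope
confint(fit.herit)["parent.trait", ]
```

A slope near 0 suggests little heritable variation, whereas a slope closer to 1 suggests that offspring closely track their parents.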

The following dataset includes measurements of parental and offspring standard length as well as eye size.

#Use the read.csv function to import a dataset
heritability <- read.csv("heritability.csv", fileEncoding = 'UTF-8-BOM')
heritability

5.1. Heritability of Standard Length

First, let us explore whether there is evidence for heritability in standard length.

What do you observe? Is standard length a heritable trait?

ggplot(heritability, aes(x=parent.standard.length, y=offspring.standard.length)) + 
  geom_point() + 
  geom_smooth(method = "lm") +
  xlab("Parent Standard Length") + 
  ylab("Offspring Standard Length") +
  theme_classic()
`geom_smooth()` using formula 'y ~ x'

The points on the graph appear scattered, with no clear trend, so standard length does not appear to be a heritable trait among the cave mollies.


5.2. Heritability of Eye Size

Now let us explore whether there is any heritability in eye size. Remember, there is substantial variation in body size, and in such cases, we want to control for body size by calculating residual eye size first.

##Calculate residual eye sizes from regressions of eye size on standard length
##(the eye size column names come from the original attempt and are assumed to match the dataset)
fit.parent <- lm(parent.eye.size ~ parent.standard.length, data = heritability, na.action = na.exclude)
heritability$res.parent.eye.size <- residuals(fit.parent)
fit.offspring <- lm(offspring.eye.size ~ offspring.standard.length, data = heritability, na.action = na.exclude)
heritability$res.offspring.eye.size <- residuals(fit.offspring)

What do you observe? Is eye size a heritable trait?

Eye size appears to be heritable: the data points seem to follow a trend line.
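A scatter plot of size-corrected eye sizes could follow the same pattern as the plot in Section 5.1. The sketch below runs on a simulated stand-in (demo.herit) so that it works on its own; the eye size column names and all simulated values are assumptions, and the real heritability data frame would take the place of demo.herit:

```r
library(ggplot2)

#Simulated stand-in for the heritability data frame (made-up values);
#the eye size column names are assumed, not confirmed by the dataset description
set.seed(4)
demo.herit <- data.frame(parent.standard.length = runif(60, 25, 45))
demo.herit$offspring.standard.length <- demo.herit$parent.standard.length + rnorm(60, sd = 3)
demo.herit$parent.eye.size <- 0.08*demo.herit$parent.standard.length + rnorm(60, sd = 0.2)
demo.herit$offspring.eye.size <- 0.08*demo.herit$offspring.standard.length +
  0.5*(demo.herit$parent.eye.size - 0.08*demo.herit$parent.standard.length) +
  rnorm(60, sd = 0.1)

#Size-correct eye size separately for parents and offspring
demo.herit$res.parent.eye.size <- residuals(
  lm(parent.eye.size ~ parent.standard.length, data = demo.herit))
demo.herit$res.offspring.eye.size <- residuals(
  lm(offspring.eye.size ~ offspring.standard.length, data = demo.herit))

#Scatter plot of residual eye sizes with a linear trend line
p.eye <- ggplot(demo.herit, aes(x=res.parent.eye.size, y=res.offspring.eye.size)) + 
  geom_point() + 
  geom_smooth(method = "lm") +
  xlab("Parent Residual Eye Size") + 
  ylab("Offspring Residual Eye Size") +
  theme_classic()
p.eye
```

Parents and offspring are corrected against their own standard lengths because the two groups can differ in body size.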


6. What Would Happen If…?

Imagine for a moment that smaller fish have a higher likelihood of survival in the cave. Would you expect evolution of body size upon cave colonization?

Imagine for a moment that fish with smaller eyes have a higher likelihood of survival in the cave. Would you expect evolution of eye size upon cave colonization? Justify your response.

If smaller body size in the mollies increases the chances of survival, I would expect body size to decrease within the population. The smaller the molly, the more likely it is to survive and produce offspring, which in turn means genes for smaller body size spread and individuals with larger body size become less common. The same concept applies to eye size: if smaller eyes became more favorable, there would be an increase in mollies with smaller eyes in the population.


7. Resources


7.1. Data References

The eye size data was published in the following paper. Other measurements are unpublished data by M. Tobler.


7.2. Resources You Consulted

Consulting additional resources to solve this assignment is absolutely allowed, but failure to disclose those resources is plagiarism. Please list any collaborators you worked with and resources you used below or state that you have not used any.

  • Rpub
