This practical focuses on the field of species distribution modelling (SDM). Thus, we will combine locality data with environment information (in this case, only climate) to create a model that characterises the relationship between environment and distribution.
SDM has many uses, but in this practical we are going to:
In this practical, we are going to use an algorithm called Maximum Entropy (cool name, huh?) that has been implemented in the java software Maxent. We are going to skim the surface of SDM with MaxEnt, but for those of you who’re up for a challenge, Cory Merow’s paper (2013, “A practical guide to modeling species’ distributions”) is well worth a read.
The aims of this practical are to:
The four taxa that we will be modelling are:
Cyclopia intermedia E.Mey. (Fabaceae): This is the most widespread of species in the genus Cyclopia . It’s range extends from the Witteberg Mountains near Touwsrivier to the Van Stadens Mountains near Port Elizabeth, and is found on rocky, loam, sandy soil between elevations of 500–1700 m. Thus, this can be considered an inland mountain species. This species is a resprouter with myrmechochorous seeds.
Metalasia muricata (L.) D.Don (Asteraceae): This widespread and common species found in coastal to mountainous regions of southern Africa. Here we are focusing on the coastal form, which is as ecotype (or subspecies) within this lineage. This species is a reseeder and has tiny wind-dispersed seeds. It is often a pioneer shrub that is the first of the reseeders to appear after fire.
Olea exasperata Jacq. (Oleaceae) is a large, often multi-stemmed, shrub or tree that is usually found along the west and south coastlines growing in recent and ancient dune sands. It is also a post-fire resprouter — this species relies heavily on resprouting for persistence. It has white flowers which appear in spring mature into purple fruits that are bird-dispersed. In germination experiments, Cowling et al. (1997; J. Veg. Sci.) found that no seeds of O. exasperata germinated and that seedlings were rare in the Cape St Francis Dunes and Cape Peninsula.
Pappea capensis Eckl. & Zeyh. (Sapindaceae) has distribution that extends from the Little Karoo, into the Eastern Cape and up into KwaZulu-Natal, to the northern provinces, as well as Mozambique, Zimbabwe and northwards into eastern and southern tropical Africa. For the purposes of this practical, we can consider it an inland subtropical lineage (i.e. has affinities with the eastern part of SA). In the Eastern Cape, it forms a core member of the Albany Subtropical Thicket and is usually found inland. It has a red fruit that is dispersed by birds and browsing herbivores.
There are 10 folders in the zip file that are in the downloaded zip file:
Climate_LGM contains the same six bioclimate layers as the Current_Climate folder, but in this case these represent statistically-downscaled PMIP simulations; these are global circulation models run under LGM conditions at a coarse resolution (usually ~ 1 degree cells, hence the need for downscaling).
Climate_2070 contains the same six bioclimate layers as the Current_Climate folder, but with statistically-downscaled CMIP5 simulations; these are global circulation models run under future scenarios of CO2 conditions at a coarse resolution (again the need for downscaling). There are A LOT of future climate simulations (look here for a brief summary). I have provided the MIROC-ESM model simulations for 2017 under the RCP6 scenario. This scenario assumes that we do get our emission rate under control (i.e. it is not the worst case scenario simulation).
Locality_Dataset contains a .csv file with the localities (in the form of longitude and latitude coordinates for the species under investigation in this practical).
Maxent 4.1 contains the software to run the SDM algorithm, Maxent.
Out.S1 to Out.S6 are used to store the output of different MaxEnt analyses. Note that every time you run a MaxEnt analysis, it will overwrite the old analysis that is in the output folder. As you will need to compare between these analyses (e.g. present vs past and present vs future), it is much better to redirect each analysis to a new folder instead of re-running and overwriting the results the whole time.
You start the software by clicking on maxent.bat. If the software does not run, it means that you do not have java installed! Please click here to install Java..
When you open the software, you will get the interface below…Some descriptions of the interface:
Samples is where you specify the file that contains the species locality data. Browse to “Locality_Dataset/Locs.csv”
Environmental layers is where you specify the folder that contains the present-day bioclimatic layers. The software will intersect the locality points with these layers to build the model, thus it is crucial that modern locality points are used along with modern climate. It is entirely possible for you to specify the 2070 or LGM folder here, you will get a result — but that would very, very wrong! You would be training your model using modern locality points that intersect with a very distant past or a very different future environment. Please make sure you browse to the Climate_Current folder!
If you find that you your analyses are not generating any pictures, it is usually because this has been unticked (it is ticked by default).
Output directory: Please keep track of the output directory, and each time you go to a new section, change the output directory. As explained above, this will help you quickly get your results from each analysis for comparison purposes. Please use the Out.S1 to Out.S6 folders (the S1-S6 stands for sections 1 to 6).
Project layers directory/file: Specify the Climate_2070 or Climate_LGM folder here. It is unfortunate that you cannot specify multiple projection layers (i.e. future and past) in the same analysis. You need to do this separately (redirecting to a different output directory for each projection!),
Match up the figures below with the SDM predictions that you generated.
Q1.2: Please report the Training and Testing AUC statistics.
Q1.3: Look in the .html results file.
Q1.4: visit this site: http://worldclim.org/bioclim. Please only report those variables that you are using in your models.
Q1.5: go to the Analysis of variable contributions section in each *.html file.
Q1.8: Open “Locality.data.csv” in excel and highlight and hold cells (with SHIFT) - then look for the row count. ** Hint: Look in the “Raw data outputs and control parameters” section in the Maxent output .html file.
Run the analyses and open the .html files for each of the species (i.e. NOT the numbered _0.html etc. files).
Consider using the following terminology: contracting, expanding, shifting, loss of geographic area, gain of geographic area, fragmentation, merging/coalescing.
interpolation is a method of constructing new data points within the range of a discrete set of known data points — in this case, the interpolation estimates the climates between the climate stations (the “known data points”).↩