Team ID: 5
Name (tasks): Oscar Alvardo
Name (tasks): Karshima Johnson
Name (tasks): Christina Thai (Introduction, Background, Summary of Data, Visual of Probe Locations in Brain, Time Series Visual of Average Number of Neurons/Region, Kendall Tau’s correlation)
This document is the final report for the Spring 2019 course STA 141A at University of California, Davis. It seeks to examine whether activity in one region affects the activity in another region.
To reduce the variability of cortical responses to sensory stimuli, analyzing the neural spikes in multiple brain regions during spontaneous activity will yield more accurate, less noisy, results.
Using the data gathered from contributors Nick Steinmetz, Marius Pachitariu, Carsen Stringer, Matteo Carandini, and Kenneth Harris’ experiment, we seek to analyze the areas of the brain that are used in conjunction. The original experiment investigates fluctuations in arousal levels indicated by running, pupil area, and whisking. Our variable of interest is the cortical responses as these behaviors are ongoing.
In this project, we analyze the reduced data set Reduced_Krebs.mat containing Neuropixel recordings of a mouse named Krebs during spontaneous activity. The full data set can be found at this webpage.
The data used has already been preprocessed using a Matlab script, which can be found in the code Appendix of the report.
This report focuses on data collected from an experiment done with a mouse named Krebs. The neuron activation recorded from stimuli from whisking (measured by summed videographic motion energy within the whisker region). This data set contains spike trains of neurons in nine regions in the mouse brain.
The following represent elements of behavior data: avgframe Average Frame of Recording
faceSVD Singular Value Decompositon of Compressed Image of Face
motionMask Corresponding Masks to Singular Values
The data set being referenced Reduced_Krebs.mat contains the first 50 singular values and their masks. These are used to reproduce an image based on the average of the frames within the data set using infrared camera.
The following lists within the Krebs data represents the nine regions for Krebs in the mouse brain:
stall.CP Caudate Putamen
stall.FrMoCtx Frontal Motor Cortex
stall.HPF Hippocampal Formation
stall.LS Lateral Septum
stall.MB Midbrain
stall.SC Superior Colliculus
stall.SomMoCtxx Somatomotor Cortex
stall.TH Thalamus
stall.V1 Primary Visual Cortex
The data set contains spike trains of neurons in nine regions of the mouse brain. Each corresponding column represents a specific region of the brain; for example, the matrix stall.CP relates to the activity and activation of neurons located in the Caudate Putamen.
A more in depth explanation on the structure of our data is that it is a binary matrix with values of 1 denoting an existance of neural activity and 0 being that there is no neural activity at the specific time frame. Each row represents a neuron within the region.
A caption
The figure above is the reconstructed probe locations of recordings in Krebs. The poles are generated using the borders of the brain regions and map depth of the region in microns relative to position of the principal axes of the body (AP/DV/LR).
Note: AP refers the antero-posterior, DV refers to the dorso-ventral, and LR refers to the left-right axis formation.
We wish to examine which brain regions are being are activated(?) relatively simultaneously. In other words, we seek to examine if brain activity in the Thalamus can also imply activity in the Primary Visual Cortex.
Additionally, we want to examine neural activity throughout the entire time period and locate where there may be significant changes in neural activity.
The initial study monitored large populations of neurons in awake head-fixed mice. These mice were unengaged from behavioral task and then their spontaneously performed behaviors (i.e. whisking, sniffing, and other facial movements) were monitored videographically. The study recorded simultaneous neurological data of six mice over nine sessions and worked to identify the relationship between these behaviors and their neural activity.
Further, the studies were testing in various settings such as with and without darkness and recordings in darkness or with a gray screen. They observed no differences between these recordings, thus we take it to imply that Krebs’ sample data is representative of the mouse’s overall behavior in this experiment.
In addition, we acknowledge that mouse models for genetic research prove to be excellent tools for insight into immune, endocrine, cardiovascular, skeletal, and in particular, nervous systems shared by mammals. Thus conclusions obtained through the study may become applicable to humans.
We choose to examine the pairwise correlations with Kendall’s method with ranks since this method does not need any assumptions on the distribution. This non-parametric correlation method measures the number of discordant and concordant pairs. Assume there are pairs \((X_i, Y_i)\) and \((X_j,Y_j)\) where \(i < j\). If \(X_i < X_j\) and \(Y_i < Y_j\), then \((X_i - X_j) \times (Y_i - Y_j) > 0\) and \((X_i, Y_i)\) and \((X_j,Y_j)\). Thus \((X_i, Y_i)\) \((X_j,Y_j)\) is a concordant pair and an increase in X implies an increase in Y. Similar logic is applied to the definition of discordant pairs, which can be seen in the mathematical definition: \((X_i - X_j) \times (Y_i - Y_j) < 0\). Tied pairs mean \((X_i - X_j) \times (Y_i - Y_j) = 0\) Kendall’s \(\tau = \frac{C - D}{n(n-1)/2)}\), where C is the number of concordant pairs and D represents discordant pairs. If most pairs are concordant, then there is a postive linear relationship. Similarly, if most pairs are discordinant, then the data suggests a negative linear relationship. If there are an equal number of discordant and concordant pairs, this suggests no linear relationship, i.g. X and Y are independent.
– Karishma will add (summary of data) !!!!!!!!!!!!!!!!!!!!!!!!!!!!
Recall that the brain regions are: stall.CP Caudate Putamen
stall.FrMoCtx Frontal Motor Cortex
stall.HPF Hippocampal Formation
stall.LS Lateral Septum
stall.MB Midbrain
stall.SC Superior Colliculus
stall.SomMoCtxx Somatomotor Cortex
stall.TH Thalamus
stall.V1 Primary Visual Cortex
From the above plot, we can see that the time from of most activity is between 20000 and 29000ms. Thus, we adjust our interval accordingly to analyze correlation.
Since plotting each proportion of neurons firing at each millisecond is extremely volatile, we proposed to analyze the average proportion of neurons firing at a new interval 1 through 1302 where each “time” is considered a time window of 30ms.
The above plot shows the variance of each 30ms time window for each brain region.We choose to examine correlation using the Kendall method because it does not require the assumption of a linear relationship. Another reason why we chose to use Kendall Tau’s method for correlation because the ranking of data points mitigates the effects of outliers.
Kendall’s correlation is much smaller value, which we believe is due to removing
Note: Discuss the limitations of the presented projects, and comment on how this project enlightens future research or analysis.
## R version 3.6.0 (2019-04-26)
## Platform: x86_64-apple-darwin15.6.0 (64-bit)
## Running under: macOS Mojave 10.14
##
## Matrix products: default
## BLAS: /Library/Frameworks/R.framework/Versions/3.6/Resources/lib/libRblas.0.dylib
## LAPACK: /Library/Frameworks/R.framework/Versions/3.6/Resources/lib/libRlapack.dylib
##
## attached base packages:
## [1] stats graphics grDevices utils datasets methods base
##
## other attached packages:
## [1] corrplot_0.84 ggcorrplot_0.1.3 ggplot2_3.1.1 dplyr_0.8.0.1
## [5] dygraphs_1.1.1.6 R.matlab_3.6.2
##
## loaded via a namespace (and not attached):
## [1] Rcpp_1.0.1 plyr_1.8.4 compiler_3.6.0
## [4] pillar_1.3.1 highr_0.8 R.methodsS3_1.7.1
## [7] R.utils_2.8.0 tools_3.6.0 xts_0.11-2
## [10] digest_0.6.18 jsonlite_1.6 evaluate_0.13
## [13] tibble_2.1.1 gtable_0.3.0 lattice_0.20-38
## [16] pkgconfig_2.0.2 rlang_0.3.4 yaml_2.2.0
## [19] xfun_0.6 withr_2.1.2 stringr_1.4.0
## [22] knitr_1.22 htmlwidgets_1.3 grid_3.6.0
## [25] tidyselect_0.2.5 glue_1.3.1 R6_2.4.0
## [28] rmarkdown_1.12 purrr_0.3.2 magrittr_1.5
## [31] scales_1.0.0 htmltools_0.3.6 assertthat_0.2.1
## [34] colorspace_1.4-1 stringi_1.4.3 lazyeval_0.2.2
## [37] munsell_0.5.0 crayon_1.3.4 zoo_1.8-5
## [40] R.oo_1.22.0