Here I am trying to look at whether we can observe differences in GCC_90 based on early season soil management practices. I created labels for four classes of early season ground cover:
Burnt
Cleared
Thin Crop Bunches Two classes are made for “bunches”. These site look like they are harvested by hand. We should figure out what exactly they are. “Thin” bunches tend to be more intermittent, shorter, and often lay flat against the soil.
Thick Crop Bunches Thick “bunches” tend to stand upright and have much higher density of standing material.
Ground Litter
First step is to see if there are differences in the distribution of GCC for each group
unique(in_data$Soil_Updated)
## [1] NA "Cleared" "Growth, Not Sorted"
## [4] "Ground Litter" "Crop Bunches Thick" "Burnt"
## [7] "Crop Bunches Thin"
"./Data/sorted/Soil_Updated/Burnt/128940-5778e4db-3b35-4839-b6ce-4f0971c70bb2-1479277942199.jpg"
## [1] "./Data/sorted/Soil_Updated/Burnt/128940-5778e4db-3b35-4839-b6ce-4f0971c70bb2-1479277942199.jpg"
# assign labels to all sites where soil label is assigned
for(label in c( "Ground Litter","Crop Bunches Thick","Crop Bunches Thin" ,"Burnt","Cleared")){
in_data$final_soil_class[in_data$userfield %in% unique(in_data[in_data$Soil_Updated==label,'userfield'])] =label
}
# number of observations per class
table( in_data$final_soil_class)
##
## Burnt Cleared Crop Bunches Thick
## 95 1734 112
## Crop Bunches Thin Ground Litter
## 127 335
ggplot() +geom_boxplot(data=in_data[in_data$final_soil_class!='NA',],aes(x=final_soil_class, y=gcc_90,fill=factor(final_soil_class))) +
theme_minimal() +theme(axis.text.x = element_text(angle = 90, hjust = 1))+xlab('')
##
## Welch Two Sample t-test
##
## data: litter and cleared
## t = 8.0699, df = 479.12, p-value = 5.711e-15
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## 0.03119988 0.05128367
## sample estimates:
## mean of x mean of y
## 0.5593798 0.5181380
Although not that surprising given the graph we can see that there is a statistically significant difference between GCC values for cleared and ground with litter.
Now let’s see if there are differences in the maximum GCC-90 value by soil class (going forward this should be run on smoothed data).
max_data = in_data %>% group_by(final_soil_class,userfield) %>% summarise(`Max GCC_90` = median(gcc_90, na.rm = TRUE))
ggplot() +geom_boxplot(data=max_data[max_data$final_soil_class!='NA',],aes(x=final_soil_class, y=`Max GCC_90`,fill=factor(final_soil_class))) +
theme_minimal() +theme(axis.text.x = element_text(angle = 90, hjust = 1))+xlab('')
Looks like we can use this data to evaluate the impact of climate smart data practices on GCC or even NDVI. Clearly more work is needed here but hoping this gets a conversation going.