Introduction

TBD

Please find below some general information on the database.

## Reading layer `TANZANIA_TOT_DATA' from data source 
##   `/Users/c055717/Desktop/Tanzania/TANZANIA_TOT_ORIGINAL_DATA/TANZANIA_TOT_DATA.shp' 
##   using driver `ESRI Shapefile'
## Simple feature collection with 47417 features and 18 fields
## Geometry type: POINT
## Dimension:     XY
## Bounding box:  xmin: 30.33111 ymin: -11.1296 xmax: 39.0658 ymax: -1.003332
## Geodetic CRS:  WGS 84
## [1] "Number of points: 44391"

pH

Below is the original data distribution along with a comprehensive summary of its statistics. The data distribution provides an overview of how the data is spread across various values, while the statistical summary offers key measures to understand the central tendency, dispersion, and overall characteristics of the dataset.

##     Min.  1st Qu.   Median     Mean  3rd Qu.     Max.     NA's 
##    3.354    5.659    6.100   27.922    6.587 7194.000        1

pH Outliers

We have established an expected pH range of up to 14 to ensure that our analysis remains within physically valid boundaries. Below, you will find the updated distribution of pH values, adhering to this predefined range. Additionally, we have identified and recorded the number of outliers for each data source.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
##   3.354   5.657   6.097   6.186   6.581   9.516       1
## 
## Additional_Soil_Data 
##                  170

Total carbon

Below is the original data distribution along with a comprehensive summary of its statistics. The data distribution provides an overview of how the data is spread across various values, while the statistical summary offers key measures to understand the central tendency, dispersion, and overall characteristics of the dataset.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
##    0.15    7.01   10.52   22.59   15.90 2741.00   19876

TC outliers

We have established an expected TC range below 50 g/kg to ensure that our analysis remains within physically valid boundaries. Below, you will find the updated distribution of TC values, adhering to this predefined range. Additionally, we have identified and recorded the number of outliers for each data source.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
##    0.15    6.95   10.33   12.38   15.38   50.00   19876
## 
## Additional_Soil_Data       iSDA_Soil_Data 
##                  176                  436

Organic carbon

Below is the original data distribution along with a comprehensive summary of its statistics. The data distribution provides an overview of how the data is spread across various values, while the statistical summary offers key measures to understand the central tendency, dispersion, and overall characteristics of the dataset.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
##    0.41    2.22    6.11   32.83   12.32  999.00     836

OC outliers

We have established an expected OC range below 40 g/kg to ensure that our analysis remains within physically valid boundaries. Below, you will find the updated distribution of OC values, adhering to this predefined range. Additionally, we have identified and recorded the number of outliers for each data source.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
##   0.410   2.134   5.600   7.827  11.270  40.000     836
## 
## iSDA_Soil_Data   TZ_Soil_Data 
##            727           1358

Nitrogen

Below is the original data distribution along with a comprehensive summary of its statistics. The data distribution provides an overview of how the data is spread across various values, while the statistical summary offers key measures to understand the central tendency, dispersion, and overall characteristics of the dataset

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##    0.01    0.61    1.24   32.02   55.00  208.00

N outliers

We have established an expected N range below 10 g/kg to ensure that our analysis remains within physically valid boundaries. Below, you will find the updated distribution of N values, adhering to this predefined range. Additionally, we have identified and recorded the number of outliers for each data source.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##  0.0100  0.4300  0.7200  0.8808  1.0900  9.8100
## 
## Additional_Soil_Data       iSDA_Soil_Data         TZ_Soil_Data 
##                  169                    2                17914

Phosohorus

Below is the original data distribution along with a comprehensive summary of its statistics. The data distribution provides an overview of how the data is spread across various values, while the statistical summary offers key measures to understand the central tendency, dispersion, and overall characteristics of the dataset.

##      Min.   1st Qu.    Median      Mean   3rd Qu.      Max.      NA's 
##     0.680     3.378     4.813    50.791     6.535 12667.000     26705

P outliers

We have established an expected P range below 100 mg/kg to ensure that our analysis remains within physically valid boundaries. Below, you will find the updated distribution of P values, adhering to this predefined range. Additionally, we have identified and recorded the number of outliers for each data source.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
##   0.680   3.356   4.762   5.243   6.432  46.640   26705
## 
## Additional_Soil_Data         TZ_Soil_Data 
##                  170                  170

Potassium

Below is the original data distribution along with a comprehensive summary of its statistics. The data distribution provides an overview of how the data is spread across various values, while the statistical summary offers key measures to understand the central tendency, dispersion, and overall characteristics of the dataset.

##     Min.  1st Qu.   Median     Mean  3rd Qu.     Max. 
##      3.9    182.2    399.0   5086.9   8580.0 218127.0

K outliers

We have established an expected K range below 1000 mg/kg to ensure that our analysis remains within physically valid boundaries. Below, you will find the updated distribution of K values, adhering to this predefined range. Additionally, we have identified and recorded the number of outliers for each data source.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##     3.9   123.5   216.6   249.9   337.0   999.5
## 
## Additional_Soil_Data       iSDA_Soil_Data         TZ_Soil_Data 
##                  536                  661                17776

Calcium

Below is the original data distribution along with a comprehensive summary of its statistics. The data distribution provides an overview of how the data is spread across various values, while the statistical summary offers key measures to understand the central tendency, dispersion, and overall characteristics of the dataset.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##       2     501    1083   20861    5539 8573200

Ca outliers

We have established an expected Ca range below 4000 mg/kg to ensure that our analysis remains within physically valid boundaries. Below, you will find the updated distribution of Ca values, adhering to this predefined range. Additionally, we have identified and recorded the number of outliers for each data source.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##     2.0   396.2   703.2   974.7  1278.0  3999.3
## 
##        Additional_Soil_Data              iSDA_Soil_Data 
##                         535                        3207 
## TAMASA_TZ_CC_Soil_2015_Data                TZ_Soil_Data 
##                           2                        9466

Iron

Below is the original data distribution along with a comprehensive summary of its statistics. The data distribution provides an overview of how the data is spread across various values, while the statistical summary offers key measures to understand the central tendency, dispersion, and overall characteristics of the dataset.

##     Min.  1st Qu.   Median     Mean  3rd Qu.     Max. 
##     2.53    43.58    74.72   219.07   105.14 63605.00

Fe outliers

We have established an expected Fe range below 400 mg/kg to ensure that our analysis remains within physically valid boundaries. Below, you will find the updated distribution of Fe values, adhering to this predefined range. Additionally, we have identified and recorded the number of outliers for each data source.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   2.527  30.188  34.249  33.261  37.295  39.999
## 
## Additional_Soil_Data       iSDA_Soil_Data         TZ_Soil_Data 
##                  173                    2                   25

Zinc

Below is the original data distribution along with a comprehensive summary of its statistics. The data distribution provides an overview of how the data is spread across various values, while the statistical summary offers key measures to understand the central tendency, dispersion, and overall characteristics of the dataset.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
##    0.24  398.75  521.00  488.72  659.00 1165.00   26705

Zn outliers

We have established an expected Zn range below 10 mg/kg to ensure that our analysis remains within physically valid boundaries. Below, you will find the updated distribution of P values, adhering to this predefined range. Additionally, we have identified and recorded the number of outliers for each data source.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
##   0.240   0.520   0.680   0.765   1.016   2.270   26705
## 
## Additional_Soil_Data         TZ_Soil_Data 
##                  175                17260

Sulfur

Below is the original data distribution along with a comprehensive summary of its statistics. The data distribution provides an overview of how the data is spread across various values, while the statistical summary offers key measures to understand the central tendency, dispersion, and overall characteristics of the dataset.

##      Min.   1st Qu.    Median      Mean   3rd Qu.      Max.      NA's 
##      1.78     36.66     51.77    425.77     71.82 111007.00     26705

S outliers

We have established an expected S range below 125 mg/kg to ensure that our analysis remains within physically valid boundaries. Below, you will find the updated distribution of P values, adhering to this predefined range. Additionally, we have identified and recorded the number of outliers for each data source.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
##    1.78   36.35   51.08   54.36   70.02  124.91   26705
## 
##        Additional_Soil_Data TAMASA_TZ_CC_Soil_2015_Data 
##                         172                           1 
##                TZ_Soil_Data 
##                         291

Aluminium

Below is the original data distribution along with a comprehensive summary of its statistics. The data distribution provides an overview of how the data is spread across various values, while the statistical summary offers key measures to understand the central tendency, dispersion, and overall characteristics of the dataset.

##     Min.  1st Qu.   Median     Mean  3rd Qu.     Max. 
##       80      711     1197    57528    12170 23688180

Al outliers

We have established an expected Al range below 3000 mg/kg to ensure that our analysis remains within physically valid boundaries. Below, you will find the updated distribution of Al values, adhering to this predefined range. Additionally, we have identified and recorded the number of outliers for each data source.

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##    80.5   590.3   753.5   813.5   982.9  2514.8
## 
## Additional_Soil_Data         TZ_Soil_Data 
##                  556                19876

Benchmark Outliers

In the analysis, we found a total of 171 values in the pH_outliers, similar to the number of outliers coming from Additional_data we found in TC_outliers. The number of common IDs between pH_outliers and TC_outliers is 157.

## Warning in geom_point(data = subset_common, aes(x = subset_common$Lon, y =
## subset_common$Lat), : Ignoring unknown parameters: `shp`

Please find below the a sample of the data.

First 30 Lines of the Data Frame
ID LAYER Lat Lon Upper Lower TC OC pH N P K Ca Fe Zn Al S geometry
364 363 Additional_Soil_Data -1.04463 31.84145 0 20 425 NA 6727 0.02 2198.00 19890.0 2794800 43287.00 706.00 7744410.0 27898.00 POINT (31.84145 -1.044626)
365 364 Additional_Soil_Data -1.04722 31.84218 0 20 757 NA 6347 0.04 3.95 23010.0 718200 33646.00 694.00 10882620.0 26053.00 POINT (31.84218 -1.047222)
366 365 Additional_Soil_Data -1.09400 31.80489 20 50 829 NA 6414 46.00 4977.00 14430.0 356600 60.92 796.00 11359440.0 27619.00 POINT (31.80489 -1.093997)
367 366 Additional_Soil_Data -1.53522 31.59704 20 50 1097 NA 6808 53.00 7821.00 3510.0 262 63605.00 708.00 15618420.0 30872.00 POINT (31.59704 -1.535225)
368 367 Additional_Soil_Data -1.67574 31.51187 20 50 1724 NA 6055 0.08 6656.00 14820.0 53800 41747.00 694.00 5914350.0 51.27 POINT (31.51187 -1.675735)
369 368 Additional_Soil_Data -1.48693 31.49154 20 50 1506 NA 5798 69.00 2759.00 15210.0 211600 30244.00 0.92 12900.6 37116.00 POINT (31.49154 -1.486928)
370 369 Additional_Soil_Data -1.63575 31.51893 20 50 1513 NA 6493 55.00 14.94 20280.0 503400 31.71 1012.00 9459720.0 31686.00 POINT (31.51893 -1.635749)
371 370 Additional_Soil_Data -1.36651 31.47704 20 50 2132 NA 5458 99.00 2083.00 3.9 62600 19802.00 439.00 7777080.0 49803.00 POINT (31.47704 -1.366511)
372 371 Additional_Soil_Data -1.38787 31.50790 20 50 1186 NA 5437 61.00 3651.00 5460.0 60400 27295.00 745.00 19571490.0 46938.00 POINT (31.5079 -1.387866)
373 372 Additional_Soil_Data -1.60734 31.40310 20 50 2031 NA 5791 101.00 4073.00 23790.0 412600 37459.00 0.95 16957080.0 95118.00 POINT (31.4031 -1.607336)
374 373 Additional_Soil_Data -1.43439 31.64213 20 50 1835 NA 5199 84.00 4485.00 8970.0 254 52447.00 652.00 22654890.0 48176.00 POINT (31.64213 -1.434391)
375 374 Additional_Soil_Data -1.54211 31.60833 20 50 2038 NA 6513 82.00 4412.00 38220.0 603800 48127.00 992.00 15311160.0 32184.00 POINT (31.60833 -1.542114)
376 375 Additional_Soil_Data -1.42965 31.62412 20 50 935 NA 6236 37.00 4581.00 9360.0 887200 48308.00 748.00 12709170.0 15925.00 POINT (31.62412 -1.429652)
377 376 Additional_Soil_Data -1.41044 31.52263 20 50 1864 NA 6349 72.00 10.96 5850.0 404400 38307.00 763.00 9691650.0 23842.00 POINT (31.52263 -1.410441)
378 377 Additional_Soil_Data -1.56083 31.56310 20 50 1171 NA 6524 48.00 12354.00 10920.0 39000 45702.00 673.00 12132180.0 45364.00 POINT (31.5631 -1.56083)
379 378 Additional_Soil_Data -1.41106 31.57236 20 50 1585 NA 6266 31.00 1.41 3120.0 119400 17533.00 347.00 17587.8 40313.00 POINT (31.57236 -1.411064)
380 379 Additional_Soil_Data -1.14178 31.84513 20 50 896 NA 6585 31.00 2684.00 27.3 484600 43291.00 826.00 9590940.0 43544.00 POINT (31.84513 -1.141778)
381 380 Additional_Soil_Data -1.58724 31.55166 20 50 943 NA 6776 48.00 8662.00 10530.0 803600 39698.00 586.00 8662680.0 47.26 POINT (31.55166 -1.587242)
384 383 Additional_Soil_Data -1.62507 31.37824 20 50 1728 NA 6381 91.00 6631.00 26910.0 276800 38101.00 1032.00 11049210.0 27465.00 POINT (31.37824 -1.625067)
386 385 Additional_Soil_Data -1.53729 31.58123 20 50 1369 NA 6344 53.00 6.34 12480.0 948200 36325.00 488.00 9939240.0 31204.00 POINT (31.58123 -1.537294)
387 386 Additional_Soil_Data -1.41519 31.55418 20 50 1631 NA 5982 0.06 4242.00 8580.0 327400 51.97 0.51 15233130.0 54794.00 POINT (31.55418 -1.415194)
388 387 Additional_Soil_Data -1.11444 31.82287 20 50 862 NA 6083 41.00 3869.00 40170.0 682600 46.59 542.00 12813930.0 37.48 POINT (31.82287 -1.114441)
389 388 Additional_Soil_Data -1.64292 31.35963 20 50 1446 NA 5881 46.00 3577.00 11310.0 119200 34784.00 443.00 14125050.0 45282.00 POINT (31.35963 -1.64292)
390 389 Additional_Soil_Data -1.41173 31.63538 20 50 1974 NA 5833 108.00 7133.00 10530.0 560 38383.00 679.00 22100310.0 33.46 POINT (31.63538 -1.41173)
391 390 Additional_Soil_Data -1.39752 31.51503 20 50 1117 NA 5348 41.00 2203.00 13650.0 311400 42022.00 0.58 16475670.0 31499.00 POINT (31.51503 -1.39752)
392 391 Additional_Soil_Data -1.47201 31.60569 20 50 1608 NA 6516 89.00 6748.00 13260.0 520400 35416.00 842.00 10949850.0 28668.00 POINT (31.60569 -1.472012)
393 392 Additional_Soil_Data -1.51785 31.46398 20 50 2036 NA 5823 87.00 4236.00 14040.0 164 34691.00 0.74 15840360.0 62375.00 POINT (31.46398 -1.517846)
394 393 Additional_Soil_Data -1.64092 31.35926 20 50 1753 NA 6016 59.00 2867.00 8970.0 346400 49.65 832.00 12674340.0 55595.00 POINT (31.35926 -1.640918)
395 394 Additional_Soil_Data -1.42020 31.58809 20 50 1764 NA 6715 0.06 6932.00 20280.0 586400 34267.00 741.00 11517930.0 36209.00 POINT (31.58809 -1.4202)
396 395 Additional_Soil_Data -1.47201 31.60569 0 20 1724 NA 6509 71.00 7056.00 32370.0 618 45444.00 1.18 8123490.0 19114.00 POINT (31.60569 -1.472012)