[Data Structure]
str(data1)
'data.frame': 69492 obs. of 12 variables:
$ j2dt : POSIXct, format: "2016-08-24 08:20:27" "2016-08-24 08:20:36" "2016-08-24 08:20:46" "2016-08-24 08:20:57" ...
$ lat : num 34.9 34.9 34.9 34.9 34.9 ...
$ long : num 129 129 129 129 129 ...
$ pressure : int 0 0 0 0 0 0 0 0 0 0 ...
$ temp : num 28.7 28.7 28.7 28.7 28.7 ...
$ salinity : num 28.3 28.4 28.4 28.4 28.4 ...
$ density1 : num 17.2 17.2 17.2 17.2 17.2 ...
$ density2 : num 17.2 17.2 17.2 17.2 17.2 ...
$ soundv : num 1536 1536 1536 1536 1536 ...
$ flag : num 0 0 0 0 0 0 0 0 0 0 ...
$ temp_flag: logi NA NA NA NA NA NA ...
$ sal_flag : logi NA NA NA NA NA NA ...
[Data summary]
summary(data1)
j2dt lat long pressure temp salinity density1 density2
Min. :2016-08-24 08:20:27 Min. :34.87 Min. :128.6 Min. :0 Min. :22.25 Min. :25.98 Min. :15.42 Min. :15.42
1st Qu.:2016-08-26 08:35:54 1st Qu.:34.99 1st Qu.:128.7 1st Qu.:0 1st Qu.:25.83 1st Qu.:30.69 1st Qu.:19.66 1st Qu.:19.66
Median :2016-08-28 08:51:22 Median :34.99 Median :128.7 Median :0 Median :26.61 Median :30.86 Median :19.78 Median :19.78
Mean :2016-08-28 08:51:22 Mean :34.99 Mean :128.7 Mean :0 Mean :26.68 Mean :30.92 Mean :19.75 Mean :19.75
3rd Qu.:2016-08-30 09:06:49 3rd Qu.:34.99 3rd Qu.:128.7 3rd Qu.:0 3rd Qu.:27.49 3rd Qu.:31.08 3rd Qu.:19.89 3rd Qu.:19.89
Max. :2016-09-01 09:22:17 Max. :35.04 Max. :129.0 Max. :0 Max. :28.67 Max. :32.18 Max. :21.93 Max. :21.93
soundv flag temp_flag sal_flag
Min. :1524 Min. :0 Mode :logical Mode :logical
1st Qu.:1532 1st Qu.:0 FALSE:68809 FALSE:67901
Median :1534 Median :0 TRUE :652 TRUE :1560
Mean :1534 Mean :0 NA's :31 NA's :31
3rd Qu.:1536 3rd Qu.:0
Max. :1538 Max. :0
Moving IQR TEST Result..(Windows size= 31, 중앙값 16번째 값 좌우로 15개씩 총 31개 단위데이터 이용) 따라서 1~15행, 마지막행-15행까지의 데이터는 버려짐.
[Temperature’s M-IQR 검출결과]
cat("Temperature ->>", "Outliers(M-IQR) :", nrow(temp_Outliers), " , ", "Passed :", nrow(temp_Passed))
Temperature ->> Outliers(M-IQR) : 652 , Passed : 68809
[Temperature’s 시각화 체크]
boxplot(data1$temp, col="lightgrey", horizontal = T, xlab="째C", ylab="", main="temperature")
hist(data1$temp, col="lightcyan", breaks = 100, probability = TRUE, xlab="째C")
[Temperature’s Dynamic Plotting]
p<-ggplot(data1, aes(x=j2dt, y=temp, col=temp_flag)) + geom_point(size=1) +
scale_color_manual(values=c("black", "red")) + ggtitle("temperature Plot")
ggplotly(p)
[Salinity’s M-IQR 검출결과]
cat("Salinity ->>", "Outliers(M-IQR) :", nrow(salinity_Outliers), " , ", "Passed :", nrow(salinity_Passed))
Salinity ->> Outliers(M-IQR) : 1560 , Passed : 67901
[Salinity’s 시각화 체크]
boxplot(data1$salinity, col="lightgrey", horizontal = T, xlab="", ylab="", main="Salinity")
hist(data1$salinity, col="lightcyan", breaks = 100, probability = TRUE, xlab="")
[Salinity’s Dynamic Plotting]
p<-ggplot(data1, aes(x=j2dt, y=salinity, col=sal_flag)) + geom_point(size=1) +
scale_color_manual(values=c("black", "red")) + ggtitle("Salinity Plot")
ggplotly(p)