Tugas 10

EKONOMETRIKA

DHELA A. AGATHA (20214920009)

April 22, 2024


Hallo guys,

Berikut ini saya lampirkan tugas kalian untuk pertemuan 10 pada Matakuliah Ekonometrika. Dikumpulkan pada tanggal tertera. Silahkan untuk mengikuti instruksi berikut:

  1. Mendefenisikan Permasalahan
  2. Data Preparation
  3. Exploratory Data Analysis (EDA)
  4. Data Preprocessing
  5. Model Building (Perbandingan 5 model)
  6. Model Evaluation (dengan menggunakan ANOVA, RMSE, Rsquared)
  7. Conclusion

Catatan: Kerjakan proses tersebut dengan menggunakan Rmarkdwon dan kumpulkan link Rpubsnya disini!

Mendefenisikan Permasalahan

Suatu hari, Xander ingin memprediksi harga dari suatu rumah dengan beberapa fasilitas yang memumpuni seperti:

ID - ID Date - Tanggal Price - Harga Bedrooms - Jumalh Kamar Tidur Bathrooms - Jumlah Kamar Mandi Sqft Living - Luas Bangunan (in foot) Sqft Lot - Luas Tanah (in foot) Floors - Jumlah Lantai Waterfront - Bersebelahan dengan Air seperti danau. View - Gazing/ Pemandangan Condition - KondisiRumah Grade - Grade Bangunan Sqft Above - Luas Bangunan Di Atas Tanah Sqft Basement - Luas Basement Year Built - Tahun Dibangun Year Renovated - Tahun Direnovasi terakhir Zipcode - Kode Pos Latitude - Lintang Longitude - Bujur Sqft Living15 - Luas Ruang Hunian pada 2015 (dalam kaki persegi) Sqft Lot15 - Luas Lahan pada 2015 (dalam kaki persegi).

house_data Preparation

## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
## ✔ dplyr     1.1.4     ✔ readr     2.1.5
## ✔ forcats   1.0.0     ✔ stringr   1.5.1
## ✔ ggplot2   3.5.0     ✔ tibble    3.2.1
## ✔ lubridate 1.9.3     ✔ tidyr     1.3.1
## ✔ purrr     1.0.2     
## ── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
## ✖ dplyr::filter() masks stats::filter()
## ✖ dplyr::lag()    masks stats::lag()
## ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors

Exploratory Data Analysis (EDA)

summary(house)
##        id                date               price            bedrooms     
##  Min.   :1.000e+06   Length:21613       Min.   :  75000   Min.   : 0.000  
##  1st Qu.:2.123e+09   Class :character   1st Qu.: 321950   1st Qu.: 3.000  
##  Median :3.905e+09   Mode  :character   Median : 450000   Median : 3.000  
##  Mean   :4.580e+09                      Mean   : 540088   Mean   : 3.371  
##  3rd Qu.:7.309e+09                      3rd Qu.: 645000   3rd Qu.: 4.000  
##  Max.   :9.900e+09                      Max.   :7700000   Max.   :33.000  
##    bathrooms      sqft_living       sqft_lot           floors     
##  Min.   :0.000   Min.   :  290   Min.   :    520   Min.   :1.000  
##  1st Qu.:1.750   1st Qu.: 1427   1st Qu.:   5040   1st Qu.:1.000  
##  Median :2.250   Median : 1910   Median :   7618   Median :1.500  
##  Mean   :2.115   Mean   : 2080   Mean   :  15107   Mean   :1.494  
##  3rd Qu.:2.500   3rd Qu.: 2550   3rd Qu.:  10688   3rd Qu.:2.000  
##  Max.   :8.000   Max.   :13540   Max.   :1651359   Max.   :3.500  
##    waterfront            view          condition         grade       
##  Min.   :0.000000   Min.   :0.0000   Min.   :1.000   Min.   : 1.000  
##  1st Qu.:0.000000   1st Qu.:0.0000   1st Qu.:3.000   1st Qu.: 7.000  
##  Median :0.000000   Median :0.0000   Median :3.000   Median : 7.000  
##  Mean   :0.007542   Mean   :0.2343   Mean   :3.409   Mean   : 7.657  
##  3rd Qu.:0.000000   3rd Qu.:0.0000   3rd Qu.:4.000   3rd Qu.: 8.000  
##  Max.   :1.000000   Max.   :4.0000   Max.   :5.000   Max.   :13.000  
##    sqft_above   sqft_basement       yr_built     yr_renovated   
##  Min.   : 290   Min.   :   0.0   Min.   :1900   Min.   :   0.0  
##  1st Qu.:1190   1st Qu.:   0.0   1st Qu.:1951   1st Qu.:   0.0  
##  Median :1560   Median :   0.0   Median :1975   Median :   0.0  
##  Mean   :1788   Mean   : 291.5   Mean   :1971   Mean   :  84.4  
##  3rd Qu.:2210   3rd Qu.: 560.0   3rd Qu.:1997   3rd Qu.:   0.0  
##  Max.   :9410   Max.   :4820.0   Max.   :2015   Max.   :2015.0  
##     zipcode           lat             long        sqft_living15 
##  Min.   :98001   Min.   :47.16   Min.   :-122.5   Min.   : 399  
##  1st Qu.:98033   1st Qu.:47.47   1st Qu.:-122.3   1st Qu.:1490  
##  Median :98065   Median :47.57   Median :-122.2   Median :1840  
##  Mean   :98078   Mean   :47.56   Mean   :-122.2   Mean   :1987  
##  3rd Qu.:98118   3rd Qu.:47.68   3rd Qu.:-122.1   3rd Qu.:2360  
##  Max.   :98199   Max.   :47.78   Max.   :-121.3   Max.   :6210  
##    sqft_lot15    
##  Min.   :   651  
##  1st Qu.:  5100  
##  Median :  7620  
##  Mean   : 12768  
##  3rd Qu.: 10083  
##  Max.   :871200
# Distribusi Histogram
house %>%
  keep(is.numeric) %>%
  gather(key = "variables", value = "values") %>%
  ggplot(aes(x = values)) +
  geom_histogram(bins = 30, fill = "blue", color = "black") +
  facet_wrap(~variables, scales = "free_x")

# Megecek NA
colSums(is.na(house))
##            id          date         price      bedrooms     bathrooms 
##             0             0             0             0             0 
##   sqft_living      sqft_lot        floors    waterfront          view 
##             0             0             0             0             0 
##     condition         grade    sqft_above sqft_basement      yr_built 
##             0             0             0             0             0 
##  yr_renovated       zipcode           lat          long sqft_living15 
##             0             0             0             0             0 
##    sqft_lot15 
##             0

Catatan:

Berikut adalah penjelasan singkat untuk setiap variabel penting , sebagai berikut:

Harga (Price)

summary(house$price)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   75000  321950  450000  540088  645000 7700000

Harga mulai dari 75.000 USD Sampai 7.700.000 USD Dengan Mean senilai 540.088 USD

Kamar Tidur (Bedrooms)

summary (house$bedrooms)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   0.000   3.000   3.000   3.371   4.000  33.000

Tersedia dari 0 sampai 33 Kamar dengan Mean sekitar 3-4 Kamar.

Kamar Mandi (Bathrooms)

summary (house$bathrooms)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   0.000   1.750   2.250   2.115   2.500   8.000

Mulai dari 0-8 kamar Mandi dengan rata-rata memiliki 2-3 Kamar Mandi.

Luas Bangunan

summary (house$sqft_living)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##     290    1427    1910    2080    2550   13540

Luas Bangunan dimuali dari 290 sqft sampai 13.540 sqft dengan Mean sebesar 2.080 SQFT.

Luas Tanah (Sqft Lot)

summary (house$sqft_lot)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##     520    5040    7618   15107   10688 1651359

Luas Tanah Dimulai dari 520 SQFT sampai 1.651.359 SQFT (Yallah Luas Sekali) dengan Mean Luas Tanah 15.107 SQFT.

Lantai (Floors)

summary (house$floors)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##   1.000   1.000   1.500   1.494   2.000   3.500

Rumah yang dijual memiliki 1 sampai 3,5 Lantai dengan Rata-rata kebanyakan memiliki 1-2 Lantai.

Waterfront

library(dplyr)
house %>% count(waterfront)

Sebanyak 21430 Rumah tidak memiliki view Danau,hanya sekitar 163 yang memilikinya.

View

library(dplyr)
house %>% count(view)

Kebanyakan rumah memili view yang jelek (0) dibandingkan dengan Kategori 1-4.

Kondisi (Condition)

library(dplyr)
house %>% count(condition)

Diketahui bahwa Rata-rata Bangunan kebanyakan berstatus Okay sampai Sangat Bagus jika mengacu bahwa Variabel Kondisi emnggunakan Skala Likert.

Kualitas Konstruksi (Grade)

library(dplyr)
house %>% count(grade)

Diketahui bahwa Rata-rata Bangunan kebanyakan berstatus Okay sampai Sangat Bagus jika mengacu bahwa Variabel Kondisi emnggunakan Skala Likert.

Luas Bangunan Utama (Sqft Above)

summary(house$sqft_above)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##     290    1190    1560    1788    2210    9410

Luas Bangunan dari 290-9410 SQFT Dengan Mean 1.788 SQFT.

Luas Basement (Sqft Basement)

summary(house$sqft_basement)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##     0.0     0.0     0.0   291.5   560.0  4820.0

Keterangan: Luas basement dari 0 (Tidak ada) hingga 4.820 kaki persegi. Degan Mean 291.5 SQFT

Tahun Dibangun (Year Built)

library(dplyr)
house %>% count(yr_built)

Keterangan: Tahun properti dibangun berkisar dari tahun 1900 hingga 2015.

Tahun Direnovasi (Year Renovated)

library(dplyr)
house %>% count(yr_renovated)

Kebanyakan Rumah Tidak Pernah Di Renovasi.

Kode Pos (Zipcode)

library(dplyr)
house %>% count(zipcode)

ZipCode menunjukkan Banyaknya Rumah DIjual di Daerah Kode Pos TErsedebut.

Luas Hunian di 2015 (Sqft Living15)

summary(house$sqft_living15)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##     399    1490    1840    1987    2360    6210

Keterangan: Menunjukkan luas hunian seperti yang tercatat pada tahun 2015, dari 399 hingga 6210 kaki persegi. Dengan Mean 1987

Luas Tanah di 2015 (Sqft Lot15)

summary(house$sqft_lot15)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##     651    5100    7620   12768   10083  871200

Keterangan: Menunjukkan luas tanah seperti yang tercatat pada tahun 2015, dari 651 hingga 871,200 kaki persegi. Dengan Mean 12.768

Data Preprocessing

mengecek dan menangani nilai yang hilang - dan mengisi nilai yang hilang dengan median

house <- house %>% 
  mutate_if(is.numeric, ~ifelse(is.na(.), median(., na.rm = TRUE), .))

Mengubah data house menjadi Distribusi Normal - Contoh menggunakan penskalaan min-max

house <- house %>%
  mutate_if(is.numeric, ~(. - min(.)) / (max(.) - min(.)))

Model Building (Perbandingan 5 model)

library(nnet)
library(rpart)
library(class)
library(stats)

Menyiapkan house untuk model

house <- na.omit(house)  # menghilangkan baris yang NA

Membagi house menjadi house latihan dan house uji

set.seed(123)
indices <- sample(1:nrow(house), size = 0.75 * nrow(house))
train_house <- house[indices, ]
test_house <- house[-indices, ]

Model Linear Regresi (Linear Model)

linmod <- lm(price ~ .-id -zipcode ,  train_house)

Decision Tree

model_tree <- rpart(price ~ .-id,  train_house)

Linear Model

summary(linmod)
## 
## Call:
## lm(formula = price ~ . - id - zipcode, data = train_house)
## 
## Residuals:
##      Min       1Q   Median       3Q      Max 
## -0.16595 -0.01282 -0.00112  0.01004  0.56798 
## 
## Coefficients: (1 not defined because of singularities)
##                       Estimate Std. Error t value Pr(>|t|)    
## (Intercept)         -8.824e-02  4.335e-03 -20.355  < 2e-16 ***
## date20140503T000000  2.547e-02  1.557e-02   1.637 0.101745    
## date20140504T000000 -9.066e-03  1.361e-02  -0.666 0.505314    
## date20140505T000000  1.423e-03  4.994e-03   0.285 0.775644    
## date20140506T000000 -2.352e-04  5.144e-03  -0.046 0.963535    
## date20140507T000000  2.021e-03  4.873e-03   0.415 0.678328    
## date20140508T000000  7.719e-03  5.028e-03   1.535 0.124798    
## date20140509T000000 -1.937e-03  5.065e-03  -0.382 0.702177    
## date20140510T000000  3.887e-03  1.229e-02   0.316 0.751843    
## date20140511T000000  1.825e-02  2.642e-02   0.691 0.489740    
## date20140512T000000  2.324e-03  4.978e-03   0.467 0.640625    
## date20140513T000000  5.667e-03  5.028e-03   1.127 0.259772    
## date20140514T000000  2.998e-03  5.012e-03   0.598 0.549694    
## date20140515T000000 -2.729e-03  5.046e-03  -0.541 0.588645    
## date20140516T000000 -3.214e-04  5.103e-03  -0.063 0.949787    
## date20140517T000000  6.173e-03  2.642e-02   0.234 0.815290    
## date20140518T000000  3.392e-03  1.229e-02   0.276 0.782548    
## date20140519T000000  4.398e-04  5.104e-03   0.086 0.931322    
## date20140520T000000  1.422e-03  4.753e-03   0.299 0.764833    
## date20140521T000000 -6.432e-04  4.917e-03  -0.131 0.895919    
## date20140522T000000  2.119e-03  4.994e-03   0.424 0.671268    
## date20140523T000000  3.026e-04  5.029e-03   0.060 0.952011    
## date20140524T000000  7.339e-03  9.097e-03   0.807 0.419833    
## date20140525T000000 -1.525e-02  1.557e-02  -0.980 0.327340    
## date20140526T000000  3.406e-03  1.132e-02   0.301 0.763588    
## date20140527T000000 -1.932e-03  4.811e-03  -0.402 0.688054    
## date20140528T000000  1.626e-03  4.684e-03   0.347 0.728540    
## date20140529T000000  3.314e-03  5.104e-03   0.649 0.516237    
## date20140530T000000  2.184e-04  5.368e-03   0.041 0.967551    
## date20140531T000000  1.117e-04  1.229e-02   0.009 0.992752    
## date20140601T000000  8.902e-03  1.132e-02   0.786 0.431817    
## date20140602T000000  4.937e-03  5.188e-03   0.952 0.341364    
## date20140603T000000  9.641e-04  4.766e-03   0.202 0.839682    
## date20140604T000000  7.797e-03  4.860e-03   1.604 0.108708    
## date20140605T000000  3.055e-03  4.930e-03   0.620 0.535505    
## date20140606T000000  2.597e-03  5.525e-03   0.470 0.638286    
## date20140607T000000 -1.106e-02  1.361e-02  -0.813 0.416341    
## date20140608T000000  6.364e-03  1.059e-02   0.601 0.547801    
## date20140609T000000  2.348e-03  4.886e-03   0.480 0.630887    
## date20140610T000000  2.649e-03  4.786e-03   0.554 0.579910    
## date20140611T000000  7.356e-03  4.995e-03   1.473 0.140830    
## date20140612T000000  3.888e-03  5.012e-03   0.776 0.437973    
## date20140613T000000  5.289e-03  5.012e-03   1.055 0.291315    
## date20140614T000000  4.786e-04  1.361e-02   0.035 0.971950    
## date20140615T000000  1.880e-02  1.556e-02   1.208 0.227089    
## date20140616T000000  3.393e-03  4.901e-03   0.692 0.488779    
## date20140617T000000  6.770e-03  4.787e-03   1.414 0.157270    
## date20140618T000000  2.974e-03  4.873e-03   0.610 0.541670    
## date20140619T000000  1.150e-03  4.822e-03   0.239 0.811461    
## date20140620T000000  5.568e-03  4.712e-03   1.182 0.237391    
## date20140621T000000  1.431e-02  1.132e-02   1.263 0.206478    
## date20140622T000000 -2.394e-04  9.090e-03  -0.026 0.978988    
## date20140623T000000 -7.411e-04  4.550e-03  -0.163 0.870609    
## date20140624T000000  4.241e-03  4.657e-03   0.911 0.362540    
## date20140625T000000  4.963e-03  4.579e-03   1.084 0.278412    
## date20140626T000000 -6.301e-04  4.601e-03  -0.137 0.891080    
## date20140627T000000  1.160e-04  4.994e-03   0.023 0.981469    
## date20140628T000000  4.820e-03  9.092e-03   0.530 0.596029    
## date20140629T000000 -1.025e-03  1.132e-02  -0.091 0.927865    
## date20140630T000000  5.514e-03  5.104e-03   1.080 0.280029    
## date20140701T000000  8.681e-03  4.723e-03   1.838 0.066042 .  
## date20140702T000000  1.067e-02  4.978e-03   2.144 0.032084 *  
## date20140703T000000  8.740e-03  5.237e-03   1.669 0.095141 .  
## date20140704T000000  1.831e-02  1.887e-02   0.970 0.331971    
## date20140705T000000  3.066e-02  1.229e-02   2.494 0.012626 *  
## date20140706T000000 -5.884e-04  1.889e-02  -0.031 0.975146    
## date20140707T000000  1.420e-03  4.961e-03   0.286 0.774692    
## date20140708T000000  2.014e-03  4.609e-03   0.437 0.662076    
## date20140709T000000  1.859e-03  4.704e-03   0.395 0.692651    
## date20140710T000000  3.883e-03  4.888e-03   0.794 0.426966    
## date20140711T000000  5.353e-03  5.103e-03   1.049 0.294219    
## date20140712T000000  2.299e-03  1.229e-02   0.187 0.851575    
## date20140713T000000 -5.289e-03  1.887e-02  -0.280 0.779273    
## date20140714T000000  5.829e-03  4.776e-03   1.220 0.222296    
## date20140715T000000 -1.816e-03  4.901e-03  -0.370 0.711038    
## date20140716T000000  2.362e-03  4.722e-03   0.500 0.616936    
## date20140717T000000  2.156e-03  5.028e-03   0.429 0.668160    
## date20140718T000000  1.219e-03  4.977e-03   0.245 0.806572    
## date20140719T000000  7.254e-03  1.229e-02   0.590 0.555031    
## date20140720T000000 -5.656e-03  1.134e-02  -0.499 0.617883    
## date20140721T000000  5.117e-03  4.860e-03   1.053 0.292373    
## date20140722T000000 -3.671e-03  4.847e-03  -0.757 0.448831    
## date20140723T000000  7.821e-04  4.733e-03   0.165 0.868744    
## date20140724T000000 -6.694e-04  4.888e-03  -0.137 0.891071    
## date20140725T000000  2.900e-03  4.874e-03   0.595 0.551886    
## date20140726T000000  2.577e-03  9.988e-03   0.258 0.796385    
## date20140727T000000 -1.391e-03  2.643e-02  -0.053 0.958038    
## date20140728T000000  6.040e-04  4.961e-03   0.122 0.903106    
## date20140729T000000 -1.751e-03  4.693e-03  -0.373 0.709118    
## date20140730T000000  4.446e-03  5.126e-03   0.867 0.385807    
## date20140731T000000  2.906e-03  5.028e-03   0.578 0.563328    
## date20140801T000000  7.057e-03  5.084e-03   1.388 0.165143    
## date20140802T000000  6.355e-03  1.229e-02   0.517 0.605158    
## date20140803T000000  1.681e-02  2.643e-02   0.636 0.524756    
## date20140804T000000  2.893e-03  4.822e-03   0.600 0.548532    
## date20140805T000000  3.578e-03  4.712e-03   0.759 0.447651    
## date20140806T000000  1.652e-03  4.960e-03   0.333 0.739118    
## date20140807T000000  5.398e-03  5.211e-03   1.036 0.300329    
## date20140808T000000  4.184e-03  5.103e-03   0.820 0.412243    
## date20140809T000000  5.939e-03  1.887e-02   0.315 0.753020    
## date20140810T000000 -5.035e-03  1.557e-02  -0.323 0.746361    
## date20140811T000000  6.350e-03  4.713e-03   1.347 0.177875    
## date20140812T000000  4.472e-03  4.847e-03   0.923 0.356245    
## date20140813T000000  1.695e-03  4.962e-03   0.342 0.732639    
## date20140814T000000 -1.527e-03  4.902e-03  -0.311 0.755489    
## date20140815T000000  1.207e-02  5.312e-03   2.272 0.023104 *  
## date20140816T000000  7.379e-04  1.361e-02   0.054 0.956759    
## date20140817T000000 -2.271e-02  1.557e-02  -1.459 0.144583    
## date20140818T000000 -1.326e-03  5.084e-03  -0.261 0.794205    
## date20140819T000000  5.594e-03  4.809e-03   1.163 0.244757    
## date20140820T000000  9.406e-04  4.848e-03   0.194 0.846147    
## date20140821T000000  8.903e-04  4.977e-03   0.179 0.858026    
## date20140822T000000 -4.503e-03  4.978e-03  -0.905 0.365717    
## date20140823T000000  3.878e-03  1.058e-02   0.367 0.713995    
## date20140824T000000  4.937e-03  2.642e-02   0.187 0.851763    
## date20140825T000000  4.560e-03  4.723e-03   0.966 0.334294    
## date20140826T000000  2.648e-03  4.648e-03   0.570 0.568831    
## date20140827T000000  1.593e-03  4.786e-03   0.333 0.739298    
## date20140828T000000  1.016e-03  5.011e-03   0.203 0.839278    
## date20140829T000000  6.527e-03  5.427e-03   1.203 0.229156    
## date20140830T000000  3.500e-02  2.642e-02   1.325 0.185352    
## date20140831T000000 -1.255e-02  1.888e-02  -0.665 0.506336    
## date20140901T000000  1.054e-02  1.229e-02   0.857 0.391240    
## date20140902T000000  5.251e-03  5.046e-03   1.041 0.298095    
## date20140903T000000  3.705e-03  4.962e-03   0.747 0.455259    
## date20140904T000000  5.908e-03  5.124e-03   1.153 0.248895    
## date20140905T000000 -2.405e-03  4.964e-03  -0.485 0.627965    
## date20140906T000000  9.635e-04  1.361e-02   0.071 0.943576    
## date20140907T000000 -3.024e-03  2.642e-02  -0.114 0.908889    
## date20140908T000000  4.389e-04  5.144e-03   0.085 0.932017    
## date20140909T000000  1.227e-03  4.822e-03   0.254 0.799219    
## date20140910T000000 -1.020e-03  4.962e-03  -0.206 0.837122    
## date20140911T000000  1.876e-03  4.933e-03   0.380 0.703698    
## date20140912T000000 -3.766e-03  5.492e-03  -0.686 0.492988    
## date20140913T000000  2.966e-03  1.230e-02   0.241 0.809372    
## date20140914T000000  1.054e-03  1.888e-02   0.056 0.955476    
## date20140915T000000 -1.258e-04  5.047e-03  -0.025 0.980108    
## date20140916T000000  1.693e-03  4.931e-03   0.343 0.731422    
## date20140917T000000 -3.905e-04  5.104e-03  -0.077 0.939018    
## date20140918T000000  1.883e-03  5.065e-03   0.372 0.710059    
## date20140919T000000  8.284e-03  5.105e-03   1.623 0.104649    
## date20140920T000000  1.510e-02  1.556e-02   0.970 0.332011    
## date20140921T000000 -4.607e-03  1.361e-02  -0.338 0.735086    
## date20140922T000000  1.188e-03  4.822e-03   0.246 0.805456    
## date20140923T000000  2.149e-03  4.835e-03   0.444 0.656766    
## date20140924T000000 -6.382e-04  4.861e-03  -0.131 0.895541    
## date20140925T000000  1.188e-03  5.012e-03   0.237 0.812638    
## date20140926T000000  3.097e-03  4.916e-03   0.630 0.528801    
## date20140927T000000 -1.562e-02  1.559e-02  -1.002 0.316216    
## date20140928T000000  8.721e-03  1.556e-02   0.560 0.575288    
## date20140929T000000  4.523e-03  4.916e-03   0.920 0.357498    
## date20140930T000000  4.485e-03  5.722e-03   0.784 0.433131    
## date20141001T000000  4.823e-03  4.978e-03   0.969 0.332552    
## date20141002T000000  8.654e-04  5.103e-03   0.170 0.865350    
## date20141003T000000  4.239e-03  5.427e-03   0.781 0.434686    
## date20141004T000000  1.717e-03  1.887e-02   0.091 0.927520    
## date20141005T000000 -2.611e-03  1.556e-02  -0.168 0.866766    
## date20141006T000000 -2.202e-03  5.458e-03  -0.403 0.686650    
## date20141007T000000  7.592e-03  4.963e-03   1.530 0.126112    
## date20141008T000000  3.744e-03  5.285e-03   0.708 0.478784    
## date20141009T000000  6.760e-04  4.994e-03   0.135 0.892342    
## date20141010T000000  6.005e-03  4.945e-03   1.214 0.224627    
## date20141011T000000  2.664e-01  2.647e-02  10.065  < 2e-16 ***
## date20141012T000000  1.491e-03  1.887e-02   0.079 0.937036    
## date20141013T000000  1.094e-02  5.236e-03   2.089 0.036763 *  
## date20141014T000000  5.009e-03  4.888e-03   1.025 0.305491    
## date20141015T000000  5.138e-03  5.029e-03   1.022 0.307010    
## date20141016T000000  3.763e-03  4.977e-03   0.756 0.449642    
## date20141017T000000  7.348e-03  5.066e-03   1.450 0.146948    
## date20141018T000000  2.884e-02  1.362e-02   2.118 0.034203 *  
## date20141019T000000  4.296e-03  1.896e-02   0.227 0.820765    
## date20141020T000000  8.422e-03  4.978e-03   1.692 0.090700 .  
## date20141021T000000  2.306e-03  4.888e-03   0.472 0.637154    
## date20141022T000000  3.596e-03  5.144e-03   0.699 0.484568    
## date20141023T000000 -3.777e-04  5.085e-03  -0.074 0.940791    
## date20141024T000000  1.395e-04  5.491e-03   0.025 0.979734    
## date20141025T000000  8.439e-03  1.556e-02   0.542 0.587641    
## date20141026T000000 -9.086e-03  1.557e-02  -0.583 0.559605    
## date20141027T000000  2.367e-05  4.849e-03   0.005 0.996105    
## date20141028T000000  1.604e-03  4.703e-03   0.341 0.733032    
## date20141029T000000  5.016e-03  4.931e-03   1.017 0.309020    
## date20141030T000000  5.271e-03  5.047e-03   1.044 0.296305    
## date20141031T000000  1.095e-03  5.914e-03   0.185 0.853120    
## date20141101T000000  2.599e-02  1.361e-02   1.909 0.056218 .  
## date20141103T000000  3.821e-03  5.124e-03   0.746 0.455919    
## date20141104T000000  9.670e-04  5.261e-03   0.184 0.854167    
## date20141105T000000  1.671e-03  5.166e-03   0.323 0.746329    
## date20141106T000000  3.769e-03  5.103e-03   0.739 0.460182    
## date20141107T000000 -1.811e-04  5.236e-03  -0.035 0.972404    
## date20141108T000000  1.067e-02  1.361e-02   0.784 0.433033    
## date20141109T000000 -1.580e-02  1.557e-02  -1.015 0.310000    
## date20141110T000000  9.857e-03  5.046e-03   1.953 0.050788 .  
## date20141111T000000 -3.830e-04  6.087e-03  -0.063 0.949831    
## date20141112T000000 -1.906e-03  4.962e-03  -0.384 0.700882    
## date20141113T000000  1.183e-03  4.765e-03   0.248 0.803911    
## date20141114T000000  2.477e-03  5.312e-03   0.466 0.641009    
## date20141116T000000  9.237e-04  1.888e-02   0.049 0.960971    
## date20141117T000000 -2.562e-03  5.046e-03  -0.508 0.611666    
## date20141118T000000  2.681e-03  4.930e-03   0.544 0.586664    
## date20141119T000000  2.361e-03  4.995e-03   0.473 0.636492    
## date20141120T000000  6.587e-04  5.105e-03   0.129 0.897330    
## date20141121T000000 -3.606e-03  4.994e-03  -0.722 0.470263    
## date20141122T000000  5.976e-03  1.133e-02   0.527 0.597934    
## date20141123T000000 -3.831e-04  1.887e-02  -0.020 0.983803    
## date20141124T000000  4.757e-03  4.994e-03   0.952 0.340877    
## date20141125T000000  3.312e-03  5.190e-03   0.638 0.523298    
## date20141126T000000  5.752e-03  5.722e-03   1.005 0.314778    
## date20141128T000000  1.589e-02  1.888e-02   0.842 0.399769    
## date20141129T000000 -9.124e-04  1.557e-02  -0.059 0.953263    
## date20141130T000000  2.392e-02  2.643e-02   0.905 0.365476    
## date20141201T000000 -3.358e-04  4.917e-03  -0.068 0.945562    
## date20141202T000000  3.750e-03  4.916e-03   0.763 0.445539    
## date20141203T000000 -3.722e-04  5.012e-03  -0.074 0.940807    
## date20141204T000000  7.732e-04  5.144e-03   0.150 0.880521    
## date20141205T000000  3.759e-03  5.491e-03   0.685 0.493637    
## date20141206T000000  1.457e-03  1.557e-02   0.094 0.925456    
## date20141208T000000  4.238e-03  5.146e-03   0.824 0.410152    
## date20141209T000000  5.206e-03  5.029e-03   1.035 0.300566    
## date20141210T000000 -1.012e-03  5.144e-03  -0.197 0.844116    
## date20141211T000000  1.767e-03  5.166e-03   0.342 0.732338    
## date20141212T000000 -2.178e-04  5.285e-03  -0.041 0.967131    
## date20141213T000000  1.728e-02  2.649e-02   0.652 0.514157    
## date20141214T000000  7.482e-04  1.557e-02   0.048 0.961666    
## date20141215T000000  2.191e-03  4.977e-03   0.440 0.659795    
## date20141216T000000 -5.910e-04  5.189e-03  -0.114 0.909316    
## date20141217T000000  2.958e-03  5.367e-03   0.551 0.581514    
## date20141218T000000  1.109e-03  5.260e-03   0.211 0.833056    
## date20141219T000000  1.279e-02  5.862e-03   2.182 0.029140 *  
## date20141220T000000 -8.055e-03  9.989e-03  -0.806 0.420028    
## date20141221T000000  6.551e-03  2.642e-02   0.248 0.804182    
## date20141222T000000 -1.405e-03  5.235e-03  -0.268 0.788465    
## date20141223T000000  1.623e-03  5.462e-03   0.297 0.766421    
## date20141224T000000  1.932e-03  6.842e-03   0.282 0.777666    
## date20141226T000000 -3.903e-03  6.151e-03  -0.634 0.525790    
## date20141227T000000 -1.708e-02  2.643e-02  -0.646 0.518144    
## date20141229T000000 -1.796e-04  5.369e-03  -0.033 0.973317    
## date20141230T000000  1.001e-02  5.916e-03   1.693 0.090558 .  
## date20141231T000000  4.408e-03  5.914e-03   0.745 0.456083    
## date20150102T000000  3.498e-04  5.681e-03   0.062 0.950907    
## date20150105T000000  5.172e-03  5.313e-03   0.973 0.330325    
## date20150106T000000  1.134e-02  6.220e-03   1.823 0.068338 .  
## date20150107T000000  2.495e-03  5.722e-03   0.436 0.662843    
## date20150108T000000  3.869e-03  5.969e-03   0.648 0.516909    
## date20150109T000000  1.696e-02  6.453e-03   2.628 0.008601 ** 
## date20150110T000000 -1.048e-02  2.642e-02  -0.396 0.691746    
## date20150112T000000  2.366e-04  5.813e-03   0.041 0.967532    
## date20150113T000000  4.572e-03  5.863e-03   0.780 0.435522    
## date20150114T000000 -2.820e-03  5.680e-03  -0.496 0.619553    
## date20150115T000000  3.588e-03  5.865e-03   0.612 0.540657    
## date20150116T000000 -3.084e-03  5.864e-03  -0.526 0.598934    
## date20150119T000000 -5.431e-03  1.239e-02  -0.438 0.661049    
## date20150120T000000  1.588e-03  6.086e-03   0.261 0.794105    
## date20150121T000000  2.729e-03  5.679e-03   0.481 0.630791    
## date20150122T000000  2.546e-03  5.863e-03   0.434 0.664137    
## date20150123T000000  4.007e-03  5.723e-03   0.700 0.483823    
## date20150124T000000 -2.938e-02  1.362e-02  -2.157 0.031041 *  
## date20150125T000000  2.156e-03  1.556e-02   0.138 0.889850    
## date20150126T000000 -4.582e-03  5.914e-03  -0.775 0.438434    
## date20150127T000000  2.587e-03  5.367e-03   0.482 0.629820    
## date20150128T000000  2.736e-03  5.427e-03   0.504 0.614185    
## date20150129T000000  5.515e-03  6.088e-03   0.906 0.365024    
## date20150130T000000  1.405e-02  6.544e-03   2.147 0.031835 *  
## date20150131T000000 -1.844e-02  2.645e-02  -0.697 0.485716    
## date20150201T000000  9.474e-03  1.887e-02   0.502 0.615701    
## date20150202T000000 -2.356e-03  5.863e-03  -0.402 0.687790    
## date20150203T000000  1.704e-04  5.916e-03   0.029 0.977021    
## date20150204T000000  9.117e-03  5.563e-03   1.639 0.101244    
## date20150205T000000 -1.199e-03  5.679e-03  -0.211 0.832792    
## date20150206T000000  3.104e-03  5.812e-03   0.534 0.593360    
## date20150207T000000  9.057e-03  1.888e-02   0.480 0.631479    
## date20150209T000000  9.376e-03  5.427e-03   1.728 0.084066 .  
## date20150210T000000 -1.742e-03  5.563e-03  -0.313 0.754102    
## date20150211T000000  6.136e-03  5.260e-03   1.166 0.243432    
## date20150212T000000  1.538e-02  5.681e-03   2.708 0.006771 ** 
## date20150213T000000  5.137e-03  5.339e-03   0.962 0.336017    
## date20150215T000000  4.864e-03  2.642e-02   0.184 0.853953    
## date20150216T000000  1.169e-02  1.361e-02   0.859 0.390346    
## date20150217T000000  2.779e-03  5.189e-03   0.536 0.592245    
## date20150218T000000 -1.759e-03  4.848e-03  -0.363 0.716814    
## date20150219T000000  5.684e-03  5.125e-03   1.109 0.267412    
## date20150220T000000  4.063e-03  5.285e-03   0.769 0.442087    
## date20150221T000000  3.696e-02  1.557e-02   2.374 0.017610 *  
## date20150222T000000 -3.939e-03  9.093e-03  -0.433 0.664901    
## date20150223T000000  5.020e-03  5.259e-03   0.955 0.339844    
## date20150224T000000  4.946e-04  4.847e-03   0.102 0.918725    
## date20150225T000000  1.033e-02  4.917e-03   2.102 0.035612 *  
## date20150226T000000  8.478e-03  5.492e-03   1.544 0.122645    
## date20150227T000000  9.852e-03  5.639e-03   1.747 0.080646 .  
## date20150228T000000  9.636e-03  1.361e-02   0.708 0.478991    
## date20150301T000000  6.226e-03  1.229e-02   0.506 0.612566    
## date20150302T000000  1.585e-02  5.915e-03   2.681 0.007357 ** 
## date20150303T000000  1.902e-02  5.525e-03   3.443 0.000576 ***
## date20150304T000000  8.264e-03  4.836e-03   1.709 0.087525 .  
## date20150305T000000  5.415e-03  5.084e-03   1.065 0.286796    
## date20150306T000000  7.964e-03  5.427e-03   1.467 0.142269    
## date20150307T000000 -1.551e-03  1.361e-02  -0.114 0.909241    
## date20150308T000000  1.344e-02  2.642e-02   0.509 0.611059    
## date20150309T000000  8.407e-03  5.339e-03   1.575 0.115387    
## date20150310T000000  9.809e-03  5.259e-03   1.865 0.062176 .  
## date20150311T000000  5.181e-03  5.166e-03   1.003 0.315917    
## date20150312T000000  1.440e-03  5.104e-03   0.282 0.777923    
## date20150313T000000  1.193e-02  5.167e-03   2.309 0.020968 *  
## date20150314T000000  4.353e-03  1.361e-02   0.320 0.749082    
## date20150315T000000  9.503e-03  1.556e-02   0.611 0.541489    
## date20150316T000000  1.003e-02  5.083e-03   1.974 0.048418 *  
## date20150317T000000  6.809e-03  4.874e-03   1.397 0.162457    
## date20150318T000000  5.368e-04  4.916e-03   0.109 0.913037    
## date20150319T000000  6.559e-03  5.166e-03   1.270 0.204211    
## date20150320T000000  2.053e-03  5.285e-03   0.389 0.697637    
## date20150321T000000  4.607e-03  8.743e-03   0.527 0.598227    
## date20150322T000000  1.889e-02  1.557e-02   1.214 0.224920    
## date20150323T000000  5.819e-03  4.946e-03   1.176 0.239427    
## date20150324T000000  5.507e-03  4.977e-03   1.106 0.268531    
## date20150325T000000  7.715e-03  4.694e-03   1.644 0.100299    
## date20150326T000000  1.187e-02  4.754e-03   2.496 0.012556 *  
## date20150327T000000  8.347e-03  4.693e-03   1.779 0.075330 .  
## date20150328T000000  6.903e-03  1.058e-02   0.652 0.514243    
## date20150329T000000 -1.152e-02  1.133e-02  -1.016 0.309499    
## date20150330T000000  8.424e-03  4.901e-03   1.719 0.085689 .  
## date20150331T000000  1.150e-02  5.103e-03   2.254 0.024203 *  
## date20150401T000000  8.304e-03  5.010e-03   1.657 0.097473 .  
## date20150402T000000  9.513e-03  4.743e-03   2.006 0.044886 *  
## date20150403T000000  8.503e-03  5.104e-03   1.666 0.095748 .  
## date20150404T000000  1.838e-02  1.556e-02   1.181 0.237668    
## date20150405T000000  1.288e-02  1.133e-02   1.137 0.255442    
## date20150406T000000  8.493e-03  5.028e-03   1.689 0.091200 .  
## date20150407T000000  1.266e-02  4.733e-03   2.675 0.007475 ** 
## date20150408T000000  1.405e-02  4.702e-03   2.987 0.002822 ** 
## date20150409T000000  3.541e-03  4.931e-03   0.718 0.472731    
## date20150410T000000  1.280e-02  5.123e-03   2.498 0.012494 *  
## date20150411T000000  6.617e-03  8.445e-03   0.784 0.433273    
## date20150412T000000 -9.350e-03  9.501e-03  -0.984 0.325114    
## date20150413T000000  1.163e-02  4.962e-03   2.343 0.019135 *  
## date20150414T000000  5.623e-03  4.656e-03   1.207 0.227259    
## date20150415T000000  1.075e-02  4.994e-03   2.152 0.031412 *  
## date20150416T000000  9.421e-04  5.084e-03   0.185 0.852977    
## date20150417T000000  6.913e-03  4.948e-03   1.397 0.162355    
## date20150418T000000  1.221e-02  1.361e-02   0.897 0.369859    
## date20150419T000000 -2.207e-04  1.361e-02  -0.016 0.987063    
## date20150420T000000  2.924e-03  5.047e-03   0.579 0.562308    
## date20150421T000000  1.053e-02  4.683e-03   2.249 0.024529 *  
## date20150422T000000  7.753e-03  4.665e-03   1.662 0.096571 .  
## date20150423T000000  7.859e-03  4.744e-03   1.657 0.097628 .  
## date20150424T000000  7.623e-03  4.764e-03   1.600 0.109577    
## date20150425T000000  1.643e-02  8.440e-03   1.947 0.051591 .  
## date20150426T000000 -4.596e-03  1.059e-02  -0.434 0.664182    
## date20150427T000000  3.110e-03  4.648e-03   0.669 0.503431    
## date20150428T000000  1.010e-02  4.666e-03   2.164 0.030475 *  
## date20150429T000000  1.078e-02  4.639e-03   2.325 0.020098 *  
## date20150430T000000  8.875e-03  5.047e-03   1.758 0.078694 .  
## date20150501T000000  1.090e-02  5.144e-03   2.120 0.034035 *  
## date20150502T000000  5.388e-03  1.133e-02   0.476 0.634339    
## date20150503T000000  5.113e-03  9.994e-03   0.512 0.608935    
## date20150504T000000  6.713e-03  4.874e-03   1.377 0.168417    
## date20150505T000000  9.775e-03  4.835e-03   2.022 0.043208 *  
## date20150506T000000  1.313e-02  4.961e-03   2.646 0.008147 ** 
## date20150507T000000  1.234e-02  5.104e-03   2.417 0.015674 *  
## date20150508T000000  1.284e-02  5.601e-03   2.292 0.021917 *  
## date20150509T000000  1.865e-02  1.888e-02   0.988 0.323164    
## date20150510T000000 -9.897e-03  1.887e-02  -0.524 0.600031    
## date20150511T000000  1.557e-02  6.152e-03   2.530 0.011404 *  
## date20150512T000000  1.006e-02  5.723e-03   1.758 0.078750 .  
## date20150513T000000  5.354e-03  6.633e-03   0.807 0.419554    
## date20150514T000000  9.634e-03  9.092e-03   1.060 0.289338    
## date20150524T000000  1.386e-02  2.642e-02   0.525 0.599880    
## date20150527T000000  3.785e-02  2.644e-02   1.432 0.152271    
## bedrooms            -1.379e-01  9.468e-03 -14.560  < 2e-16 ***
## bathrooms            4.704e-02  3.980e-03  11.820  < 2e-16 ***
## sqft_living          2.444e-01  8.879e-03  27.531  < 2e-16 ***
## sqft_lot             2.762e-02  1.253e-02   2.204 0.027520 *  
## floors               2.070e-04  1.371e-03   0.151 0.879996    
## waterfront           7.414e-02  2.591e-03  28.613  < 2e-16 ***
## view                 2.474e-02  1.285e-03  19.259  < 2e-16 ***
## condition            1.817e-02  1.423e-03  12.769  < 2e-16 ***
## grade                1.579e-01  3.930e-03  40.190  < 2e-16 ***
## sqft_above           3.725e-02  6.065e-03   6.142 8.35e-10 ***
## sqft_basement               NA         NA      NA       NA    
## yr_built            -3.776e-02  1.255e-03 -30.089  < 2e-16 ***
## yr_renovated         5.558e-03  1.122e-03   4.955 7.30e-07 ***
## lat                  4.527e-02  9.870e-04  45.870  < 2e-16 ***
## long                -1.757e-02  2.181e-03  -8.056 8.41e-16 ***
## sqft_living15        2.041e-02  3.035e-03   6.726 1.81e-11 ***
## sqft_lot15          -4.554e-02  9.934e-03  -4.585 4.58e-06 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 0.02615 on 15827 degrees of freedom
## Multiple R-squared:  0.7083, Adjusted R-squared:  0.7013 
## F-statistic: 100.9 on 381 and 15827 DF,  p-value: < 2.2e-16

Model Linear menggunakan Semua Variabel sebagai Prediktor menghasilkan Model Linear dengan nilai R-SQuared sebesar 69,91% dengan kebanyakan variabel memiliki hubungan yang signifikan dengan Harga.

Decision Tree

model_tree
## n= 16209 
## 
## node), split, n, deviance, yval
##       * denotes terminal node
## 
##   1) root 16209 37.0945000 0.06095083  
##     2) grade< 0.625 13033  8.6908710 0.04764066  
##       4) lat< 0.6087341 5479  1.3503030 0.03154457 *
##       5) lat>=0.6087341 7554  4.8914500 0.05931534  
##        10) sqft_living< 0.1271698 4847  1.6937420 0.05053339 *
##        11) sqft_living>=0.1271698 2707  2.1545680 0.07503978 *
##     3) grade>=0.625 3176 16.6197900 0.11557030  
##       6) sqft_living< 0.2939623 2739  6.0694770 0.10085710  
##        12) lat< 0.5932122 597  0.4461721 0.06380461 *
##        13) lat>=0.5932122 2142  4.5752560 0.11118410  
##          26) long>=0.2761628 1207  0.8365521 0.09567946 *
##          27) long< 0.2761628 935  3.0739860 0.13119910  
##            54) sqft_living< 0.2071698 549  0.9026689 0.10712130 *
##            55) sqft_living>=0.2071698 386  1.4003590 0.16544450  
##             110) date=20140505T000000,20140506T000000,20140507T000000,20140508T000000,20140512T000000,20140513T000000,20140514T000000,20140515T000000,20140516T000000,20140519T000000,20140520T000000,20140521T000000,20140522T000000,20140523T000000,20140528T000000,20140530T000000,20140605T000000,20140609T000000,20140610T000000,20140616T000000,20140618T000000,20140619T000000,20140622T000000,20140623T000000,20140625T000000,20140626T000000,20140627T000000,20140628T000000,20140630T000000,20140702T000000,20140703T000000,20140706T000000,20140708T000000,20140709T000000,20140710T000000,20140711T000000,20140714T000000,20140715T000000,20140718T000000,20140720T000000,20140722T000000,20140723T000000,20140725T000000,20140729T000000,20140730T000000,20140802T000000,20140805T000000,20140807T000000,20140808T000000,20140811T000000,20140813T000000,20140814T000000,20140815T000000,20140827T000000,20140828T000000,20140904T000000,20140908T000000,20140909T000000,20140910T000000,20140911T000000,20140915T000000,20140922T000000,20140923T000000,20140925T000000,20140929T000000,20141002T000000,20141006T000000,20141009T000000,20141010T000000,20141013T000000,20141014T000000,20141015T000000,20141020T000000,20141021T000000,20141022T000000,20141023T000000,20141024T000000,20141029T000000,20141031T000000,20141104T000000,20141105T000000,20141106T000000,20141110T000000,20141111T000000,20141112T000000,20141113T000000,20141114T000000,20141117T000000,20141120T000000,20141121T000000,20141124T000000,20141125T000000,20141204T000000,20141208T000000,20141211T000000,20141212T000000,20141215T000000,20141216T000000,20141224T000000,20141231T000000,20150102T000000,20150108T000000,20150109T000000,20150112T000000,20150113T000000,20150114T000000,20150121T000000,20150202T000000,20150203T000000,20150209T000000,20150223T000000,20150224T000000,20150225T000000,20150304T000000,20150305T000000,20150306T000000,20150316T000000,20150320T000000,20150321T000000,20150323T000000,20150324T000000,20150325T000000,20150326T000000,20150331T000000,20150401T000000,20150402T000000,20150403T000000,20150408T000000,20150409T000000,20150412T000000,20150414T000000,20150415T000000,20150420T000000,20150422T000000,20150423T000000,20150426T000000,20150427T000000,20150430T000000,20150502T000000,20150508T000000,20150512T000000 274  0.4931260 0.14445640 *
##             111) date=20140527T000000,20140604T000000,20140612T000000,20140613T000000,20140617T000000,20140620T000000,20140624T000000,20140701T000000,20140721T000000,20140731T000000,20140806T000000,20140819T000000,20140821T000000,20140825T000000,20140826T000000,20140902T000000,20140916T000000,20140924T000000,20140926T000000,20141001T000000,20141003T000000,20141008T000000,20141017T000000,20141018T000000,20141101T000000,20141119T000000,20141126T000000,20141201T000000,20141205T000000,20141209T000000,20141210T000000,20141223T000000,20150106T000000,20150107T000000,20150115T000000,20150116T000000,20150128T000000,20150130T000000,20150204T000000,20150212T000000,20150217T000000,20150218T000000,20150221T000000,20150226T000000,20150227T000000,20150302T000000,20150303T000000,20150310T000000,20150313T000000,20150317T000000,20150318T000000,20150330T000000,20150407T000000,20150410T000000,20150421T000000,20150424T000000,20150428T000000,20150429T000000,20150501T000000,20150505T000000,20150507T000000 112  0.4912577 0.21679050 *
##       7) sqft_living>=0.2939623 437  6.2410500 0.20778850  
##        14) date=20140505T000000,20140509T000000,20140512T000000,20140514T000000,20140515T000000,20140516T000000,20140520T000000,20140521T000000,20140522T000000,20140523T000000,20140524T000000,20140527T000000,20140528T000000,20140529T000000,20140530T000000,20140602T000000,20140603T000000,20140604T000000,20140605T000000,20140606T000000,20140609T000000,20140610T000000,20140612T000000,20140613T000000,20140616T000000,20140617T000000,20140618T000000,20140619T000000,20140620T000000,20140623T000000,20140624T000000,20140625T000000,20140626T000000,20140627T000000,20140630T000000,20140701T000000,20140703T000000,20140708T000000,20140709T000000,20140710T000000,20140711T000000,20140714T000000,20140715T000000,20140717T000000,20140718T000000,20140721T000000,20140722T000000,20140723T000000,20140724T000000,20140725T000000,20140728T000000,20140730T000000,20140731T000000,20140804T000000,20140805T000000,20140806T000000,20140807T000000,20140808T000000,20140811T000000,20140814T000000,20140818T000000,20140820T000000,20140822T000000,20140825T000000,20140827T000000,20140902T000000,20140903T000000,20140905T000000,20140909T000000,20140910T000000,20140911T000000,20140912T000000,20140915T000000,20140918T000000,20140924T000000,20140927T000000,20140929T000000,20141001T000000,20141003T000000,20141006T000000,20141007T000000,20141009T000000,20141010T000000,20141014T000000,20141015T000000,20141016T000000,20141017T000000,20141021T000000,20141022T000000,20141024T000000,20141027T000000,20141103T000000,20141106T000000,20141107T000000,20141110T000000,20141112T000000,20141114T000000,20141120T000000,20141121T000000,20141125T000000,20141126T000000,20141201T000000,20141202T000000,20141203T000000,20141209T000000,20141211T000000,20141212T000000,20141213T000000,20141215T000000,20141216T000000,20141217T000000,20141222T000000,20141223T000000,20141224T000000,20141226T000000,20150102T000000,20150112T000000,20150113T000000,20150114T000000,20150115T000000,20150123T000000,20150126T000000,20150127T000000,20150128T000000,20150202T000000,20150203T000000,20150206T000000,20150209T000000,20150210T000000,20150211T000000,20150213T000000,20150219T000000,20150222T000000,20150224T000000,20150303T000000,20150305T000000,20150312T000000,20150317T000000,20150318T000000,20150324T000000,20150325T000000,20150326T000000,20150327T000000,20150328T000000,20150330T000000,20150331T000000,20150401T000000,20150406T000000,20150407T000000,20150409T000000,20150410T000000,20150415T000000,20150422T000000,20150423T000000,20150424T000000,20150427T000000,20150428T000000,20150429T000000,20150430T000000,20150504T000000,20150505T000000 346  2.1726180 0.17543970  
##          28) date=20140505T000000,20140512T000000,20140514T000000,20140515T000000,20140520T000000,20140521T000000,20140523T000000,20140527T000000,20140528T000000,20140530T000000,20140602T000000,20140603T000000,20140604T000000,20140606T000000,20140616T000000,20140627T000000,20140715T000000,20140717T000000,20140721T000000,20140722T000000,20140723T000000,20140730T000000,20140731T000000,20140804T000000,20140805T000000,20140806T000000,20140807T000000,20140808T000000,20140814T000000,20140818T000000,20140822T000000,20140825T000000,20140827T000000,20140903T000000,20140909T000000,20140910T000000,20140924T000000,20141006T000000,20141015T000000,20141017T000000,20141024T000000,20141027T000000,20141103T000000,20141107T000000,20141110T000000,20141112T000000,20141125T000000,20141202T000000,20141211T000000,20141215T000000,20141217T000000,20141222T000000,20141223T000000,20141224T000000,20141226T000000,20150114T000000,20150115T000000,20150126T000000,20150128T000000,20150202T000000,20150211T000000,20150213T000000,20150219T000000,20150222T000000,20150224T000000,20150303T000000,20150312T000000,20150317T000000,20150318T000000,20150327T000000,20150328T000000,20150330T000000,20150331T000000,20150406T000000,20150423T000000,20150424T000000,20150427T000000,20150429T000000,20150430T000000 151  0.3235347 0.13555710 *
##          29) date=20140509T000000,20140516T000000,20140522T000000,20140524T000000,20140529T000000,20140605T000000,20140609T000000,20140610T000000,20140612T000000,20140613T000000,20140617T000000,20140618T000000,20140619T000000,20140620T000000,20140623T000000,20140624T000000,20140625T000000,20140626T000000,20140630T000000,20140701T000000,20140703T000000,20140708T000000,20140709T000000,20140710T000000,20140711T000000,20140714T000000,20140718T000000,20140724T000000,20140725T000000,20140728T000000,20140811T000000,20140820T000000,20140902T000000,20140905T000000,20140911T000000,20140912T000000,20140915T000000,20140918T000000,20140927T000000,20140929T000000,20141001T000000,20141003T000000,20141007T000000,20141009T000000,20141010T000000,20141014T000000,20141016T000000,20141021T000000,20141022T000000,20141106T000000,20141114T000000,20141120T000000,20141121T000000,20141126T000000,20141201T000000,20141203T000000,20141209T000000,20141212T000000,20141213T000000,20141216T000000,20150102T000000,20150112T000000,20150113T000000,20150123T000000,20150127T000000,20150203T000000,20150206T000000,20150209T000000,20150210T000000,20150305T000000,20150324T000000,20150325T000000,20150326T000000,20150401T000000,20150407T000000,20150409T000000,20150410T000000,20150415T000000,20150422T000000,20150428T000000,20150504T000000,20150505T000000 195  1.4229120 0.20632310 *
##        15) date=20140507T000000,20140513T000000,20140519T000000,20140608T000000,20140611T000000,20140702T000000,20140729T000000,20140801T000000,20140812T000000,20140815T000000,20140826T000000,20140828T000000,20140829T000000,20140904T000000,20140917T000000,20140919T000000,20140923T000000,20140926T000000,20141011T000000,20141013T000000,20141020T000000,20141029T000000,20141030T000000,20141118T000000,20141124T000000,20141205T000000,20141208T000000,20141219T000000,20141230T000000,20150108T000000,20150109T000000,20150130T000000,20150212T000000,20150218T000000,20150225T000000,20150304T000000,20150402T000000,20150408T000000,20150413T000000,20150417T000000,20150420T000000,20150421T000000,20150501T000000,20150506T000000,20150507T000000,20150508T000000,20150511T000000 91  2.3297000 0.33078500  
##          30) sqft_living< 0.5249057 80  0.8172966 0.29550680 *
##          31) sqft_living>=0.5249057 11  0.6887365 0.58735380 *

node, split, n, deviance, yval: Menunjukkan struktur pohon keputusan dengan split berdasarkan variabel, jumlah sampel pada node (16209), standard deviance (ukuran ketidakseragaman house di node), dan nilai prediksi (yval) untuk node tersebut. Misalnya, node yang membagi house berdasarkan grade < 0.609 memiliki 13905 sampel dengan deviasi 37,09 dan ketepatan prediksi 0.0609.

weights: Jumlah bobot/neuron pada model, yaitu 211. serta 19-10-1 network: Menunjukkan anatomi jaringan dengan 19 input variable, 10 neuron di hidden layer, dan 1 outputvariable.

Model Evaluation

actual <- test_house$price predictions_lm <- predict(linmod, test_house) predictions_tree <- predict(model_tree, test_house) predictions_nnet <- predict(model_nnet, test_house, type = “raw”)

calc_rmse <- function(predictions, actual) { sqrt(mean((predictions - actual)^2)) }

calc_r_squared <- function(predictions, actual) { 1 - sum((predictions - actual)^2) / sum((actual - mean(actual))^2) }

Calculate RMSE

rmse_lm <- calc_rmse(predictions_lm, actual) rmse_tree <- calc_rmse(predictions_tree, actual) rmse_nnet <- calc_rmse(predictions_nnet, actual)

Calculate R-squared

r_squared_lm <- calc_r_squared(predictions_lm, actual) r_squared_tree <- calc_r_squared(predictions_tree, actual) r_squared_nnet <- calc_r_squared(predictions_nnet, actual)