predict(modelname , data)
- predict generates avg value
- Cross Validation can be achieved using RMSE
- RMSE :
- pred <- predict(model, data)
- res - actual - predicted (ALSO CALLED Out of Bag Error)
- rmse > sqrt(mean(res^2))
Hyperplane
Suppport Vector
Data that is not linearly seperatble uses kernel to increase
dimensions of one level in the sum this is called kernel trick
It is Unsuperivised Learning
1. All Independent variables must be continuous
2. Scale Data
3. Works using Euclidian Distance formula
4. To get optimum clusters calculate
- Within SS
- Between SS
- Limiting value should be sqrt of total no of variable
5. Calculate Optimum clusters using Elbow Method by PLOTTING WSS/BSS
cl_varkmeans(data, k)
cl_var$cluster
will assign cluster for every observation.
rules <- inspect(rules[1:n])
arulesviz(rules)