For this demonstration, we will use iris dataset. Using Gaussian Naive Bayes, lets predict the class of the flower of a new obseravtion.

New observation whose class we are going to predict. [Note: This observation belongs to the setosa cluster]

Steps in buliding Gaussian Naive Bayes Algorithm

Calculate the means of the attributes
Calculate standard deviations of the attributes
Calculate the probability densities of the attributes using above two
Calulate the prior probability of each class
calculate the posterior probability by using prior probability and probablity densities
Normalise the values so that it follows normal distribution assumption

# Mean, standard deviation and probability densities
mean <- rep(0, length(iris.attributes))
sd <- rep(0, length(iris.attributes))
densities <- rep(0, length(iris.attributes))

# prior probability
prior <- rep(0, length(iris.classes))

# Posterior probability
posterior <- rep(0, length(iris.classes))

for(i in 1:length(iris.classes)){
  
  prior[i] <- nrow(iris[iris$Class == iris.classes[i], ])/nrow(iris) # prior probability
  
  for(j in 1:length(iris.attributes)){ 
    
    mean[j] <- sapply(iris[iris$Class == iris.classes[i],][j], mean) # Mean of attributes
    sd[j] <- sapply(iris[iris$Class == iris.classes[i],][j], sd) # standard deviation of attributes
    densities[j] <- dnorm(as.numeric(observation[j]), mean[j], sd[j]) # probability density of the new observation
  }
  
  posterior[i] <- prior[i]*prod(densities) # Posterior probability
}

Lets add labels to the observation

names(posterior) <- c("setosa","versicolor","virginica")# Add labels
print(posterior)

##     setosa versicolor  virginica 
##         NA         NA         NA

Prediction

predictedvalue <- round(posterior/ sum(posterior), digits = 3) # normalize
print(predictedvalue)

##     setosa versicolor  virginica 
##         NA         NA         NA

The model correctly predicts the class of the new observation as ‘Setosa’

Gaussian Naive Bayes

karthik

August 21, 2017

Steps in buliding Gaussian Naive Bayes Algorithm

Prediction