Solving clustering problem for iris data set with R-Shiny application

Mykyta Zharov
14/12/2019

Owerview

Using R we can easily solve different clustering problems. There are different clustering algorithms and it is essentially to try different algorithms before stating the results. As an example the R Shiny apllication was built, that solves clustering problem for the iris data set, using the following algorithms:

  • K-means clustering algorithm
  • Model-based clustering algorithm using Gaussian distributions
  • K-medoids clustering

Links to the R Shiny Application and Git-hub repo

Application usage

In the application there are several inpus, which a user can change.

  • In the first dropdown menu a user can choose a clustering method to be used.
  • Using a slider, a user can select a number of desired clusters.
  • Using two dropdown menus at the end, a user can select 2 variables that will describe the data points.

Output Example

After selecting desired inputs, a user can observe the visual result of solving a clustering problem on the plot. For example, for K-means clustering and 3 clusters for variables Sepal.Length and Sepal.Width a user will see the following output: plot of chunk fig