In this article, the dual perceptron implementation and some non-linear kernels are used to separate out some linearly inseparable datasets.
The following figure shows the dual perceptron algorithm used.
Few 2-D training datasets generated uniformly randomly and the binary class labels also generated randomly. Then the dual implementation of the perceptron with a few non-linear kernels, such as Gaussian (with different bandwidth), Polynomial (with different degrees) were used to separate out the positive from the negative class data points. The following figures show the decision boundaries learnt by the perceptron upon convergence.