The project was done using Python and Jupyter Notebook. Some libraries include, but are not limited to:
For in-depth information on the complete project, click the "Visit" button above.
This is a project where I had to implement my own code that performs nested cross-validation and the k-nearest neighbour ML algorithm, build confusion matrices, and estimate distances between data samples.
The purpose of this project was to help me:
The K Nearest Neighbor algorithm is a supervised machine learning algorithm that relies on labelled input data to a function that assigns the appropriate label to new unlabelled data.
There are a couple of basic steps that need to be taken into account in order for the algorithm to work and these are:
Below is a screenshot of the grid plot with the different properties of the wine dataset that was used in the project.