Iris Classification
In 1936, Ronald Fisher published his paper The use of multiple measurements in taxonomic problems , the basis of which was a small data set containing some attributes of iris flowers. The data set is very straightforward, and describes 150 flowers with four measurements each: sepal length, sepal width, petal length, and petal width. Additionally, it lists the species of each iris, of which there were three different types equally represented in the data. These species are Iris setosa, Iris virginica, and Iris versicolor. Despite the simple nature of this data set, it would go on to become one of the most commonly used within machine learning, and it remains a standard test case for classification techniques to this day. It is also the data set that I have decided to examine for my most recent project. Now to explain a bit about this project. The primary impetus for it was very basic: I have worked almost exclusively...