I am doing a course in Machine Learning, and I am having some trouble getting an intuitive understanding of maximum likelihood classifiers. What I am trying to do is to perform Principal Component Analysis on the Iris Flower Data Set, and then classify the points into the three classes, i.e. Setosa, Versicolor, Virginica.
So I take some points from each class of Iris to be used as training data, and transform it onto the principal component axes. Next I want to model each class as a multivariate Gaussian Distribution, estimating the parameters of each class from my training data. However, this is where I get stuck. I will try and explain this graphically using some (made-up) sample data:
Class 1 Class 2 Class 3 1 2 1 4 4 2 3 1 6 8 1 2 4 7 8 1 2 3 5 3 7 7 5 4 1 2 4 5 1 5 4 2 1 6 8 3
If this is the data for three different classes, with four observations and three features each, how would I go about determining the mean and covariance matrix for each class?
My original thoughts were that the mean of class 1 would be [mean(1,2,1,4), mean(4,7,8,1), mean(1,2,4,5)], but then what about the covariance?
I hope that I have explained this well enough, any help is greatly appreciated!
Thanks
Casper
PS. Also, this is my first time posting on the Mathematics part of StackExchange, so feel free to point out any etiquette-rules that I might have inadvertently violated.