I have a dataset of 100 cases. Each case has a class in {I, II, III, IV, V} and two values, A and V. Each class appears exactly 20 times in the dataset:
    Class  A  V
    5      2  3
    1      3  5
    3      2  3
    2      3  5
    3      2  3
    1      2  4
    1      2  4
    1      4  4
    2      3  3
    2      3  4
I want to generate another 100 cases based on this set. Am I correct in assuming that I should:
- find the distribution of A and the distribution of V per class,
- calculate the joint distribution of A and V per class, and
- draw random numbers from this joint distribution?
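To make these steps concrete, here is a minimal sketch in Python of what I have in mind. It assumes the cases are stored as (class, A, V) tuples and that A and V are discrete; the function names `empirical_joint` and `sample_cases` are hypothetical, and the input is only the ten rows shown above, not the full dataset.

```python
import random
from collections import Counter, defaultdict

# Illustrative input: only the ten (class, A, V) rows listed in the question.
rows = [
    (5, 2, 3), (1, 3, 5), (3, 2, 3), (2, 3, 5), (3, 2, 3),
    (1, 2, 4), (1, 2, 4), (1, 4, 4), (2, 3, 3), (2, 3, 4),
]

def empirical_joint(rows):
    """Count how often each (A, V) pair occurs within each class."""
    joint = defaultdict(Counter)
    for cls, a, v in rows:
        joint[cls][(a, v)] += 1
    return joint

def sample_cases(joint, n_per_class, rng):
    """Draw n_per_class (A, V) pairs per class, weighted by observed counts."""
    out = []
    for cls in sorted(joint):
        pairs = list(joint[cls])
        weights = [joint[cls][p] for p in pairs]
        # Sampling whole pairs keeps A and V together, so no independence
        # assumption is made in this sketch.
        for a, v in rng.choices(pairs, weights=weights, k=n_per_class):
            out.append((cls, a, v))
    return out

rng = random.Random(0)  # fixed seed so the run is repeatable
joint = empirical_joint(rows)
new_cases = sample_cases(joint, n_per_class=20, rng=rng)
```

Because whole (A, V) pairs are drawn, whatever relation exists between A and V in the observed data carries over to the generated cases. Class 4 does not occur in the ten rows above, so this excerpt yields 80 cases; with all five classes present it would yield 100.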
I am confused about whether I should assume a relation between A and V myself, or whether any relation should come from the joint distribution (and otherwise assume they are independent).
Thank you