I have the following birthday data (just the head):
#pop Freq
1 2976
2 1238
3 738
4 467
5 352
6 243
This means that there are 2976 people whose birthday is unique in the data. (For example, there might be only 1 person with 1/1/1965 and only 1 person with 1/2/1957, and there are 2976 people in total like this.) There are 1238 people who share their birthday with exactly one other person, 738 people who share it with 2 other people, etc.
I can get the proportion of the population in each of these groups (Freq/n, where n is the total number of people). I was summing Freq/n over the groups to get the probability that, given a birthday, I can identify the person in the population. But I don't think this is correct.
How do I get the overall probability that, given only a birthday, I can determine who the person is in the overall population?
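
To make the calculation concrete, here is a minimal Python sketch of the Freq/n sum described above. It assumes Freq counts people (as I read the table), and it uses the total of the six rows shown as a stand-in for n, since only the head of the data is listed; the real n would be larger. The function names are made up for illustration. The second function is only one possible alternative, not a confirmed answer: it weights each group by 1/pop, on the assumption that someone who knows a birthday shared by pop people can pick the right person with chance 1/pop.

```python
# Head of the table from the question: pop (group size) -> Freq.
# Assumption: Freq counts people, not distinct birthdays.
head = {1: 2976, 2: 1238, 3: 738, 4: 467, 5: 352, 6: 243}


def shares(freq_by_pop, n):
    """Return Freq/n for each group size."""
    return {pop: freq / n for pop, freq in freq_by_pop.items()}


def weighted_by_group_size(freq_by_pop, n):
    """Sum of (Freq/n) * (1/pop) over all group sizes.

    Assumption: within a group of `pop` people sharing a birthday,
    the chance of picking the right person is 1/pop.
    """
    return sum((freq / n) / pop for pop, freq in freq_by_pop.items())


# Stand-in for n: only the head is available, so use its total.
n = sum(head.values())

plain_sum = sum(shares(head, n).values())
unique_share = shares(head, n)[1]
weighted = weighted_by_group_size(head, n)

print(f"n (head only):          {n}")
print(f"sum of Freq/n:          {plain_sum:.4f}")
print(f"share with unique date: {unique_share:.4f}")
print(f"sum of Freq/(n * pop):  {weighted:.4f}")
```

When every row is included, the plain sum of Freq/n is 1 by construction, which is why it cannot be the probability I am after. The share of people with a unique birthday (pop = 1) and the 1/pop-weighted sum are both below 1, and which of them is appropriate depends on what "determine who the person is" should mean.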