$\begingroup$

We are using character strings, drawn from an alphabet of 64 possible characters, to identify unique elements in a 'bag' of data. Each bag has 20 elements on average, but can have up to 300.

I am trying to determine how long that string should be so that the probability of a collision (if we pick the characters randomly) is less than 1 in 1,000,000 for 20 elements, and then for 300 elements.
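
To make the setup concrete, here is a minimal sketch of the calculation, assuming this is the standard birthday problem: each of the $n$ elements in a bag independently gets a uniformly random string of length $k$, so there are $N = 64^k$ possible strings, and a collision means two elements in the same bag share a string. The probability of no collision is then $\prod_{i=0}^{n-1}(1 - i/N)$, and for small probabilities the collision probability is roughly $n(n-1)/(2N)$. The function names `collision_probability` and `min_length` are my own, chosen for illustration.

```python
import math


def collision_probability(n: int, num_strings: int) -> float:
    """Exact birthday-problem probability that at least two of n
    uniformly random picks from num_strings values coincide."""
    if n > num_strings:
        return 1.0  # pigeonhole: more elements than distinct strings
    # Sum logs of (1 - i/N) for numerical stability; log1p/expm1 keep
    # precision when the probabilities involved are tiny.
    log_no_collision = sum(math.log1p(-i / num_strings) for i in range(n))
    return -math.expm1(log_no_collision)


def min_length(n: int, alphabet: int = 64, target: float = 1e-6) -> int:
    """Smallest string length k such that the collision probability
    among n elements is below target (assumes uniform random strings)."""
    k = 1
    while collision_probability(n, alphabet ** k) >= target:
        k += 1
    return k


print(min_length(20))   # 5
print(min_length(300))  # 6
```

Under these assumptions, 5 characters suffice for a bag of 20 (probability about $1.8 \times 10^{-7}$) and 6 characters for a bag of 300 (about $6.5 \times 10^{-7}$). Note that this is the probability per bag; it does not cover collisions across different bags.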

2 Answers