2
$\begingroup$

Suppose that there are $n$ independent samples $X_1,X_2,...,X_n$ sampled from the uniform distribution on $[0,1]$ with the pdf $f(x)=1$.

Is there a good way to calculate or approximate the probability that $k$ of the $n$ samples are in an interval $[a,a+b]\in[0,1]$? Here only $b$ and $k$ are fixed.

This is like saying there are $k$ samples "clumped" together in an interval of length $b$. Or more mathematically, the event that there exist some interval of length $b$ that covers exactly $k$ samples.

I know that the inclusion-exclusion principle can be used, but I can only get a long and complicated expression using the inclusion-exclusion principle. I'm wondering if there's a better way or if there's a good approximation.

  • 2
    Is $a$ fixed?$\ \ $2012-04-04
  • 2
    If $a$ and $b$ are known and the samples assume values from 0 to 1, then the probability of any one of those samples being in $[a, a+b]$ is simply $b$. The probability of exactly $k$ of the $n$ samples being in the interval is therefore Binomially distributed: ${n \choose k} b^k (1-b)^{n-k}$. Am I interpreting your question correctly?2012-04-04
  • 0
    "This is like saying there are k samples clumped together in an interval of length b" In a particular given interval? If so, see Bruno's answer. Now, the event that there exist _some_ interval of lenght b that covers exactly k samples is a different thing.2012-04-04
  • 0
    Only $b$ is fixed, $a$ can change. I just edited it.2012-04-04

0 Answers 0