6
$\begingroup$

I don't have any knowledge of statistics beyond high school common sense. Why is the standard deviation usually seen in combinatorics textbooks, and why is the standard deviation defined intentionally? What is its purpose?

Thanks in advance. I have checked in Wikipedia and many other websites to see what this is, but they are not very concise and clear.

  • 0
    Related: [Motivation behind standard deviation?](http://math.stac$k$e$x$change.com/q/4787/856)2012-03-25

4 Answers 4

3

Given a sample $x = \{x_1,x_2,\ldots,x_n\}$ the sample standard deviation is defined as $ \operatorname{sd}(x) = \sqrt{\frac{1}{n} \sum_{k=1}^n(x_k - \bar{x})^2 } $ where $\bar{x} = \frac{1}{n} \sum_{k=1}^n$ denotes the sample mean.

Several properties of the standard deviation are worth noting.

  • It does not change if all the values in the sample are shifted by the same amount, $x_k \mapsto x_k + a$.
  • If all the values in the sample are multiplied by a common factor, i.e. $x_k \mapsto \lambda x_k$, the standard variation also gets multiplied by its absolute value, i.e. $\operatorname{sd}(\lambda \cdot x) \mapsto | \lambda | \cdot \operatorname{sd}( x)$.
  • Standard deviation is bounded from above by the maximal deviation from the mean: $\operatorname{sd}(x) \leqslant \sup_k | x_k - \bar{x} |$. In particular standard deviation of the sample with identical elements is zero.

Because of these properties, standard deviation is a measure of scale of the data, which indicates extent the sample elements deviate from the mean, hence the name.

  • 0
    @c$a$rdinal Thanks for the $c$omment, I have up$d$ated the post.2012-03-25
7

The original poster asked a further question in a comment:

"Why on the sole purpose for what its being use you are saying, there are no other formula could replace it and share the same popularity as the standard deviation?"

The mean absolute deviation can also be used as a measure of dispersion, but it lacks one kind of well-behavedness that makes the standard deviation superior to it: If you have independent random variables $X_1,\ldots,X_n$, then the standard deviation of the sum $X_1+\cdots+X_n$ is the square root of the sum of the squares of the standard deviations of the individual terms. No such neat formula is known for the mean absolute deviation.

  • 0
    Usually you'd say "the variance of the sum of the random variables is the sum of the variances of each random variable", though. :P2013-01-15
6

Standard deviation is a measure of how the observations are located in relation to the mean. For example if you have a set of 2 observations : $ 8 \ ,\ 2\ $ Their mean is $\bar{x}=\frac{8+2}{2}=5$. But as you can see they are not very "close" to the mean. On the other hand a set of another 2 observations like:$ 4 \ , \ 6$ has a mean $\bar{x}=\frac{4+6}{2}=5$ too,but those values are closer to the mean. Therefore the first set has a large standard deviation (SD=3), while the second set has a smaller one(SD=1). These two numbers say that every observation is about 3 away from the mean in the first case, while the other observations are 1 away from the second mean. It's very important in statistics to know the diversity and variability of a set of data in order to analyze them and come to useful conclusions about the population or the sample you're observing.

  • 0
    @Victor there are others quantities we might compute. The popularity of the standard deviation has more to do with the fact that the variance is more tractable than other measures of dispersion and that it just seems to pop up more frequently than other measures of spread, e.g. the standard deviation and the mean are the relevant quantities in the central limit theorem.2012-03-24
6

I never felt that I understood the fuss about the standard deviation, until I encountered Chebyshev's inequality.

Suppose you have a bunch of data. Clearly, the mean of the data is somewhere in the middle, and the data are spread out more or less far from the mean. The standard deviation says how far away they are, in the following sense: no matter how the data are distributed, and no matter how spread out they are, at most $1/n^2$ of them are more than $n$ standard deviations from the mean.

For example, if the standard deviation is 2 units, then all but at most 1/9 of the data are between $m - 6$ and $m + 6$, where $m$ is the mean. Often the data will cluster more closely around the mean than this, but even if you know nothing else, you can guarantee that no more than 1/9 of the data are more than 6 units from $m$.

  • 0
    This is in fact the reasoning behind buzzwords like "six sigma" being thrown around...2012-03-25