I'm trying to prove the "tower property" of conditional expectations, $$ E[V\mid W] = E[\ E[V\mid U,W]\ \mid W\ ], $$ where $U$, $V$ and $W$ are any random variables. $E[X \mid Y]$ is itself a random variable $f(Y)$ where $$f(y) = E[X \mid Y = y) = \sum_x x\cdot Pr[X=x\mid Y=y].$$ Keeping this observation in mind, I still don't see why $U$ is "averaged out" when moving from the right hand side to the left side.
Tower property of conditional expectation
3
$\begingroup$
probability-theory
-
1The tower property is more simply/generally expressed as $E[E[V | U]] = E[V]$. Why do you add the conditioning on $W$? Nothing wrong with that, but you'd first try to prove/understand the more elementary formulation. – 2012-06-04
-
0Related: http://math.stackexchange.com/questions/41536/intuitive-explanation-of-the-tower-property-of-conditional-expectation – 2012-06-04
1 Answers
4
The last equality in your observation does not apply in general (i.e. if $X$ is not discrete). Let $U,V,W$ be random variables such that $V\in \mathcal{L}^1(P)$. In order to show that $$ E[V\mid W]=E[E[V\mid U,W]\mid W] $$ we note that the right hand side is indeed $\sigma(W)$-measurable, so we only need to check the defining equation, i.e. check that $$ \int_A V\,\mathrm{d}P=\int_A E[V\mid U,W]\,\mathrm{d}P $$ for all $A\in\sigma(W)$. Let such an $A$ be given. Then $A\in\sigma(W)\subseteq \sigma(U,W)$ and therefore $$ \int_A E[V\mid U,W]\,\mathrm{d}P=\int_A V\,\mathrm{d}P $$ and we are done.
-
0Could you explain why $\sigma(W) \subseteq \sigma(U,W)$ implies the last equation? – 2012-10-09
-
1Because if $A\in\sigma(U,W)$, then by definition of the conditional expectation $E[V\mid U,W]$ we have the last equality. This is the defining equality of conditional expectations. – 2012-10-09
-
0Thanks for your answer, it's formally clear to me. I'm still having trouble with the intuition: As you said, $A \in \sigma(W)$, e.g., $A$ could be an event $\{ W=w \}$. We know that $E[V \mid U,W]$ depends on both, $U$ and $W$, but becomes a constant for this event $A$ (which **only** depends on $W$), isn't this counterintuitive? – 2012-10-09
-
0It doesn't become a constant since it is still conditioned on $U$. – 2012-10-09