12

Edit 3: OK, I had an insight, inspired in part by Ben Blum-Smith's comment, and the post he linked to. (I have no idea if this insight is right; it's barely a hunch, and that's why I'm not submitting it as an answer to my original question [see below], but I'll throw it out there for criticism.) I vaguely remember a theorem on the rearrangement of the terms of infinite series showing that, at least for some series, for any $L$, it is possible to find a rearrangement of the terms that causes the rearranged series to "converge" to $L$. IIRC, this fact was somehow related to why such series are somewhat difficult to work with. In any case, allowing any rearrangement of the terms, it seems, causes problems down the line, but if we were able to agree upon some reasonable canonicalization of the rearrangement procedure, or to put it differently, if we could find a reasonable way to define admissible rearrangements, this could be the basis for a new definition of the sum of an infinite series such that more series would converge (i.e. such that there would be a more robust definition of the series' limiting value). One such canonicalization would go like this: divide the range of the series by some standard mesh (e.g. by numbers of the form $m/2^n$, for some fixed integer $n$, and all integers $m$ in some suitable interval). Now rearrange the series by collecting together all the terms of the original series whose values lie within the same cell of the mesh, and ordering these "grouped terms" in decreasing order of their cell's value. Then the series could be approximated by a weighted sum of these cell values. This canonicalized sum would be immune to the pathologies caused by arbitrary rearrangements. One would then need to show that this procedure behaves reasonably as one lets the size of the mesh go to zero. Could something like this lead to the Lebesgue integral?

Edit 4: One point I did not make sufficiently clear in "Edit 3" is that one could rationalize the concern with rearrangements by noting that one property that would be desirable in a definition of an integral is that it remains invariant relative to the order in which the sum is performed. The standard definition of the Riemann integral, as a sum that is ordered along the domain of integration, is susceptible to all the problems that one runs into when reordering infinite series (as alluded to above), which is unfortunate, considering that, when one works with sufficiently general functions, the ordering of the domain of integration should be irrelevant to the value of the integral: it should be possible to rearrange the domain of integration in fairly crazy ways and end up with the exact same integral. In contrast, the procedure described in Edit 3 eliminates this dependence on any particular ordering of the domain of integration, and replaces it with a focus on the ordering of the range, which at least has a meaningful relationship to the value of the integral. Also, it is clear why this alternative viewpoint shifts the focus to the problem of defining "measures" on sets, since these "measures" are precisely the weights assigned to the grouped terms in the new summation procedure.
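
A minimal numerical sketch of the grouping procedure from Edit 3, under simplifying assumptions: a finite list of sampled function values stands in for the terms, every sample carries the same weight (playing the role of the "measure"), and the mesh is $m/2^n$ for a fixed $n$. The names `range_grouped_sum` and `mesh_n` are hypothetical, chosen only for this illustration.

```python
import math
import random
from collections import Counter


def range_grouped_sum(values, weight, mesh_n):
    """Approximate sum(v * weight for v in values) by grouping along the range.

    Each value is assigned to the mesh cell [m * 2**-mesh_n, (m + 1) * 2**-mesh_n)
    that contains it, and the cell's lower endpoint is used as its value.
    """
    cell = 2.0 ** -mesh_n
    # Count how many values land in each cell; the order of `values` is irrelevant.
    counts = Counter(math.floor(v / cell) for v in values)
    # Sum the cells in decreasing order of cell value, each weighted by
    # the total weight of the terms grouped into it.
    return sum(m * cell * (k * weight)
               for m, k in sorted(counts.items(), reverse=True))


# Sample f(x) = x**2 at N midpoints of [0, 1]; each sample has weight 1/N.
N = 10_000
samples = [((i + 0.5) / N) ** 2 for i in range(N)]

# "Rearrange the domain" by shuffling the samples.
shuffled = samples[:]
random.Random(0).shuffle(shuffled)

a = range_grouped_sum(samples, 1 / N, mesh_n=10)
b = range_grouped_sum(shuffled, 1 / N, mesh_n=10)
```

Here `a` and `b` agree exactly, because the grouped sum only sees how much weight falls into each cell of the range, and both differ from $\int_0^1 x^2\,dx = 1/3$ by at most roughly the mesh size $2^{-10}$.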


[Original post]

I understand the definitions of the Lebesgue integral and of the Riemann integral, but it is not obvious to me from these definitions that the former would be the more suitable of the two for, say, probability theory, or for defining the Fourier transform, or for defining inner products in Hilbert space, etc. What was exactly the insight that led to the Lebesgue integral as a superior alternative to the Riemann integral?

Thanks!


Edit 1: after looking at the comments/answers so far (with one exception) I realized that the leading wording of my question was quite inept. I've highlighted the part of my original question that is least ineptly put. I know that Lebesgue integrals have all the advantages cited, but it is utterly mystifying to me how anyone could have seen ahead of time that this had to be the case. What made this alternative approach (of the Lebesgue integral) look promising? I know that it's perfectly possible that there was no clue that this approach would be fruitful: someone (Lebesgue, I suppose) just tried it, played with it for a while, and discovered all these unsuspected benefits, and had the latter not happened, the new approach would have been quietly forgotten. But, just in case there is some clue (even a "clue-in-hindsight") of the alternative POV's superiority, I would love to see it. (The "one exception" I mentioned above is Christian Blatter's answer, which goes in the direction I was trying to get at, but I'd like to give my question a second shot.)

Edit 2: To be fair to the commenters, I have not finished digesting all the previous posts linked to in the comments; they may very well be what I was looking for. (In particular, the one with the metaphor featuring a shop owner, etc., may be just the thing.)

  • 5
    Did you see [this previous question](http://math.stackexchange.com/questions/7436/lebesgue-integral-basics/7444#7444)? As to why the Lebesgue integral is "superior", it is simply that every function that is Riemann integrable is also Lebesgue integrable, but there are functions that are Lebesgue integrable (because they have "nicely behaved images") but are *not* Riemann integrable. That's pretty much what makes it "better": it applies to strictly more things, and when they both apply they yield the same answer.2012-01-24
  • 0
    @Arturo: I spent the past 5 minutes searching MO and MSE for this answer... guess going in descending order according to question votes wasn't the best way. Or maybe I was thinking of another answer that points out the differences (pros/cons) in Riemann, Riemann-Stieltjes, and Lebesgue integrals...?2012-01-24
  • 1
    @TheChaz: Having written the answer, I had a leg up on you on finding it. (-:2012-01-24
  • 1
    @TheChaz: The other post you were thinking of might have been [this one](http://math.stackexchange.com/questions/32217/how-to-compute-riemann-stieltjes-lebesgue-stieltjes-integral/32385#32385) (discussing the FTC for Lebesgue, Riemann-Stieltjes, and Lebesgue-Stieltjes integrals), or maybe [this one](http://math.stackexchange.com/questions/47285/fundamental-theorem-of-calculus) which talks about several different kinds of integrals (all based on Burk's **A Garden of Integrals**).2012-01-24
  • 0
    Ah yes, @ArturoMagidin. I should have just searched your answers! (It was the former in this most recent comment).2012-01-24
  • 1
    @kjo: The fact that, on top of that, you have some nice convergence theorems that you don't have with Riemann integrals is just gravy. For some discussion of the FTC and convergence theorems for different types of integrals see [this answer](http://math.stackexchange.com/questions/47285/fundamental-theorem-of-calculus), but I definitely recommend Burk's book as a great resource, and a very good read.2012-01-24
  • 1
    I would say the convergence properties are the most important -- they ensure that $L^p(\mu)$ is a Banach space.2012-01-24
  • 2
    As mentioned in one of the answers to the question linked by Arturo Magidin, a metaphor used by Lebesgue himself to describe the difference between his integral and Riemann's is a shop owner totaling up the money earned in a day. Riemann's integral keeps a running tally, adding in each new amount as it comes in. (I.e. adds up $f(x_i^*)(x_{i+1}-x_i)$ left-to-right.) Lebesgue's integral first organizes the money by denomination and counts the number of bills in each denomination before adding. (I.e. finds $m(\{y_i\leq f\ldots$ 2012-01-24
  • 0
    @BenBlum-Smith: the shop owner's metaphor is a nice one. What still mystifies me is that anyone would have guessed that, of the two seemingly equivalent methods, the second one is in fact vastly superior, enough so to elbow out the other (well established/traditional) method.2012-01-24
  • 0
    @BenBlum-Smith: I added what amounts to an elaboration of your comment to the original post. Thanks!2012-01-24
  • 1
    @kjo: Have you had a look at Lebesgue's original papers introducing this integral? An English translation of some excerpts is available as part of Stephen Hawking's anthology _God created the integers_. The first paragraph of the preface, at least, is rather amusing...2012-01-24
  • 0
    @ZhenLin: the one about the historian of Babylonian astronomy? If so, it's pretty apt. I look forward to reading those excerpts. I had already seen the observation that Lebesgue's insight had to do with the partitioning of the range instead of the domain, but why this new approach proved to be more fruitful was not clear to me until today: the Riemann-style partitioning is completely divorced from the value of the integral, so it is impossible to canonicalize it in a way that renders the definition of the integral robust to rearrangements.2012-01-24
  • 2
    @kjo: One reason why it should be clear that partitioning the range is likely to be "better" is that if the function is "nice" they amount to the same thing, but there are plenty of functions that are very nasty in how they behave in the domain, but very reasonable in their codomain: Dirichlet's function ($f(x)=0$ if $x\in\mathbb{Q}$, $f(x)=1$ if $x\notin\mathbb{Q}$) is an obvious example: very nasty in its domain, but the range is *very* simple.2012-01-24
  • 0
    @ArturoMagidin: yes, but it is not obvious to me that there wouldn't be situations where everything hinged on the nastiness/niceness of the domain. What was missing in my earlier exposure to these questions was the observation (obvious in retrospect) that, *when it comes to defining an **integral**, the only nastiness/niceness that matters is that of the range*, so the Riemann-style definition has the accent on the wrong syLLAble, so to speak. (Admittedly, the step I had to make to finally "get" all this is decidedly tiny; that it took me so long to make it is somewhat embarrassing.)2012-01-24

3 Answers

8

Edit (in response to your 3rd edit)

It seems you're really close to considering the concept of the gauge integral. One of the problems with the Riemann integral is that the definition is very restrictive: For every $\varepsilon > 0$ there must exist a tagged partition $P$ such that for all finer partitions $P^*$ [$\ldots$].

But what if we had some way to throw "bad" partitions away? Then there would be fewer obstructions to being integrable and thus it would be easier for this process to "converge" (q.v. Wikipedia::Net for a more precise definition) - and that's exactly what the gauge integral does.


Original post

What made measure theory look fruitful? To answer this question one needs to understand that Lebesgue's ideas weren't conceived ex nihilo. Lebesgue was certainly familiar with Jordan's work on the subject and from this he would surely have known that a bounded function $f: [a,b] \to \mathbb R$ is Riemann integrable if and only if the two sets $$\begin{align*} A_+ &:= \{(x,y) \in \mathbb R^2 \;|\; a \leq x \leq b \land 0 \leq y \leq f(x)\} \text{ and} \\ A_- &:= \{(x,y) \in \mathbb R^2 \;|\; a \leq x \leq b \land f(x) \leq y \leq 0\} \end{align*}$$ are Jordan measurable. And in that case we have $$\int_a^b f(x)\,dx = m(A_+) - m(A_-)$$ where $m$ is the Jordan measure.

The Jordan measure is only finitely additive, so Lebesgue could conclude that to get something new he'd have to consider at least countable additivity - and since uncountable additivity is right out (an interval is made up of uncountably many singletons each of measure $0$), that would be a natural starting place.
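
A standard worked example (not taken from Lebesgue's papers) of what countable additivity buys. Here $\lambda$ denotes Lebesgue measure, a symbol introduced only for this illustration, and $q_1, q_2, \ldots$ is any enumeration of $\mathbb Q \cap [0,1]$. Each singleton has measure $0$, so countable additivity gives

$$\lambda\bigl(\mathbb Q \cap [0,1]\bigr) = \sum_{k=1}^{\infty} \lambda(\{q_k\}) = \sum_{k=1}^{\infty} 0 = 0.$$

Finite additivity cannot reach this conclusion: the set $\mathbb Q \cap [0,1]$ is dense in $[0,1]$ and has empty interior, so its outer Jordan content is $1$ while its inner Jordan content is $0$, and it is not Jordan measurable at all.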

What I find to be the true genius in Lebesgue's work on the integral is his idea to start from a list of requirements and work towards constructing something that satisfies the items on the list. This was quite a novel approach at the time, and it has become so commonplace that when students today are introduced to the Lebesgue integral the idea of a list of desired properties shouldn't seem novel at all.

  • 0
    Thanks for the pointer to Jordan's work. I realize now that what I had regarded as Lebesgue's main contribution (the shift of emphasis to analyzing the range of the integrand) was somewhat off-base, because the decisive aspect (it seems to me) of that change of emphasis (namely, segregating the positive and negative parts of the integrand) was already present in the definition of Jordan's measure, and that Lebesgue's extension of the Jordan measure was a step that required an independent, novel idea, something I had not fully appreciated before.2012-01-24
13

The space of $L_1$ functions - functions that are Lebesgue integrable - is a complete metric space - so Cauchy sequences have limits. So a limit (in the appropriate sense) of Lebesgue integrable functions is Lebesgue integrable. This is not true for Riemann integrable functions, and this is what makes the notion of Riemann integration weak. In fact one can obtain the $L_1$ functions as the completion of the Riemann integrable functions with respect to the $L_1$ distance, skipping the notion of Lebesgue integration entirely. And the ability to take limits turns out to be very useful once one is doing analysis.
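
One standard way to see the failure for Riemann integrable functions, sketched here as an illustration (the notation $C$, $U_n$, $f_n$ is introduced only for this example): build a "fat" Cantor set $C \subset [0,1]$ by removing, at step $k$, $2^{k-1}$ open middle intervals of length $4^{-k}$ each. Let $U_n$ be the union of the intervals removed in steps $1,\ldots,n$ and let $f_n := 1_{U_n}$. Each $f_n$ is a step function, hence Riemann integrable, and for $m < n$

$$\int_0^1 f_n(x)\,dx = \sum_{k=1}^{n} \frac{2^{k-1}}{4^{k}} = \frac12\bigl(1 - 2^{-n}\bigr), \qquad \int_0^1 |f_n(x) - f_m(x)|\,dx = \sum_{k=m+1}^{n} \frac{2^{k-1}}{4^{k}} \le 2^{-m-1}.$$

So $(f_n)$ is Cauchy in the $L_1$ distance, and $C$ has Lebesgue measure $1 - \tfrac12 = \tfrac12$. The limit is $1_U$ with $U := \bigcup_n U_n = [0,1] \setminus C$, which is discontinuous at every point of $C$, a set of positive measure, so by Lebesgue's criterion it is not Riemann integrable. It is also a standard fact that changing $1_U$ on a set of measure zero does not repair this.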

9

As others have pointed out, Lebesgue's integral gives better theorems, the main reason being that the universe of integrable functions is much larger than for Riemann's integral. The question remains why this is the case.

The intuitive explanation is the following: A function is integrable according to Riemann if it can be sufficiently well "realized" or approximated by linear combinations of functions $1_Q$, characteristic functions of ordinary Euclidean boxes $Q\subset{\mathbb R}^n$. On the other hand a function is integrable according to Lebesgue if it can be sufficiently well "realized" or approximated by linear combinations of functions $1_A$, characteristic functions of arbitrary measurable sets $A\subset{\mathbb R}^n$. As such sets can have pretty crazy shapes there is much more flexibility in this way.
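
For concreteness, here is the usual textbook form of such an approximation, written for a nonnegative measurable $f$ on ${\mathbb R}^n$. The symbols $\phi_k$, $A_{k,j}$, $B_k$ and $\lambda$ (Lebesgue measure) are introduced only for this sketch:

$$\phi_k := \sum_{j=0}^{k2^k-1} \frac{j}{2^k}\,1_{A_{k,j}} + k\,1_{B_k}, \qquad A_{k,j} := \Bigl\{x : \frac{j}{2^k} \le f(x) < \frac{j+1}{2^k}\Bigr\}, \qquad B_k := \{x : f(x) \ge k\}.$$

The sets $A_{k,j}$ and $B_k$ are cut out by slicing the range of $f$, so they are measurable but need not resemble boxes. One has $0 \le \phi_k \le \phi_{k+1} \le f$ and $\phi_k \to f$ pointwise, and by monotone convergence

$$\int f\,d\lambda = \lim_{k\to\infty} \Bigl(\sum_{j=0}^{k2^k-1} \frac{j}{2^k}\,\lambda(A_{k,j}) + k\,\lambda(B_k)\Bigr).$$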