11
$\begingroup$

Three versions of the theorem are stated on pages 1-2 in these notes by Pete L. Clark: http://math.uga.edu/~pete/4400ChevalleyWarning.pdf

Could anyone offer some intuitive way to think about this theorem? Thanks!

  • 2
    I think that the name is *Waring*.2011-07-10
  • 12
    It is *Warning*. Waring was somebody else.2011-07-10
  • 0
    http://mathworld.wolfram.com/Chevalley-WaringTheorem.html I am relatively sure about this. Saw it as an undergrad, and Waring has done a lot of work in finite fields.2011-07-10
  • 0
    It doesn't really matter. But Ewald Warning was 20th century. Waring is 18th century. Wikipedia has it right, and mathworld does not. Odd coincidence of similarity of names!2011-07-10
  • 0
    Odd, indeed. Finite field folks often refer to Waring's formula, but I just learned that this dates back to the English mathematician from 18th century, so not from the same person. I will check Lidl&Niederreiter, when I go to my office, and report my findings here.2011-07-10
  • 1
    @Jyrki: I promise it's (Ewald) Warning. I asked a historical question about this theorem a couple of years ago on sci.math.research, and in response W.C. Waterhouse was kind enough to mail me a copy of the original paper.2011-07-11
  • 0
    @everybody: I didn't realize that my notes don't provide intuition! But I guess this word means different things to different people. Anyway, if in answering this question or participating in the discussion, you see some way for my notes to be improved, I would greatly appreciate your letting me know. This is one of my favorite theorems in all of number theory, and I tried to do it justice.2011-07-11
  • 6
    @Pete L. Clark: This might be a good opportunity to mention that I find your notes very helpful and clear. Thank you!2011-07-11
  • 1
    @everybody: I checked my sources. *Warning* it is. In my case the source of misunderstanding is probably the context I described in an earlier comment: in coding theory papers you see Waring's formula more frequently than Warning's theorem, but people have been exposed to both. Wonder where did it propagate to mathworld from?2011-07-11
  • 0
    @everyone: okay, the .pdf file is back up. I didn't make any mathematical changes in the file, but I corrected a typo in the sentence in which I discussed motivation: "One of the charms of the Chevalley-Warning Theorem is that it can be stated and appreciated without much motivational preamble. So let's just jump right in." The current question seems to cast some doubt on this sentence, although to be honest I'm still not sure exactly what the OP is looking for when she says "intuition" here.2011-07-11
  • 1
    @Jyrki: I don't know what mathworld's methodology is nowadays. It started as a labor of love by this one guy, Eric Weisstein, more than 10 years ago. While I was in grad school a friend indicated that in many instances the fact that it was written by one person who was not (understandably!) an expert in all topics covered showed through. Since then it got taken over by Wolfram and I guess a lot of Wolfram employees -- i.e., serious, mathematically minded people -- have looked at it, but it's not like wikipedia where *any* interested party can immediately chime in to make a correction.2011-07-11
  • 1
    So because of this it's hard to view mathworld as a reliable source in the most demanding sense, although certainly there's a lot of high-quality stuff there which is accurate most of the time. (In fact it's quite rare to find outright errors like this.) By this point I think wikipedia editors have gone over all the mathworld stuff and incorporated / improved it into their own articles. I would definitely trust wikipedia more, at least on an article that has been around for a while and looks "finished" like Chevalley-Warning.2011-07-11
  • 1
    @Pete L. Clark: I agree that the theorem doesn't need motivation and is very interesting by itself. However, I was hoping there could be some way (possibly geometrical, but not necessarily) that could give the feeling of "Oh, sure, it must be like that!" regarding the result(s).2011-07-11
  • 0
    @Pandora: okay, your last comment somehow contained some magic words: I am pretty sure I know what you want now. An answer will be forthcoming soon!2011-07-11
  • 0
    I'm really not into this buisness, but I recently found at the arXiv: [A counterexample to the geometric Chevalley-Warning conjecture](http://xxx.lanl.gov/abs/1307.7765). Maybe you or @Pete are interested...2013-08-18
  • 0
    @draks: I did see that paper when it was posted to the arxiv. But thanks.2013-08-18

3 Answers 3

14

I am now interpreting the question as "How might one come to believe that the Chevalley-Warning Theorem is true?" (More specifically, I will try to put myself in Emil Artin's shoes when he conjectured this result in the 1930's.)

Let us say that a field $K$ satisfies property $C_1(d)$ if any homogeneous polynomial equation $f(x_1,\ldots,x_n) = 0$ in $n$ variables of degree $d$ has a solution $(x_1,\ldots,x_n) \neq (0,\ldots,0)$ if $n > d$. Then a field is $C_1$ if it is $C_1(d)$ for all positive integers $d$.

Here are some observations about this property:

1) Any field is $C_1(1)$: a homogeneous linear equation in at least two variables has a nontrivial solution.

2) Suppose that a field $K$ is such that for all positive integers $d$, every homogeneous polynomial in $n = d$ variables has a nontrivial zero. Then $K$ is algebraically closed.

Indeed, if $K$ is not algebraically closed it admits for some $d > 1$ a degree $d$ field extension $L/K$. The norm form associated to $L/K$ is a homogeneous polynomial of degree $d$ in $d$ variables which has only the trivial solution.

This explains the condition that $n > d$: this is necessary to go beyond algebraically closed fields.

3) Now let's look at the property $C_1(2)$ of a field $K$: this says that any quadratic form in at least three variables over $K$ is isotropic: has a nontrivial solution. When the characteristic is not $2$, this is equivalent to the following: for any $a,b,c \in K^{\times}$, the equation $ax^2 + by^2 = c$ has a solution $(x,y) \in K^2$. (In other words, any plane conic has a $K$-rational point.) This type of equation was studied by number theorists at least since Fermat in the 1600's. Legendre for instance well knew that the field $\mathbb{Z}/p\mathbb{Z}$ has this $C_1(2)$ property. It is possible to give a very elementary proof by counting arguments: you rewrite the equation as $ax^2 -c = by^2$ and observe that the left hand side and right hand side each run through $\frac{p-1}{2} + 1 = \frac{p+1}{2}$ distinct elements of $\mathbb{Z}/p\mathbb{Z}$, hence they must take a common value. This proves Chevalley-Warning when $d = 2$ (and the characteristic is odd; you can certainly prove it in even characteristic by brute force if you want).

4) The next case of Chevalley-Warning is $d = 3$: that is, any cubic equation in at least four variables over a finite field has a nontrivial zero. The general study of cubic surfaces is already very hard, and I'm not sure what classical work there was on this. However, I believe that when Artin conjectured that C-W would hold he knew about Hasse's result that a smooth cubic equation in $3$-variables over a finite field has a nonzero rational point. Here the number of equations and variables are both equal to $3$, and so by 2) above this cannot possibly hold for all cubic equations in three variables: one can take the norm form of a cubic extension to get a counterexample.

But from a geometric perspective this counterexample is not very worrisome: over the algebraic closure, what one has is three planes meeting at the origin, but with the Galois group acting so that there is no component which is rationally defined over the ground field. Therefore the only possible $K$-rational points are the intersection points of the components which, if you're thinking about it affinely, are only the origin (and if you're thinking about it projectively, there are no intersection points). This is very different from a geometrically irreducible cubic equation, to which Hasse's Theorem applies.

Moreover, if you try to geometrically build the same example one dimension higher, it doesn't work: if you have three hyperplanes in $K^4$ all passing through the origin, their common intersection is a line, not just a point.

So I think it's at least plausible that Hasse's Theorem implies that a finite field is $C_1(3)$. Maybe you could actually prove this along the lines given above: I haven't tried, but it would be interesting.

5) Another thing you can do of course is pick your favorite values of $q,n,d$ and do the finite amount of computations necessary to see whether indeed every form of degree $d$ in $n$ variables over $\mathbb{F}_q$ must have a nontrivial zero. Presumably Artin did some of this. In this regard I cannot help mentioning Exercise 10.16 in Ireland & Rosen's classic number theory text: "Show by explicit calculation that every cubic form in $2$ variables over $\mathbb{F}_2$ has a nontrivial zero." As many students have discovered over the years, explicit calculation rather shows that this is false: e.g. $x_1^3 + x_1^2 x_2 + x_2^3$ has no nontrivial zero. See this MO answer for a bit more on this: it is in fact easy to show that for every finite field $\mathbb{F}_q$ and every pair $(n,d)$ with $n \leq d$, there is a form over $\mathbb{F}_q$ of degree $d$ and in $n$ variables with only the trivial zero.

6) It is easy to show that any $C_1$ field has vanishing Brauer group: for any central division algebra $D$ over $K$, the reduced norm is a homogeneous polynomial in $d^2$ variables of degree $d$ with only the trivial zero. The vanishing of the Brauer group of every finite field is equivalent to the statement that every finite division ring is a field, a celebrated 1907 theorem of Wedderburn that Artin was certainly very familiar with. This property really is weaker than $C_1$ in the sense that we now know of fields which have vanishing Brauer group of every finite extension but are not $C_1$ (I believe the first examples were constructed by Ax), but for "familiar" fields the two properties tend to be equivalent. Thus this gives at least a lot of nontrivial examples of Chevalley-Warning.

Beyond that, it seems to me that Artin made a fairly gutsy conjecture here, unless he knew more than he let on. It is perhaps worth noting that he also conjectured that every $p$-adic field has property $C_2$ (a homogeneous form in $n$ variables of degree $d$ has a nontrivial zero if $n > d^2$) and that the maximal abelian extension of $\mathbb{Q}$ has property $C_1$. The former conjecture was eventually proven false by Terjanian (although there is a lot of truth in it...), and the latter conjecture remains wide open to this day, and I don't know any really compelling reasons for (or against) it.

  • 0
    +1. OK, you win. This is much better as "intuition" than a list of fields with the property C1 and a ramble.2011-07-14
  • 0
    Anyway, I have to mention the Ax-Kochen theorem (http://en.wikipedia.org/wiki/Ax%E2%80%93Kochen_theorem) as an elaboration of your statement that there is a "lot of truth" in Artin's conjecture, and add a link to Alex Kruckman's thesis: http://math.berkeley.edu/~kruckman/thesis.pdf (though I have not read it, it looks interesting)2011-07-14
  • 0
    +1.Wow! I had felt that Chevalley-Warning was mostly a curiousity. A very illuminating answer, indeed.2011-07-14
11

Near the end of his notes, Pete mentions some connections to $p$-adic cohomology, which is a very sophisticated alebraic geometry topic. I'm going to sketch how that connection works, and then show that you can actually understand these connection using "only" the level of algebraic geometry you'd learn in Hartshorne. Of course, motivation that goes up to the level of Hartshorne may or may not be what you are looking for!

For simplicity, take the case of a single polynomial $P$ which is homogenous of degree $d$. (Challenging exercise: Show that proving CW in this case implies the full result in Pete's notes.)

Let $Z_{aff}$ be the number of $P$ in $\mathbb{F}_q^{n}$. The center of every proof I know is showing that $Z_{aff} \equiv 0 \mod p$. Let's count zeroes in the projective space $\mathbb{P}_{\mathbb{F}_q}^{n-1}$; call this count $Z_{proj}$. So we have $Z_{aff} = (q-1) Z_{proj} +1$ and $Z_{aff} \equiv 0 \mod p$ if and only if $Z_{proj} \equiv 1 \mod p$.

Let $X$ be the hypersurface $P=0$ in $\mathbb{P}^{n-1}$. In the $p$-adic approach Pete sketches, one would define cohomology groups $H^i(X)$, for $0 \leq i \leq 2 \dim X = 2(n-2)$. These would come with an action of the $q$-power Frobenius $F$, and we would have $$\# X(\mathbb{F}_q) = \sum_{i=0}^{2 \dim X} (-1)^i \mathrm{Tr} \left(F^* : H^i(X) \to H^i(X) \right).$$ One would then show that the first term is $1$, and the other terms are divisible by $p$, completing the proof. Additionally, there would be complications resulting from the fact that $X$ may not be smooth.

However, you don't have to learn these complicated cohomology theories in order to give a cohomological proof of the CW theorem. Instead, you can use Fulton's fixed point formula! This states: $$\# X(\mathbb{F}_q) \equiv \sum_{i=0}^{\dim X} (-1)^i \mathrm{Tr} \left( F^*: H^i(X, \mathcal{O}) \to H^i(X, \mathcal{O}) \right) \mod p$$

Here the cohomology groups are sheaf cohomology, as covered in Hartshorne. Note that the right hand side only makes sense modulo $p$. One advantage to this formula, besides that it uses a more elementary cohomology theory, is that there are no smoothness hypotheses on $X$.

Now, $H^0(X, \mathcal{O})$ is $1$ dimensional and the frobenius action is trivial, so that contributes $1$. By the Lefschetz hyperplane theorem, $H^j(X, \mathcal{O})$ vanishes for $0 < j < \dim X$. So we have $$\# X(\mathbb{F}_q) \equiv 1 + (-1)^{\dim X} \mathrm{Tr} \left( F^*: H^{\dim X}(X, \mathcal{O}) \to H^{\dim X}(X, \mathcal{O}) \right) \mod p$$

So far, we have not used that $d

A fun challenge is to work out the case that $d=n$. The elementary computation in Pete's notes shows that $\#X(\mathbb{F}_q)$ is something like $1$ plus the coefficient of $x_1^{q-1} \cdots x_n^{q-1}$ in $P^{q-1}$. (I'm probably off by a sign somewhere.) Challenge: Show that, when $d=n$, $H^{\dim X}(X, \mathcal{O})$ is one dimensional and the scalar by which Frobenius acts is the coefficient of $x_1^{q-1} \cdots x_n^{q-1}$ in $P^{q-1}$.

  • 1
    This is very nice; thanks for posting it.2011-07-19
7

I don't know if this is relevant, but the Chevalley-Warning theorem does show that finite fields have a rather important property that comes up in geometric applications (in etale cohomology, for instance).

A field $k$ is called C1 or quasi-algebraically closed if every polynomial in $n$ variables with coefficients in $k$ and homogeneous of degree $n>d$ has a nontrivial root. Being C1 means that the field's Brauer group is trivial, and consequently controls the Galois group quite a bit: if the field is C1, then the Galois cohomology of a torsion module vanishes in degrees $>1$. So it's quite useful to know that a field is C1.

Clearly algebraically closed fields are C1, but there are three other important examples:

  1. Finite fields (this is the Chevalley-Warning theorem)
  2. A field of transcendence degree one over an algebraically closed field (i.e. the rational function field of an algebraic curve). This is "Tsen's theorem."
  3. The maximal unramified extension of a local field (this is in Serge Lang's thesis).

Note by contrast that a field like $\mathbb{R}$ is not C1, and in fact its absolute Galois group $\mathbb{Z}/2\mathbb{Z}$ has torsion cohomology in arbitrarily large (i.e., all even) degrees.

  • 0
    Incidentally, theorems 2 and 3 are quite hard! For instance, the only proof I know of th. 2 (which I believe is the simplest) uses a weak form of Riemann-Roch. I haven't read Lang's thesis, but it surely looks harder than Chevalley-Warning. It seems that finite fields are the easiest case of a non-algebraically closed field being C1.2011-07-11
  • 0
    $@$Akhil: what you say is of course all true. Does it count as intuition for Chevalley-Warning? (I don't know the answer to this...)2011-07-11
  • 0
    @Pete: I find it helpful to learn that isolated results, however interesting in themselves, are part of a larger story, and it helps me remember the result (For instance, the rather ad-hoc index-pushing in Cech theory is just a small piece of a larger simplicial picture -- this is the only way I ever could think of the fact that two "refinining" maps of Cech covers induce the same morphism in Cech cohomology. A result which, incidentally, can be overlooked even by mathematicians: when I pointed this out to my teacher last term, he responded that he had never asked thequestion.)2011-07-11
  • 0
    @Pete: contd. So, I always found it helpful and enlightening when I was made aware of such deeper meanings to results, though this may be nothing more than a personal preference (which the OP may or may not share, of course).2011-07-11
  • 0
    @Pete: OK. It's possible I interpreted Pandora's question somewhat loosely, since they did ask for intuition for "thinking about" the theorem and this is how I (now) think about C-W; if they had asked for intuition for the theorem itself, I would not have posted it. No idea if my interpretation was at all accurate, though.2011-07-11