I'm trying to teach myself some notes on Local Fields, in advance for a course I will be taking next year. I covered some algebraic number theory a little while ago, but I'm having trouble understanding the notes I've been teaching myself with online; I was hoping someone could help clear up my confusions - I have a few and much of the material is totally new to me so I'm very grateful for your patience in advance!
The notes are located at http://tartarus.org/gareth/maths/notes/iii/Local_Fields_2011.pdf - in particular, the proof on the bottom of page 16. The setup is as follows: $\mathfrak{p} \subset \mathcal{O}_K$ prime and $L / K$ a finite extension with $\mathfrak{p}\mathcal{O}_L = \mathcal{P}_1^{e_1}\ldots\mathcal{P}_r^{e_r},$ each $\mathcal{P}_i \subset \mathcal{O}_L$ prime, $f_i = [\mathcal{O}_L/\mathcal{P}_i : \mathcal{O}_K/\mathfrak{p}]$ (the "residue class degree"), then we want to prove that $\sum_{i=1}^r e_i\, f_i = [L:K]$.
Question 1: How do we actually know that $\mathcal{O}_L/\mathcal{P}_i$ is an extension of $\mathcal{O}_K/\mathfrak{p}$? I don't intiutively have a grasp of either of these, and since $\mathcal{P}_i$ divides $\mathfrak{p}$ in $\mathcal{O}_L$, it isn't obvious to me that $\mathcal{O}_L/\mathcal{P}_i$ should be 'bigger' than $\mathcal{O}_K/\mathfrak{p}$, let alone a field extension.
The first half of the proof aims to show that dim$_k (\mathcal{O}_L/\mathfrak{p}\mathcal{O}_L) = [L:K]$, where $k = \mathcal{O}_K/\mathfrak{p}$. To do so, we say:
i) By the Chinese Remainder Theorem, $\mathcal{O}_L / \mathfrak{p}\mathcal{O}_L \cong \mathcal{O}_L / \mathcal{P}_1^{e_1} \times \ldots \mathcal{O}_L/\mathcal{P}_r^{e_r}$. Any $x \in \mathcal{P}_i^a \backslash \mathcal{P}_i^{a+1}$ generates a quotient $\mathcal{P}_i^a / \mathcal{P}_i^{a+1}$ as an $\mathcal{O}_L$-module ("using properties of Dedekind domains")
Question 2: How do we know that $\mathcal{P}_i^a \backslash \mathcal{P}_i^{a+1} \neq \phi$? I'm fairly sure that this is something to do with the fact $\mathcal{P}_i$ is prime, but when I tried to prove it I couldn't get anywhere: is this correct? Also, is the deduction by "Use properties of Dedekind domains" particularly straightforward? It certainly wasn't obvious to me.
ii) Then dim$_{\mathcal{O}_L/\mathcal{P}_i}(\mathcal{P}_i^a/\mathcal{P}_i^{a+1})=1$ as a vector space (since generated by 1 vector). Then dim$_k(\mathcal{P}_i^a/\mathcal{P}_i^{a+1})=f_i$, and so dim$_k(\mathcal{O}_L/\mathcal{P}_i^{e_i})=e_i f_i$.
Question 3: How do we make this final deduction, that since dim$_k(\mathcal{P}_i^a/\mathcal{P}_i^{a+1})=f_i$, dim$_k(\mathcal{O}_L/\mathcal{P}_i^{e_i})=e_i f_i$? Are we using the tower law somehow? I can't see how this is deduced.
That's all my problems for now: sorry that this is such a lengthy "question", I'm aware that this isn't ideally suited to the single-problem nature of math.stackexchange, but I thought it would be a lot more efficient than setting up the same problem lots of times to ask similar questions. If there are any relevant books you can suggest which are simple and concise too then please don't hesitate to recommend.