Szemeredi's regularity lemma states that for every $\epsilon >0$ there is $M\in \Bbb N$ such that all graphs have an equipartition $\{V_1,...,V_k\}$ for some $k$ satisfying $\frac 1\epsilon \leq k \leq M$ such that at most $\epsilon k^2$ of the pairs $(V_i,V_j)$ are not $\epsilon$-regular.
By an equipartition we mean that the sets are as equal as possible, i.e. $||V_i|-|V_j||\leq 1$, and by an $\epsilon$-regular pair $(V_i, V_j)$ we mean that for every $A\subseteq V_i, B\subseteq V_j$ with $|A| \geq \epsilon |V_i|, |B| \geq \epsilon |V_j|$ we have that $|d(V_i,V_j)-d(A,B)|<\epsilon$, where $d(X,Y)$ denotes the density of the bipartite graph on $X,Y$.
Deduce from the regularity lemma that we can even require that every $V_i$ will have at most $\epsilon k$ irregular pairs $(V_i,V_j)$, which strengthens the traditional formulation above (in which we only claim that there are at most $\epsilon k^2$ irregular pairs).
This fact is referred to as a well-known (and easy) observation right before theorem 2 in this paper. It is claimed explicitly this holds when we take $M^3$ instead of $M$ in the statement.
This is also given as problem 2 here.
I'd like to know why this is true.