Let $F(x,y)$ be a function in $R \times R \rightarrow R$.
In this case: $\nabla F = (\frac{\partial}{\partial x} , \frac{\partial}{\partial y})^T$.
Well, let's suppose that $\nabla F$ at some point $(a,b)$ $= (n, -n)^T$. In this case, that would intuitively mean:
- When we add $\epsilon$ to $x$ then $F(x,y)$ increases by n
- When we add $\epsilon$ to $y$ then $F(x,y)$ decreases by n
So, it would intuitively mean that by adding this $\epsilon$ amount, then $F(x+\epsilon,y+\epsilon)$ stays at the same value, right?
However, when we take the magnitude of this gradient vector, we get a different value other than 0. Isn't the value of the magnitude supposed to be 0 to match the intuition (meaning that the function would neither increase nor decrease when we move a little bit along its variables)?