The hinge loss function (summed over $m$ examples):
$$ l(w)= \sum_{i=1}^{m} \max\{0 ,1-y_i(w^{\top} \cdot x_i)\} $$
My calculation of the subgradient for a single component and example is:
$$ l(z) = \max\{0, 1 - yz\} $$ $$ l^{\prime}(z) = \max\{0, - y\} $$ $$ g(w) = w \cdot x $$ $$ g^{\prime}(w) = x $$ $$ \frac{\partial l}{\partial z}\frac{\partial g}{\partial w} = \max\{0 \cdot x, - y \cdot x\} = \max\{0, - yx\} $$
For vectors:
$$ l^{\prime}(w) = \sum_{i=1}^{m} \max\{0 ,-(y_i \cdot x_i)\} $$
But the answer I have been given is:
I don't understand this notation. Have I arrived at the same solution, and can someone explain the notation?
