The gradient is usually written as the product of the unit vectors times the derivative with respect to that coordinate. In Einstein summation convention:
$\hat e_i \partial_i$
I've seen it written as so in some places.
Is this wrong and is one of them supposed to be a contravariant vector, because otherwise it won't transform as a tensor between coordinate system?