0
$\begingroup$

My general question is on what to do with a scalar result from a partial derivative.

Suppose a column vector $x$, and a function $f$ which accepts $x$ and returns a scalar. (e.g. proposition 8). Then suppose that in the process of finding $\frac{\partial f }{\partial x} $, that you take the partial derivative with respect to $x_k$ to get the following, $\frac{\partial f }{\partial x_k} = 2x_k $

so then, what is the derivative with respect to the entire vector $x$ (keeping in mind this is a hypothetical example and not the same as prop. 8)? I am not sure whether the the following result would be correct, and whether the result should a row or column vector. $\frac{\partial f }{\partial x} = 2x^T $

What if the function $f$ here returns a vector or matrix, then is the answer any different here?

  • 0
    @ChristianBlatter here I mean for $\partial f / \partial x_k$ to hold for any $x_k$ in the vector, and also renamed it to $f$ since I now realize $\alpha$ by convention is a scalar constant instead of a function per se.2012-07-10

1 Answers 1

1

It's depends on your definition, but usually the derivative of a scalar with respect to a vector is defined as $\frac{\partial \alpha}{\partial\mathbf{x}} \equiv [\frac{\partial \alpha}{\partial x_1},\cdots, \frac{\partial \alpha}{\partial x_n}]^T $ where $\mathbf{x}=[x_1, \cdots, x_n]^T$.

For your question, since $\mathbf{x}$ is a column vector and $\frac{\partial \alpha}{\partial x_k} = 2x_k$ holds for all $k$, then $\frac{\partial \alpha}{\partial\mathbf{x}} = 2\mathbf{x}$, i.e. it's still a column vector by the above definition. (Of course, it can be a row vector by another definition.)

If $\alpha$ is a vector or matrix, then the result is a matrix or three-rank tensor, respectively. You can find more information at here and here or google "matrix calculus".

  • 0
    @chohuang, so what I don't understand is why at (45) of proposition (8) (cf URL), we go from a scalar expression $\partial \alpha/\partial x_k$, to an expression (46) $x^T$ instead of the $n \times 1$ vector $x$2012-07-10