How do I use chain rule to calculate the derivative of $ h(\textbf{x}) $, where $ h(\textbf{x}) = f(\textbf{Sx}), f:R^n \to R$ and $ \textbf{S}$ is a matrix.
I know how to use chain rule to compute derivative of single variable functions, and I know basic operations on matrix and vectors. But I'm not sure how to use chain rule on matrix functions. Any reference will be appreciated.