Given a matrix $A \in \mathbb{R}^{n×m}$, and its pseudoinverse $A^{\dagger} \in \mathbb{R}^{m×n}$, how can I calculate the following gradient:
$\frac{\partial \Vert {A}^{\dagger} \Vert_1 }{\partial A}$
where $\Vert \cdot \Vert_1$ is the entrywise 1-norm: $\vert\vert{ A}\vert\vert _1=\sum_{i=1}^n\sum_{j=1}^m \vert a_{ij}\vert $ ?
I have tried searching trough identities in the differential form, such as:
- $d(|A|) = |A|\operatorname {tr} (A ^{-1}d{A} )$ (which seems to be only defined for $A \in \mathbb{R}^{n×n}$ ?)
- $dy=\operatorname {tr} (B \,dA ) \iff \frac{dy}{dA} = B $
and use them to derive some results, but had no success when checking my expressions against numerical gradient.