0
$\begingroup$

Consider a matrix $A \in \mathbb{R}^{m\times n}$ with $m < n$. Suppose there always exists at least one matrix $B = [b_{ij}] \in \mathbb{R}^{n\times m}$ such that $AB = I$. How to choose the matrix $B$ such that $AB = I$ and $$\max_j \sum_i b_{ij}^2 $$ is minimised? It seems that the optimal $B^*$ is always the Moore–Penrose pseudoinverse of $A$ from trial and error in Matlab. Would someone kindly tell me why? Thank you very much!

  • 0
    Can you construct an example where there are two different matrices $B$ and $C$ such that $AB=I$ and $AC=I$?2017-01-02
  • 0
    For example, A = \begin{bmatrix} 1 & 1 & 1 & 0 & 0 & 0\\ 1 & 0 & 0 & 1 & 1 & 0\\ 0 & 1 & 0 & 1 & 0 & 1\\ 0 & 0 & 1 & 0 & 1 & 1 \end{bmatrix} $B = \frac{1}{6}\begin{bmatrix} 2 & 2 & -1 & -1\\ 2 & -1 & 2 & -1\\ 2 & -1 & -1 & 2\\ -1 & 2 & 2 & -1\\ -1 & 2 & -1 & 2\\ -1 & -1 & 2 & 2 \end{bmatrix}$ $C = \frac{1}{2}\begin{bmatrix} 1 & 1 & -1 & -1\\ 1 & -1 & 1 & -1\\ 0 & 0 & 0 & 2\\ -1 & 1 & 1 & 1\\ 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 \end{bmatrix}$2017-01-02
  • 0
    Then $AB = AC = I$ but $\sum_i b_{ij}^2 = [ 0.4167 \ \ 0.4167 \ \ 0.4167 \ \ 0.4167]$ and $\sum_i c_{ij}^2 = [ 0.75 \ \ 0.75 \ \ 0.75 \ \ 1.75]$2017-01-02

1 Answers 1

2

Since $AB=I$, the range of $A$ is $R^{m}$, and thus the rank of $A$ is $m$.

We can use the compact form of the SVD to write $A$ as

$A=U\Sigma V^{T}$

where $U$ is $m$ by $m$ and orthogonal, $\Sigma$ is $m$ by $m$, diagonal, and non-singular, and $V$ is $n$ by $m$ with orthonormal columns.

The Moore-Penrose pseudoinverse of $A$ is

$A^{\dagger}=V\Sigma^{-1}U^{T}$.

It's easy to show that $AA^{\dagger}=I$. Note that this is not true if $A$ has rank less than $m$.

Any matrix $B$ such that $AB=I$ can be written as

$B=A^{\dagger}+N$

where each column of $N$ is in the null space of $A$.

The columns of $A^{\dagger}$ are in the span of $V$ while the columns of $N$ are in the perpendicular complement of the span of $V$. Thus each column $j$ of $A^{\dagger}$ is perpendicular to the corresponding column of $N$.

$A^{\dagger}_{j} \perp N_{j}.$

By the Pythagorean theorem,

$\| A^{\dagger}_{j}+N_{j} \|_{2}^{2}= \| A^{\dagger}_{j}\|_{2}^{2}+\|N_{j} \|_{2}^{2}.$

Since this is true for all columns, you can minimize

$\max_{j} \sum_{i=1}^{n}B_{i,j}^{2}$

by using $B=A^{\dagger}$. Furthermore, by adding up the columns we get

$\| B \|_{F}^{2}=\| A^{\dagger} \|_{F}^{2}+\| N \|_{F}^{2}.$

Clearly, $B=A^{\dagger}$ minimizes $\| B \|_{F}^{2}$.

  • 0
    Thank you for your advice! You are so helpful!2017-01-03