I tried to google for OOMP and I only got two hits. The first was for this page and the other for a paper on computer vision. I tried to skim that article, but probably I still have serious misunderstandings. Anyway, the problem I have in mind is the following. Using a 1-dimensional space to keep it simple. Assume that the domain is the interval $[0,9]$, and that the OOMP output is $f=1*\chi_{[1,2]}+2*\chi_{[4,8]}$, [Edit] in other words, there are two boxes: the interval $[1,2]$ and the interval $[4,8]$. [/Edit] This has a mean value of 1. I first assumed that removing the DC component amounts to subtracting the constant 1 from this function, but that would give us a function $f-1=-1*\chi_{[0,1)}-1*\chi_{(2,4)}+1*\chi_{[4,8]}-1*\chi_{(8,9]}$ and we necessarily create new boxes while doing that, because wherever the value was zero, there is a now a -1.
A way of getting a zero mean function out of this without modifying the set of boxes would be to subtract the mean of the restriction of $f$ to the union of the boxes. Here the union of boxes has measure 5 and the average of $f$ restricted to the union is $9/5$. So, if we do this subtraction business we get $ f-\frac95\,\chi_{[1,2]\cup[4,8]}=-\frac45\,\chi_{[1,2]}+\frac15\,\chi_{[4,8]}. $ The reason I think that this latter way of killing the DC-component is bad is that this changes the relative order of the components. The OOMP output marked the box $[1,2]$ as having something above the 'background'. If we subtract a constant only in the union of the boxes, then this box gets a negative sign indicating a level below the background. This is what I meant by saying that the quality of the approximation is now worse: In the computer vision paper the two boxes could in function $f$ mean that those two regions are both noted for being lighter than the surroundings. If the DC-component is removed by subtracting a constant function with support = the union of the boxes, then after this subtraction one of the original lighter regions is now hit with a negative number, and would be interpreted as darker than the surroundings. Ideally we might want the box generation algorithm itself to mark off both darker and lighter regions.
Here's the general formula that I used in the striked example (the latter, `suspicious' version, where no changes to the boxes are done). Let $\chi_{E_i}$ be the characteristic function of box $b_i$. Then the integral of $f$ over the union of the boxes is $\int f=\sum_i \alpha_i m(E_i),$ where $m(E_i)$ is the measure of the box $b_i$. A weighted measure of the union (=overlapping counted many times) is $m=\sum_i m(E_i)$. Then the function $ \tilde{f}=\sum_i(\alpha_i-\frac{\int f}m)b_i $ has zero mean. This you can then normalize to have unit $L^2$-norm in the usual way. Note you need to first replace $f$ with $\tilde{f}$ and then normalize. Not the other way around, because the $L^2$-norm goes down, when passing from $f$ to $\tilde{f}$. OTOH scaling the norm scales the mean value, so it won't reproduce a DC-component, if one didn't exist.
If you can spare the cycles, try this, and see if the results are ok. In the image-processing application the DC-component is a general grey level value, and you would have to take that into account outside the boxes also. Your application is hopefully very different.