I'm trying to determine the solution (optimal policy) to the standard optimal control problem with a small variation described fully as under: \begin{equation} \min_{\bf{\pi}} \, \mathbb{E_0}[\sum_{t=0}^{N-1}f(x_t, u_t,\epsilon_t)] \end{equation} subject to:
\begin{equation} x_{k+1} = g(x_k,u_k,\epsilon_k)\forall k \\ \sum_{t=0}^{N-1}u_t=K \end{equation} Here, the value of state variable $x_0$ and $K$ is known and $\epsilon_k$ is a random variable with known distribution and we're looking for optimal policy vector of functions ${\pi}=\{\mu_1(x_1),\mu_2(x_2),\mu_3(x_3)\ldots \mu_{N-1}(x_{N-1})\}$
I have one solution in my mind but I fear that it is computationally inefficient therefore I'm looking for suggestions to come with alternate possibilities. Any help in this matter would be highly appreciated.