Consider a dynamical system $\dot x = f(x,u)$ where $x$ is the state and $u$ is a control input. We are interested in moving the system state from one value to another in finite time, say $[0, T]$. At the same time we want to minimise a scalar function $\int_0^T L(x,u) dt$
The Pontryagin minimum principle address this very need. First, we express the constraint optimization problem in one equation, using Lagrange multipliers: $H(x,u,\lambda) = \lambda f (x,u) + L(x,u)$.
After this point, some textbooks I have consulted and even the wikipedia article present the reader with five functions that when solved will provide the optimal trajectory ($x,u,\lambda$) for the system.
When comparing these five equations with the Hamiltonian mechanics equations it is apparent that the Lagrange multiplier $\lambda$ play the role of the momentum in Hamiltonian mechanics.
I am curious to know, how can one interpret the Lagrange multipliers, which are essentially the marginal cost of a constraint violation, as momentum?