Consider the classic Vehicle Routing Problem.
In this paper, the author shows how to optimally partition an optimal TSP tour, into feasible routes. In other words he describes a "route-first cluster-second" approach for the problem, where the clustering part is optimal.
On page $2$, it is written:
Note that it is easily shown that an optimal TSP tour followed by an optimal set of vehicle routes does not necessarily lead to an optimal set of vehicle routes.
I am trying to come up with a very simple example where this is true. Can anyone help?