According to the post the second term disappeared because the expected gradient is not affected by the baseline b but since the expectation is not with respect to b, I wonder why that will play a role in disappearing the second term. Instead I think it is because expectation is taken over the same distribution twice and since the first expectation will lead to a number the second one will lead it to 0.
Is my reasoning correct?
