My problem setup is the following: states in MCTS are described by a sequence of features, each taking a value from the set {0, 1}. For every state I have visited, I get a real-valued score. I want to 'learn' a function that gives me an estimate of that score for any given state.
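To make the setup concrete, here is a minimal sketch of the kind of data I have. The names `observed`, `is_valid_state`, and `lookup`, as well as all the numbers, are made up for illustration and are not from my actual code.

```python
from typing import Dict, Tuple

# A state is a fixed-length tuple of binary features (hypothetical encoding).
State = Tuple[int, ...]

# Hypothetical observations: state -> value seen during search (made-up numbers).
observed: Dict[State, float] = {
    (0, 1, 1, 0): 0.75,
    (1, 1, 0, 0): -0.20,
    (0, 0, 0, 1): 0.10,
}


def is_valid_state(s: State) -> bool:
    """Return True if every feature of s is 0 or 1."""
    return all(x in (0, 1) for x in s)


def lookup(s: State) -> float:
    """Exact table lookup; raises KeyError for a state that was never visited."""
    return observed[s]
```

A plain table like this only covers states I have already seen, which is why I am looking for an estimator that generalizes to the rest.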
I wanted to use gradient descent for that purpose, but I came to realize that it may not be the right way to do it, because I'm actually working with a function on a discrete domain. $$f(s) \in \mathbb{R}, \quad s \in \{0,1\}^n$$
Since f is defined on a discrete set rather than a continuous one, I cannot take the derivative of f with respect to the state s.
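To spell out my reasoning as I understand it: the partial derivative with respect to the i-th feature would be the limit below, where e_i denotes the i-th unit vector (my notation).

$$\frac{\partial f}{\partial s_i}(s) = \lim_{h \to 0} \frac{f(s + h\,e_i) - f(s)}{h}$$

But s + h e_i lies in {0,1}^n only for h = 0 and for exactly one of h = +1 or h = -1, so h cannot approach 0 through valid states and the limit is not defined. The closest thing I can write down is a finite difference, where s^{(i)} is s with its i-th feature flipped (again my notation):

$$\Delta_i f(s) = f\left(s^{(i)}\right) - f(s), \qquad s^{(i)}_j = \begin{cases} 1 - s_j & \text{if } j = i \\ s_j & \text{otherwise} \end{cases}$$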
So my question is: am I right? If not, why not? If yes, could you point me to some references that back up what I stated here? They should be something other than Wikipedia links, since I need to present this to my mentor.
Thank you!