Can someone explain what Parsimony is in the context of probability, more specifically in Parsimonious Markov models?
I have been trying to search around a simple explanation of this but I only seem to be getting domain-specific papers in biology etc. which assume the reader already know what it means.