Reframing OpenCog Action Selection: Contextual Bandit Problems and Reinforcement Learning

I thought a bit today about how OpenCog’s action selector (based on the Psi model from Dietrich Dörner and Joscha Bach) relates to the approaches to action selection and behavior learning found in the reinforcement learning literature.
After some musing, …

