2004-07-28
Adaptron Test Run-1 28th July, 2004
3 state maze with thinking off. I’ve added good & bad to decision making of what to do next and there is no back propogation of interest resulting from reward. No collapsing of R-Habits because never repeats two in a row.
With thinking turned on as long as it continues to get rewarded it repeats behaviour.