Each cat can perform 4 actions: walk, jump, short jump, and turn around. Every time they touch a yellow circle they must pick one of those actions. The actions "walk" and "short jump" will put them in the lava, "turn around" and "jump" will save them. Over time, each cat's AI will learn to choose actions that keep them out of the lava. If the cat touches the lava, it resets. That action loses the amount in Penalty. If it survives (touches the wall), it learns to prefer that action a little more. That action gains the amount in Reward. See what happens!
Each cat will learn a different approach. The numbers shown are for the blue cat. The timer is how long it's been since a cat fell into the pit. The IQ is the highest value of the timer so far. Download and reattach the sound file in the Background for further explanation.