KV RA Research Assistant

The critic signal combines 1) immediate reinforcement information with 2) changes in value to produce what is sometimes called a reward-prediction error signal.

Montague, Read — Why Choose This Book? How We Make Decisions · Card #9
Quotes are taken from the research journals of Kyle Vanderburg.