Bakker, Bram (2002) Advantage(lambda) learning. Technical Report UNSPECIFIED