Legg, Shane and Hutter, Marcus (2004) Ergodic MDPs Admit Self-Optimising Policies. Technical Report UNSPECIFIED