ques 1 consider a rational agent suppose the performance measure is co
Search for question
Question
Ques 1 Consider a rational agent. Suppose the performance measure is concerned with just the first T time steps of the environment and ignores everything thereafter. Use real life examples
to show that a rational agent’s action may depend not just on the state of the environment but also on the time step T it has reached. [10 points] Hint: Consider any sequential environment in which rewards or goals may take time to arrive ?