Question

Ques 1. Consider a rational agent. Suppose the performance measure is concerned with just the first T time steps of the environment and ignores everything thereafter. Use real-life examples to show that a rational agent's action may depend not just on the state of the environment but also on the time step it has reached. [10 points]

Hint: Consider any sequential environment in which rewards or goals may take time to arrive.
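One way to see the effect the hint points at is a toy finite-horizon problem, sketched below. It is an illustration under stated assumptions, not the expected answer: the agent is always in the same state and chooses between a hypothetical "small" action (reward 1 immediately) and a hypothetical "big" action (reward 5, but it takes 3 steps to pay off, like starting a long project before a deadline). The function name `plan` and all reward values are made up for the example. Backward induction over the T steps shows the best action changing with the time step even though the state never changes.

```python
def plan(T, delay=3, big=5, small=1):
    """Backward induction for a toy finite-horizon problem (illustrative values).

    V[t] is the best total reward collectable from step t up to the horizon T.
    Rewards that would arrive after step T are ignored by the performance measure.
    """
    V = [0] * (T + 1)          # V[T] = 0: nothing counts after the horizon
    policy = [None] * T
    for t in range(T - 1, -1, -1):
        # "small": reward now, then continue from the next step.
        take_small = small + V[t + 1]
        # "big": pays off only if it finishes within the horizon.
        take_big = big + V[t + delay] if t + delay <= T else 0
        # Ties are broken in favour of "big" (an arbitrary choice).
        if take_big >= take_small:
            policy[t], V[t] = "big", take_big
        else:
            policy[t], V[t] = "small", take_small
    return policy, V


if __name__ == "__main__":
    policy, V = plan(T=10)
    for t, action in enumerate(policy):
        print(f"step {t}: {action}")
```

With T = 10, the sketch picks "big" while there is still time for it to pay off and switches to "small" in the last two steps, where the delayed reward would land after the horizon. The environment state is identical at every step, so the change in action is driven purely by the time step.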

Fig: 1