Матеріали курсу Reinforcement Learning - Fall 2025 та відповіді на тести | elearning.aua.am

When Monte Carlo methods can not be applied? (Select all that apply)

When the problem is continuing and given a batch of data containing sequences of states, actions, and rewards

✅

When the problem is continuing and there is a model that produces samples of the next state and reward

✅

When the problem is episodic and there is a model that produces samples of the next state and reward

❌

When the problem is episodic and given a batch of data containing sample episodes (sequences of states, actions, and rewards)

❌

Переглянути це питання

Crowdly