Crowdly

Додати до Chrome

Imagine that you have a reinforcement learning policy obtained using Q-learning,...

✅ Перевірена відповідь на це питання доступна нижче. Наші рішення, перевірені спільнотою, допомагають краще зрозуміти матеріал.

Imagine that you have a reinforcement learning policy obtained using Q-learning, and your policy is optimal for the NIM game. You execute this policy with the -greedy exploration where . Would this execution lead to the selection of incorrect actions by the algorithm in some situations? That is, would the policy suggest "irrational" actions in some states?

True

False

Більше питань подібних до цього

Хочете миттєвий доступ до всіх перевірених відповідей на moodle.kent.ac.uk?

Отримайте необмежений доступ до відповідей на екзаменаційні питання - встановіть розширення Crowdly зараз!

Додати до Chrome

Telegram Instagram TikTok Question Bank

Умови використання Зв'яжіться з нами