Crowdly

Add to Chrome

Imagine that you have a reinforcement learning policy obtained using Q-learning,...

✅ The verified answer to this question is available below. Our community-reviewed solutions help you understand the material better.

Imagine that you have a reinforcement learning policy obtained using Q-learning, and your policy is optimal for the NIM game. You execute this policy with the -greedy exploration where . Would this execution lead to the selection of incorrect actions by the algorithm in some situations? That is, would the policy suggest "irrational" actions in some states?

True

False

Want instant access to all verified answers on moodle.kent.ac.uk?

Get Unlimited Answers To Exam Questions - Install Crowdly Extension Now!

Add to Chrome

Telegram Instagram TikTok Question Bank