Add to Chrome
✅ The verified answer to this question is available below. Our community-reviewed solutions help you understand the material better.
What is the purpose of discount factor (γ) in reinforcement learning?
To penalize long episodes
To prioritize short-term rewards over long-term rewards
To balance exploration and exploitation
To weigh the importance of future rewards
Get Unlimited Answers To Exam Questions - Install Crowdly Extension Now!