✅ The verified answer to this question is available below. Our community-reviewed solutions help you understand the material better.
Which of the following is a requirement on the behavior policy b for using off-policy Monte Carlo policy evaluation? This is called the assumption of coverage.