You are building a neural network that uses the tanh activation function. Which weight initializer is most suitable to maintain stable gradients during training?
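
The initializer usually recommended for tanh layers is Xavier (Glorot) initialization, which scales the weights by both fan-in and fan-out so that the signal variance stays roughly constant from layer to layer. The following is a minimal NumPy sketch of that idea, not a framework implementation: the helper names (`glorot_uniform`, `small_normal`, `activation_stds`) and the layer sizes are assumptions chosen for illustration. It compares Glorot-uniform weights against a naive small-Gaussian initializer by pushing random inputs through a stack of tanh layers.

```python
import numpy as np

def glorot_uniform(fan_in, fan_out, rng):
    """Sample weights from U(-limit, limit), limit = sqrt(6 / (fan_in + fan_out))."""
    limit = np.sqrt(6.0 / (fan_in + fan_out))
    return rng.uniform(-limit, limit, size=(fan_in, fan_out))

def small_normal(fan_in, fan_out, rng):
    """Naive baseline (hypothetical): Gaussian weights with a fixed std of 0.01."""
    return 0.01 * rng.standard_normal((fan_in, fan_out))

def activation_stds(init_fn, width=256, depth=10, batch=512, seed=0):
    """Return the std of the tanh activations after each of `depth` layers."""
    rng = np.random.default_rng(seed)
    h = rng.standard_normal((batch, width))  # random input batch
    stds = []
    for _ in range(depth):
        h = np.tanh(h @ init_fn(width, width, rng))
        stds.append(float(h.std()))
    return stds

glorot_stds = activation_stds(glorot_uniform)
small_stds = activation_stds(small_normal)

print("Glorot uniform, last layer std:", glorot_stds[-1])
print("Small normal,   last layer std:", small_stds[-1])
```

With the Glorot weights the activations keep a usable spread through all ten layers, while the fixed-0.01 Gaussian shrinks them toward zero, which is the kind of vanishing signal that also starves the gradients. In practice you would rely on a framework's built-in initializer rather than a hand-written one.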