logo

Crowdly

Browser

Add to Chrome

BERT masks 15% of tokens during pre-training, but only uses [MASK] for 80% of t...

✅ The verified answer to this question is available below. Our community-reviewed solutions help you understand the material better.

BERT masks 15% of tokens during pre-training, but only uses [MASK] for 80% of them. What happens to the other 20%?
0%
0%
0%
More questions like this

Want instant access to all verified answers on vns.itstep.edu.ua?

Get Unlimited Answers To Exam Questions - Install Crowdly Extension Now!

Browser

Add to Chrome