Crowdly

Add to Chrome

Questions Bank (1358328 total)

Imagine that you have a reinforcement learning policy obtained using Q-learning, and your policy is optimal for the NIM game. You execute this policy with the -greedy exploration where . Would this execution lead to the selection of incorrect actions by the algorithm in some situations? That is, would the policy suggest "irrational" actions in some states?

True

False

View this question

In a standard set-up, the Transformer takes as input a matrix of word embeddings and returns a matrix of the same size as its output.

True

False

View this question

Pre-attentive

processing relates to how we accumulate information through visual features such as size or orientation, at a subconscious level (i.e. before we consciously pay attention to the visualisation).

What does the acronym MDP studied in this module stand for?

Markov Discovery Process

❌

Markov Decision Process

✅

Markov Deception Process

❌

View this question

The image below shows a simple visualisation of a GPT.

GPT

All the other answers are incorrect.

❌

The input token <start> is useful to make the learning process more efficient because the entire sequence can be presented to the Transformer in one step.

✅

The input token <start> is typographical error, and it does not have any special mining.

❌

The input token <start> is not required when positional encoding is used.

❌

View this question

The supply curve of a product is based on

Government Regulations

Consumer theory

Market Structure

Producer Theory

View this question