Crowdly

ITI0210 Tehisintellekti ja masinõppe alused (2025/26 sügis)

Looking for test answers and solutions for ITI0210 Tehisintellekti ja masinõppe alused (2025/26 sügis)? Browse our large collection of verified answers for ITI0210 Tehisintellekti ja masinõppe alused (2025/26 sügis) at moodle.taltech.ee.

Get instant access to accurate answers and detailed explanations for your course questions. Our community-built platform helps students succeed!

"Positional Encoding" in Transformers is needed because:

It increases the vocabulary

It encrypts the input

It compresses the data

It removes stop words

The model has no inherent sense of order/sequence (100%)

It acts as a heuristic
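
To make the point about order concrete, here is a minimal sketch of the sinusoidal positional encoding used in the original Transformer design. It is only one common scheme (learned position embeddings are another), and the function name and the sizes below are illustrative assumptions, not taken from the course material.

```python
import math

def sinusoidal_positional_encoding(seq_len, d_model):
    """Return a seq_len x d_model table of sinusoidal position encodings."""
    table = []
    for pos in range(seq_len):
        row = []
        for i in range(d_model):
            # Each pair of dimensions (2k, 2k+1) shares one frequency.
            angle = pos / (10000 ** (2 * (i // 2) / d_model))
            # Even dimensions use sine, odd dimensions use cosine.
            row.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
        table.append(row)
    return table

# Hypothetical sizes, chosen only for illustration.
pe = sinusoidal_positional_encoding(seq_len=4, d_model=8)
```

Each row is added to the token embedding at that position, so the same token at two different positions yields two different input vectors. Without such a signal, self-attention treats the input as an unordered set.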


In the "Cliff Walking" example above, Q-learning learns the Optimal Path (right along the edge of the cliff), while SARSA learns the Safer Path (farther away). Explain why this difference occurs based on their update equations.
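
One way to see the difference is to compare the bootstrapping targets of the two standard update rules. The sketch below assumes an epsilon-greedy behaviour policy; the state, the action names, and the numbers are hypothetical and do not come from the slides.

```python
def q_learning_target(reward, gamma, next_q_values):
    # Off-policy: bootstrap from the best next action,
    # regardless of which action the agent actually takes next.
    return reward + gamma * max(next_q_values.values())

def sarsa_target(reward, gamma, next_q_values, next_action):
    # On-policy: bootstrap from the action the behaviour policy
    # actually selected, including exploratory ones.
    return reward + gamma * next_q_values[next_action]

# Hypothetical next state on the cliff edge: "down" falls off the cliff.
next_q = {"right": -10.0, "down": -100.0}

q_target = q_learning_target(reward=-1.0, gamma=1.0, next_q_values=next_q)
# An exploratory step picked "down" this time.
s_target = sarsa_target(reward=-1.0, gamma=1.0, next_q_values=next_q,
                        next_action="down")
```

Because the Q-learning target always takes the maximum, occasional exploratory falls do not lower the value of edge states, so the greedy policy runs along the edge. SARSA averages over the actions actually taken, so edge states absorb the cost of those falls and the learned policy moves away from the cliff.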


You are designing a pathfinding agent for a grid-based maze where diagonal movement is allowed (cost ) and straight movement costs 1. You propose using the Manhattan Distance ($h(n) = |x_n - x_{goal}| + |y_n - y_{goal}|$) as a heuristic for the A* algorithm.

Task:

1. Determine if this heuristic is admissible. Prove your answer mathematically or by providing a counter-example.

2. Explain what happens to the optimality of the A* algorithm if we multiply this heuristic by a factor of 2 (i.e., use $2 \cdot h(n)$ as the heuristic).


Consider using reinforcement learning for controlling a robot with legs (e.g., a humanoid robot or a robot dog) for locomotion (i.e., moving from point A to point B). What could be the states of this learning system? What would the reward be?


In a Convolutional Neural Network (CNN), you have a input image and a filter (kernel). If you apply the filter with stride 1 (one pixel per step) and no padding, what is the dimension of the output feature map? Show the calculation.
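
The general calculation can be sketched as follows. The image and filter sizes from the question did not survive in this listing, so the 6x6 input and 3x3 filter below are purely hypothetical stand-ins; the formula is the standard one for a square input and a square kernel.

```python
def conv_output_size(input_size, kernel_size, stride=1, padding=0):
    """Side length of the feature map for a square input and square kernel."""
    return (input_size + 2 * padding - kernel_size) // stride + 1

# Hypothetical sizes: 6x6 input, 3x3 filter, stride 1, no padding.
side = conv_output_size(input_size=6, kernel_size=3, stride=1, padding=0)
# (6 - 3) / 1 + 1 = 4, so the feature map would be 4x4.
```

With stride 1 and no padding the formula reduces to N - F + 1 per dimension, where N is the input side and F the filter side.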


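Using the Q-learning update rule:

$$Q(s, a) \leftarrow Q(s, a) + \alpha \left[ r + \gamma \max_{a'} Q(s', a') - Q(s, a) \right]$$

Calculate the new $Q(s, a)$ given:

Current $Q(s, a)$

Learning rate $\alpha$

Discount factor $\gamma$

Reward received $r$

Next state $s'$ allows actions with Q-values: .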
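
The numeric values for this question are missing from the listing, so the sketch below uses hypothetical numbers only to show the mechanics of the standard Q-learning update, Q(s,a) ← Q(s,a) + α [r + γ max Q(s',a') − Q(s,a)]. The function and parameter names are illustrative.

```python
def q_learning_update(q_current, alpha, gamma, reward, next_q_values):
    """Apply one Q-learning update and return the new Q(s, a)."""
    best_next = max(next_q_values)           # max over actions in s'
    td_target = reward + gamma * best_next   # r + gamma * max Q(s', a')
    return q_current + alpha * (td_target - q_current)

# Hypothetical inputs, not the values from the original question.
new_q = q_learning_update(
    q_current=5.0, alpha=0.1, gamma=0.9, reward=10.0,
    next_q_values=[2.0, 8.0, 4.0],
)
# 5 + 0.1 * (10 + 0.9 * 8 - 5) = 5 + 0.1 * 12.2, approximately 6.22
```

Only the largest next-state Q-value enters the calculation; the other action values in the next state do not affect the result.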


You are designing a heuristic for a path-finding problem on a grid where diagonal movement is allowed and costs the same as horizontal/vertical movement (cost = 1). Would the Manhattan Distance be an admissible heuristic? Explain why or why not with a counter-example.
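
A small counter-example can be checked directly. The sketch below assumes an obstacle-free grid with 8-directional moves that all cost 1, as stated in the question; the coordinates are hypothetical. The Chebyshev distance is included as one heuristic that does stay admissible under these costs.

```python
def manhattan(a, b):
    return abs(a[0] - b[0]) + abs(a[1] - b[1])

def chebyshev(a, b):
    # Number of moves needed when a diagonal step costs the same as a straight one.
    return max(abs(a[0] - b[0]), abs(a[1] - b[1]))

start, goal = (0, 0), (1, 1)
true_cost = 1  # a single diagonal step reaches the goal

overestimates = manhattan(start, goal) > true_cost       # 2 > 1
still_admissible = chebyshev(start, goal) <= true_cost   # 1 <= 1
```

Since the Manhattan distance returns 2 while the true cost is 1, it overestimates and is therefore not admissible in this setting.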


What is the "Training Data Crisis" mentioned in the RL slides regarding Chess? Why did supervised learning fail for Chess before reinforcement learning?


If the discount factor $\gamma = 0$, the agent considers:

Only infinite future rewards

The average of all rewards

No rewards at all

The penalty only

Rewards 10 steps away

Only the immediate reward (100%)
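
The effect of the discount factor can be illustrated with the discounted return, the sum of γ^t · r_t over time steps. The sketch assumes the question refers to γ = 0, which is consistent with the marked answer; the reward sequence is hypothetical.

```python
def discounted_return(rewards, gamma):
    """Sum of gamma**t * r_t over the reward sequence."""
    return sum((gamma ** t) * r for t, r in enumerate(rewards))

rewards = [5.0, 100.0, 100.0]  # hypothetical reward sequence

myopic = discounted_return(rewards, gamma=0.0)       # only the first reward counts
undiscounted = discounted_return(rewards, gamma=1.0)  # all rewards count equally
```

With γ = 0 every term after the first is multiplied by zero, so the return equals the immediate reward alone.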


Bayes' Theorem allows us to swap:

Unions and intersections