Looking for Data Analytics (Eng) / Data Analitika (Ing) - 344 test answers and solutions? Browse our comprehensive collection of verified answers for Data Analytics (Eng) / Data Analitika (Ing) - 344 at stemlearn.sun.ac.za.
Get instant access to accurate answers and detailed explanations for your course questions. Our community-driven platform helps students succeed!
As part of the feature identification phase; you identified written maintenance reports as a potential descriptive feature to include. To include the written maintenance reports in the analytics base table, you created four descriptive features, indicating whether the word was present in the report or not.
ID | noisy | tear | wear | the | Broke down at t + 1 |
1 | 0 | 1 | 1 | 1 | Yes |
2 | 0 | 0 | 0 | 1 | No |
3 | 1 | 1 | 1 | 1 | Yes |
4 | 1 | 1 | 1 | 1 | Yes |
5 | 0 | 0 | 0 | 1 | No |
Which of the columns are irrelevant?
Suppose that we have four descriptive features to train our predictive model. To determine which features to select we decide to (i) train a model on each feature individually and (ii) calculate the Pearson correlation between each feature.
The accuracy obtained for each model where 0.9, 0.85, 0.7 and 0.6 respectively, while the correlations between features were as follows:
| D1 | D2 | D3 | D4 |
D1 | 1 | 0.9 | 0.2 | 0.2 |
D2 |
| 1 | 0.6 | 0.2 |
D3 |
|
| 1 | 0.8 |
D4 |
|
|
| 1 |
Given the information, which features would you select if you can only select two feature?
Suppose that we have four descriptive features to train our predictive model. To determine which features to select we decide to (i) train a model on each feature individually and (ii) calculate the Pearson correlation between each feature.
The accuracy obtained for each model where 0.9, 0.85, 0.7 and 0.6 respectively, while the correlations between features were as follows:
| D1 | D2 | D3 | D4 |
D1 | 1 | 0.9 | 0.2 | 0.2 |
D2 |
| 1 | 0.6 | 0.2 |
D3 |
|
| 1 | 0.8 |
D4 |
|
|
| 1 |
Given the information, which features would you select if you can only select a single feature?
A binary classification model can be used to predict different fault types.
Multi-class classification is used when: