Looking for Data Analytics (Eng) / Data Analitika (Ing) - 344 test answers and solutions? Browse our comprehensive collection of verified answers for Data Analytics (Eng) / Data Analitika (Ing) - 344 at stemlearn.sun.ac.za.
Get instant access to accurate answers and detailed explanations for your course questions. Our community-driven platform helps students succeed!
What is the proportion of deliveries that arrived on time in the given dataset?
Use rpart to build a decision tree based on the dataframe created from the given dataset. Train the model on only the descriptive features, but excluding ID. Set the parameters as follows: complexity parameter should be 0.01, the minimum split size should be 2, the minimum leaf size should be 1, and the maximum tree depth should be 5. How many nodes does the resulting decision tree have?
Consider the SOM shown below which was trained on an RGB data set using a sigma of 10.
To improve the result:
Consider the SOM presented below:
The bottom left corner represents
When using a self-organizing feature map for data imputation, which of the following approaches will provide a more accurate estimate for a missing value?
SOM training is robust to missing values.
Consider the SOM shown below which was trained on an RGB data set using a learning rate of 0.9.
To improve the results:
Consider a 5 by 5 SOM with the weights provided below. Given that the winning neuron is at position (2,2), with a learning rate of 1, a sigma value of 2, and using a Gaussian neighbourhood function, what is the value of h for the instance 4,4? Round your answer to three decimals.
1 (244,179,201) (186,42,57) (252,73,127) (63,137,134) (21,238,33)2 (102,139,71) (27,82,98) (135,55,147) (87,30,55) (175,166,238)3 (133,0,143) (43,88,249) (44,82,61) (146,129,216) (208,23,70)4 (251,175,27) (233,84,244) (35,53,64) (159,169,126) (169,19,212)5 (131,154,160) (64,174,199) (117,171,100) (106,215,221) (125,103,7)
The table below illustrates a self-organising map of size 4 by 4 with 1 weight per processing unit. Given an instance with a value of 2, what column contains the best matching unit?
1 2 3 4
1 1.2 1.4 2.4 3.0
2 1.4 2.2 2.6 3.2
3 1.8 2.7 2.6 3.4
4 2.0 2.8 2.8 3.5
Consider the self-organising map below, which shows the h values calculated using a Gaussian neighbourhood function. Which instance is the best matching unit? Provide your answer in the format x,y, where x represents the row and y represents the column. Do not include any spaces.
1 2 3
1 1.000 0.607 0.135
2 0.607 0.368 0.082
3 0.135 0.082 0.018