Crowdly

Add to Chrome

Universities
stemlearn.sun.ac.za
Data Analytics (Eng) / Data Analitika (Ing) - 344

Data Analytics (Eng) / Data Analitika (Ing) - 344

Looking for Data Analytics (Eng) / Data Analitika (Ing) - 344 test answers and solutions? Browse our comprehensive collection of verified answers for Data Analytics (Eng) / Data Analitika (Ing) - 344 at stemlearn.sun.ac.za.

Get instant access to accurate answers and detailed explanations for your course questions. Our community-driven platform helps students succeed!

The graph provided below displays the distribution of each of the thirteen features. What three features will influence the distance calculation the most?

TOV

OREB

STL

AST

X3PA

FGA

100%

FTA

BLK

DRB

100%

PTS

100%

X3P

View this question

The result of a clustering algorithm can be used to perform classification.

Hierarchical clustering is usually defined as an optimization problem

Consider the dendrogram illustrated below. If a horizontal line is drawn at 3.5 how many clusters will be formed?

4.0						x	x	x	x	x
3.5						x				x
3.0				x	x	x	x			x
2.5				x			x			x
2.0		x	x	x	x		x			x
1.5		x			x		x		x	x	x
1.0	x	x	x		x		x		x		x
0.5	x		x		x		x		x		x
0	x		x		x		x		x		x
	A		B		C		D		E		F

View this question

Consider the proximity matrix provided below.

A B C D

A 0 3 2.2 1

B 3 0 1.41 3.16

C 2.2 1.41 0 2

D 1 3.16 2 0

What instances should be joined when average link is used?

Consider the proximity matrix provided below.

A B C D

A 0 3 2.2 1

B 3 0 1.41 3.16

C 2.2 1.41 1 2

D 1 3.16 2 0

Does the proximity matrix contain any errors?

Given two clusters, Cluster A containing the instances { (1, 2), (3, 4), (5, 6) } and Cluster B containing the instances { (7, 8), (9, 10), (11, 12) }, calculate the single link distance between these two clusters using Manhatten distance.

View this question

Is the following true for X-means clustering? When a cluster is selected to be split, it is best to select that cluster with the largest intra-cluster distance.

True

False

View this question

Given the data set D = {(3, 4), (2, 4), (8, 9)}, will k-medoids or k-means clustering produce a centroid that is least sensitive to outliers?

View this question

Which of the following best describes the steps involved in the k-means++ initialization process?

Randomly select all k centroids from the data points at once.

Choose centroids by minimizing the total distance from all data points to the centroids.

Select all centroids based on the average of all data points.

Select the first centroid randomly, then select subsequent centroids by choosing points that are farthest from the already chosen centroids.

100%

View this question

Want instant access to all verified answers on stemlearn.sun.ac.za?

Get Unlimited Answers To Exam Questions - Install Crowdly Extension Now!

Add to Chrome

Telegram Instagram TikTok Question Bank

Add to Chrome

4.0						x	x	x	x	x
3.5						x				x
3.0				x	x	x	x			x
2.5				x			x			x
2.0		x	x	x	x		x			x
1.5		x			x		x		x	x	x
1.0	x	x	x		x		x		x		x
0.5	x		x		x		x		x		x
0	x		x		x		x		x		x
	A		B		C		D		E		F

4.0						x	x	x	x	x
3.5						x				x
3.0				x	x	x	x			x
2.5				x			x			x
2.0		x	x	x	x		x			x
1.5		x			x		x		x	x	x
1.0	x	x	x		x		x		x		x
0.5	x		x		x		x		x		x
0	x		x		x		x		x		x
	A		B		C		D		E		F

4.0						x	x	x	x	x
3.5						x				x
3.0				x	x	x	x			x
2.5				x			x			x
2.0		x	x	x	x		x			x
1.5		x			x		x		x	x	x
1.0	x	x	x		x		x		x		x
0.5	x		x		x		x		x		x
0	x		x		x		x		x		x
	A		B		C		D		E		F