Deck 20: Cluster Analysis

Full screen (f)
exit full mode
Question
Choice of a clustering method and choice of a distance measure are interrelated.
Use Space or
up arrow
down arrow
to flip the card.
Question
Clustering should be done on samples of at least 300 or more.
Question
The primary objective of cluster analysis is to classify objects into relatively homogeneous groups.
Question
Nonhierarchical clustering is faster than hierarchical methods.
Question
In the TwoStep procedure,the euclidean measure can be used only when all of the variables are ordinal.
Question
In cluster analysis,the set of variables selected should describe the similarity between objects in terms that are relevant to the marketing research problem.
Question
The centroid method is a variance method of hierarchical clustering in which the distance between two clusters is the distance between their centroids (means for all the variables).
Question
The parallel threshold method differs from the other two non-hierarchical clustering procedures in that the objects can later be reassigned to clusters to optimize an overall criterion.
Question
Cluster analysis does not classify variables as dependent or independent.
Question
Most clustering methods are relatively complex procedures that are supported by an extensive body of statistical reasoning.
Question
The complete linkage method of hierarchical clustering is based on the minimum distance or the nearest neighbor approach.
Question
Use of different distance measures may lead to different clustering results.Hence,it is advisable to use different measures and compare the results.
Question
The dendrogram is read from right to left.
Question
Cluster analysis is the obverse of factor analysis in that it reduces the number of objects,not the number of variables,by grouping them into a much smaller number of clusters.
Question
The TwoStep cluster analysis procedure can automatically determine the optimal number of clusters by comparing the values of a model-choice criteria across different clustering solutions.
Question
Cluster analysis requires prior knowledge of the cluster or group membership for each object or case included to develop the classification rule.
Question
In cluster analysis,objects with larger distances between them are more similar to each other than are those at smaller distances.
Question
Measuring similarity in terms of distance between pairs of objects is the most common approach used in cluster analysis for grouping similar objects together.
Question
If cluster analysis is used as a general data reduction tool,subsequent multivariate analysis can be conducted on the clusters rather than on the individual observations.
Question
The average linkage method of hierarchical clustering is preferred to the single and complete linkage methods.
Question
In hierarchical clustering,the solution may depend on the order of cases in the data set.
Question
Formal procedures for assessing the reliability and validity of clustering are simple and should be undertaken.
Question
The ________ are the initial starting points in nonhierarchical clustering.

A) factor loadings
B) factor scores
C) cluster centers
D) cluster centroids
Question
The most important part of ________ is selecting the variables on which clustering is based.

A) interpreting and profiling clusters
B) assessing the validity of clustering
C) selecting a clustering procedure
D) formulating the clustering problem
Question
One method of assessing reliability and validity of clustering is to use different methods of clustering and compare the results.
Question
It is helpful to profile the clusters in terms of variables that were not used for clustering.
Question
To reduce the number of variables,a large set of variables can often be replaced by the set of cluster components.
Question
It is possible to obtain information on cluster membership of cases via the icicle plot if the number of clusters is specified.
Question
Which statement is not true about cluster analysis?

A) Objects in each cluster tend to be similar to each other and dissimilar to objects in the other clusters.
B) Groups or clusters are suggested by the data,not defined a priori.
C) Cluster analysis is also called classification analysis or numerical taxonomy.
D) Cluster analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the independent variables are interval in nature.
Question
Cluster analysis has been used in marketing for all of the purposes below except ________.

A) determining how strongly sales are related to advertising expenditures
B) identifying new product opportunities by clustering brands and products so that competitive sets within the market can be determined
C) segmenting the market based on benefits sought from the purchase of a product
D) selecting test markets
Question
Clustering should be done on samples of ________ or more.

A) 50
B) 100
C) 200
D) 300
Question
Principal components are usually easier to interpret than the cluster components.
Question
In non-hierarchical clustering,the F test is only descriptive.Because the cases or objects are systematically assigned to clusters to maximize differences on the clustering variables,the resulting probabilities should not be interpreted as testing the null hypothesis of no differences among clusters.
Question
A ________ is a lower-triangle matrix containing pairwise distances between objects or cases.

A) similarity/distance coefficient matrix
B) classification matrix
C) factor matrix
D) correlation matrix
Question
Most ________ methods are heuristics based on algorithms.

A) analysis of variance
B) discriminant analysis
C) factor analysis
D) clustering
Question
The centroids represent the mean values of the objects contained in the cluster on each of the variables.
Question
Which method of analysis does not classify variables as dependent or independent?

A) cluster analysis
B) regression analysis
C) analysis of variance
D) discriminant analysis
Question
When cluster analysis is also used for clustering variables to identify homogeneous groups,the units used for analysis are the variables and the distance measures are computed for all pairs of variables.
Question
________ is a class of techniques used to classify objects or cases into relatively homogeneous groups.

A) Common factor analysis
B) Cluster analysis
C) Principal components analysis
D) Conjoint analysis
Question
A(n)________ or tree graph is a graphical device for displaying clustering results.Vertical lines represent clusters that are joined together.The position of the line on the scale indicates the distances at which clusters were joined.

A) scattergram
B) scree plot
C) icicle diagram
D) dendrogram
Question
________ is a clustering procedure characterized by the development of a tree-like structure.

A) Optimizing partitioning clustering
B) Non-hierarchical clustering
C) Hierarchical clustering
D) TwoStep clustering
Question
________ is a clustering procedure where each object starts out in a separate cluster.

A) Non-hierarchical clustering
B) Divisive clustering
C) Agglomerative clustering
D) Hierarchical clustering
Question
________ is a procedure that first assigns or determines a cluster center and then groups all objects within a pre-specified threshold value from the center.

A) Divisive clustering
B) Non-hierarchical clustering
C) Ward's method
D) Agglomerative clustering
Question
The ________ is a nonhierarchical method in which a cluster center is selected and all objects within a pre-specified threshold value from the center are grouped together.

A) Ward's procedure
B) optimizing partitioning method
C) sequential threshold method
D) parallel threshold method
Question
The ________ is a nonhierarchical method that allows for later reassignment of objects to clusters to optimize an overall criterion.

A) Ward's procedure
B) optimizing partitioning method
C) parallel threshold method
D) sequential threshold method
Question
________ is a variance method in which the squared euclidean distance to the cluster means is minimized.

A) Optimizing partitioning method
B) Parallel threshold method
C) Ward's procedure
D) Sequential threshold method
Question
The ________ method is based on minimum distance or the nearest neighbor rule.

A) medium linkage
B) complete linkage
C) average linkage
D) single linkage
Question
________ is a clustering procedure where all objects start out in one giant cluster.Clusters are formed by dividing this cluster into smaller and smaller clusters.

A) Divisive clustering
B) Agglomerative clustering
C) Hierarchical clustering
D) Non-hierarchical clustering
Question
Which of the following is not a disadvantage of nonhierarchical clustering procedures?

A) The number of clusters must be pre-specified.
B) The procedures do not work well when the clusters are poorly defined.
C) The selection of cluster centers is arbitrary.
D) All of the above are disadvantages.
Question
The most commonly used measure of similarity is the ________ or its square.

A) Chebychev's distance
B) Manhattan distance
C) euclidean distance
D) city-block distance
Question
Which cluster analysis procedure can automatically determine the optimal number of clusters by comparing the values of a model-choice across different clustering solutions?

A) Ward's method
B) divisive
C) TwoStep
D) sequential threshold
Question
________ is frequently referred to as k-means clustering.

A) Non-hierarchical clustering
B) Ward's method
C) Agglomerative clustering
D) Divisive clustering
Question
The ________ is a nonhierarchical method that specifies several cluster centers at once.All objects within a pre-specified threshold value from the center are grouped together.

A) optimizing partitioning method
B) Ward's procedure
C) parallel threshold method
D) sequential threshold method
Question
The ________ method uses information on all pairs of distances,not merely the minimum or maximum distances.

A) single linkage
B) average linkage
C) complete linkage
D) medium linkage
Question
Which of the following is a variance method of clustering?

A) optimizing partitioning
B) sequential threshold
C) complete linkage
D) Ward's method
Question
The ________ method is based on the maximum distance or the furthest neighbor approach.

A) medium linkage
B) single linkage
C) complete linkage
D) average linkage
Question
________ methods are commonly used in marketing research.

A) Agglomerative clustering
B) TwoStep clustering
C) Divisive clustering
D) Optimizing partitioning
Question
________ are agglomerative methods of hierarchical clustering in which clusters are generated to minimize the within-cluster variance.

A) Centroid methods
B) Variance methods
C) Parallel methods
D) Linkage methods
Question
Which of the methods below is not a hierarchical method?

A) parallel threshold
B) optimizing partitioning
C) both A and B
D) variance
Question
Which statement is not true concerning the clustering solution if the variables are measured in vastly different units?

A) We must standardize the data by rescaling each variable to have a mean of zero and standard deviation of unity.
B) Standardization can reduce the differences between groups on variables that may best discriminate groups or clusters.
C) It is desirable to eliminate outliers.
D) The clustering solution will not be influenced by the units of measurement.
Question
Which method allows the researcher to obtain information on cluster membership of cases if the number of clusters is specified?

A) cluster centers
B) scree plot
C) icicle plot
D) both A and C
Question
What suggested guidelines researchers can use when deciding on the number of clusters?
Question
If you are performing cluster analysis on the same data using different distance measures and then comparing the results across measures to determine stability of the solutions,you are at which stage of the cluster analysis process?

A) interpreting and profiling the clusters
B) deciding on the number of clusters
C) assessing reliability and validity
D) selecting a clustering procedure
Question
________ involves examining the cluster centroids.

A) Interpreting and profiling the clusters
B) Deciding on the number of clusters
C) Assessing reliability and validity
D) Selecting a clustering procedure
Question
To use cluster analysis for clustering variables to identify homogeneous groups,the researcher could do all of the following except ________.

A) inserting communalities in the diagonal of the correlation matrix
B) using the correlation coefficient as a measure of similarity between variables
C) using the variables as the units of analysis
D) A and B
Question
Which of the following is not a procedure to check the quality of clustering results?

A) Use the same method of clustering and compare the results.
B) Split the data randomly into halves.Perform clustering separately on each half.Compare cluster centroids across the two subsamples.
C) Perform cluster analysis on the same data using different distance measures.Compare the results across measures to determine the stability of the solutions.
D) Delete variables randomly.Perform clustering based on the reduced set of variables.Compare the results with those obtained by clustering based on the entire set of variables.
Question
What are the steps in conducting cluster analysis (Figure 20.3 in the text)?
Question
In SAS,the ________ program can be used for the hierarchical clustering of objects or cases.

A) CLUSTER ANALYSIS
B) FASTCLUS
C) HIERARCHICAL CLUSTER
D) VARCLUS
Question
In SPSS,the main program for hierarchical clustering of objects or cases is ________.

A) CLUSTER ANALYSIS
B) FASTCLUS
C) HIERARCHICAL CLUSTER
D) VARCLUS
Question
In non-hierarchical clustering,plotting the ratio of total within-group variance to between-group variance against the number of clusters is useful if you are ________.

A) interpreting and profiling the clusters
B) deciding on the number of clusters
C) assessing the validity of clustering
D) both B and C
Question
Which is best to use when selecting a clustering procedure: hierarchical or nonhierarchical clustering?
Question
Which method allows the researcher to obtain information on cluster membership of cases if the number of clusters is specified?

A) factor loading plot
B) scree plot
C) scattergram
D) icicle plot
Question
Why should the clustering of variables be used?
Unlock Deck
Sign up to unlock the cards in this deck!
Unlock Deck
Unlock Deck
1/73
auto play flashcards
Play
simple tutorial
Full screen (f)
exit full mode
Deck 20: Cluster Analysis
1
Choice of a clustering method and choice of a distance measure are interrelated.
True
2
Clustering should be done on samples of at least 300 or more.
False
3
The primary objective of cluster analysis is to classify objects into relatively homogeneous groups.
True
4
Nonhierarchical clustering is faster than hierarchical methods.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
5
In the TwoStep procedure,the euclidean measure can be used only when all of the variables are ordinal.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
6
In cluster analysis,the set of variables selected should describe the similarity between objects in terms that are relevant to the marketing research problem.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
7
The centroid method is a variance method of hierarchical clustering in which the distance between two clusters is the distance between their centroids (means for all the variables).
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
8
The parallel threshold method differs from the other two non-hierarchical clustering procedures in that the objects can later be reassigned to clusters to optimize an overall criterion.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
9
Cluster analysis does not classify variables as dependent or independent.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
10
Most clustering methods are relatively complex procedures that are supported by an extensive body of statistical reasoning.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
11
The complete linkage method of hierarchical clustering is based on the minimum distance or the nearest neighbor approach.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
12
Use of different distance measures may lead to different clustering results.Hence,it is advisable to use different measures and compare the results.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
13
The dendrogram is read from right to left.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
14
Cluster analysis is the obverse of factor analysis in that it reduces the number of objects,not the number of variables,by grouping them into a much smaller number of clusters.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
15
The TwoStep cluster analysis procedure can automatically determine the optimal number of clusters by comparing the values of a model-choice criteria across different clustering solutions.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
16
Cluster analysis requires prior knowledge of the cluster or group membership for each object or case included to develop the classification rule.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
17
In cluster analysis,objects with larger distances between them are more similar to each other than are those at smaller distances.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
18
Measuring similarity in terms of distance between pairs of objects is the most common approach used in cluster analysis for grouping similar objects together.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
19
If cluster analysis is used as a general data reduction tool,subsequent multivariate analysis can be conducted on the clusters rather than on the individual observations.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
20
The average linkage method of hierarchical clustering is preferred to the single and complete linkage methods.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
21
In hierarchical clustering,the solution may depend on the order of cases in the data set.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
22
Formal procedures for assessing the reliability and validity of clustering are simple and should be undertaken.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
23
The ________ are the initial starting points in nonhierarchical clustering.

A) factor loadings
B) factor scores
C) cluster centers
D) cluster centroids
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
24
The most important part of ________ is selecting the variables on which clustering is based.

A) interpreting and profiling clusters
B) assessing the validity of clustering
C) selecting a clustering procedure
D) formulating the clustering problem
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
25
One method of assessing reliability and validity of clustering is to use different methods of clustering and compare the results.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
26
It is helpful to profile the clusters in terms of variables that were not used for clustering.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
27
To reduce the number of variables,a large set of variables can often be replaced by the set of cluster components.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
28
It is possible to obtain information on cluster membership of cases via the icicle plot if the number of clusters is specified.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
29
Which statement is not true about cluster analysis?

A) Objects in each cluster tend to be similar to each other and dissimilar to objects in the other clusters.
B) Groups or clusters are suggested by the data,not defined a priori.
C) Cluster analysis is also called classification analysis or numerical taxonomy.
D) Cluster analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the independent variables are interval in nature.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
30
Cluster analysis has been used in marketing for all of the purposes below except ________.

A) determining how strongly sales are related to advertising expenditures
B) identifying new product opportunities by clustering brands and products so that competitive sets within the market can be determined
C) segmenting the market based on benefits sought from the purchase of a product
D) selecting test markets
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
31
Clustering should be done on samples of ________ or more.

A) 50
B) 100
C) 200
D) 300
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
32
Principal components are usually easier to interpret than the cluster components.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
33
In non-hierarchical clustering,the F test is only descriptive.Because the cases or objects are systematically assigned to clusters to maximize differences on the clustering variables,the resulting probabilities should not be interpreted as testing the null hypothesis of no differences among clusters.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
34
A ________ is a lower-triangle matrix containing pairwise distances between objects or cases.

A) similarity/distance coefficient matrix
B) classification matrix
C) factor matrix
D) correlation matrix
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
35
Most ________ methods are heuristics based on algorithms.

A) analysis of variance
B) discriminant analysis
C) factor analysis
D) clustering
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
36
The centroids represent the mean values of the objects contained in the cluster on each of the variables.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
37
Which method of analysis does not classify variables as dependent or independent?

A) cluster analysis
B) regression analysis
C) analysis of variance
D) discriminant analysis
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
38
When cluster analysis is also used for clustering variables to identify homogeneous groups,the units used for analysis are the variables and the distance measures are computed for all pairs of variables.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
39
________ is a class of techniques used to classify objects or cases into relatively homogeneous groups.

A) Common factor analysis
B) Cluster analysis
C) Principal components analysis
D) Conjoint analysis
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
40
A(n)________ or tree graph is a graphical device for displaying clustering results.Vertical lines represent clusters that are joined together.The position of the line on the scale indicates the distances at which clusters were joined.

A) scattergram
B) scree plot
C) icicle diagram
D) dendrogram
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
41
________ is a clustering procedure characterized by the development of a tree-like structure.

A) Optimizing partitioning clustering
B) Non-hierarchical clustering
C) Hierarchical clustering
D) TwoStep clustering
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
42
________ is a clustering procedure where each object starts out in a separate cluster.

A) Non-hierarchical clustering
B) Divisive clustering
C) Agglomerative clustering
D) Hierarchical clustering
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
43
________ is a procedure that first assigns or determines a cluster center and then groups all objects within a pre-specified threshold value from the center.

A) Divisive clustering
B) Non-hierarchical clustering
C) Ward's method
D) Agglomerative clustering
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
44
The ________ is a nonhierarchical method in which a cluster center is selected and all objects within a pre-specified threshold value from the center are grouped together.

A) Ward's procedure
B) optimizing partitioning method
C) sequential threshold method
D) parallel threshold method
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
45
The ________ is a nonhierarchical method that allows for later reassignment of objects to clusters to optimize an overall criterion.

A) Ward's procedure
B) optimizing partitioning method
C) parallel threshold method
D) sequential threshold method
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
46
________ is a variance method in which the squared euclidean distance to the cluster means is minimized.

A) Optimizing partitioning method
B) Parallel threshold method
C) Ward's procedure
D) Sequential threshold method
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
47
The ________ method is based on minimum distance or the nearest neighbor rule.

A) medium linkage
B) complete linkage
C) average linkage
D) single linkage
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
48
________ is a clustering procedure where all objects start out in one giant cluster.Clusters are formed by dividing this cluster into smaller and smaller clusters.

A) Divisive clustering
B) Agglomerative clustering
C) Hierarchical clustering
D) Non-hierarchical clustering
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
49
Which of the following is not a disadvantage of nonhierarchical clustering procedures?

A) The number of clusters must be pre-specified.
B) The procedures do not work well when the clusters are poorly defined.
C) The selection of cluster centers is arbitrary.
D) All of the above are disadvantages.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
50
The most commonly used measure of similarity is the ________ or its square.

A) Chebychev's distance
B) Manhattan distance
C) euclidean distance
D) city-block distance
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
51
Which cluster analysis procedure can automatically determine the optimal number of clusters by comparing the values of a model-choice across different clustering solutions?

A) Ward's method
B) divisive
C) TwoStep
D) sequential threshold
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
52
________ is frequently referred to as k-means clustering.

A) Non-hierarchical clustering
B) Ward's method
C) Agglomerative clustering
D) Divisive clustering
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
53
The ________ is a nonhierarchical method that specifies several cluster centers at once.All objects within a pre-specified threshold value from the center are grouped together.

A) optimizing partitioning method
B) Ward's procedure
C) parallel threshold method
D) sequential threshold method
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
54
The ________ method uses information on all pairs of distances,not merely the minimum or maximum distances.

A) single linkage
B) average linkage
C) complete linkage
D) medium linkage
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
55
Which of the following is a variance method of clustering?

A) optimizing partitioning
B) sequential threshold
C) complete linkage
D) Ward's method
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
56
The ________ method is based on the maximum distance or the furthest neighbor approach.

A) medium linkage
B) single linkage
C) complete linkage
D) average linkage
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
57
________ methods are commonly used in marketing research.

A) Agglomerative clustering
B) TwoStep clustering
C) Divisive clustering
D) Optimizing partitioning
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
58
________ are agglomerative methods of hierarchical clustering in which clusters are generated to minimize the within-cluster variance.

A) Centroid methods
B) Variance methods
C) Parallel methods
D) Linkage methods
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
59
Which of the methods below is not a hierarchical method?

A) parallel threshold
B) optimizing partitioning
C) both A and B
D) variance
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
60
Which statement is not true concerning the clustering solution if the variables are measured in vastly different units?

A) We must standardize the data by rescaling each variable to have a mean of zero and standard deviation of unity.
B) Standardization can reduce the differences between groups on variables that may best discriminate groups or clusters.
C) It is desirable to eliminate outliers.
D) The clustering solution will not be influenced by the units of measurement.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
61
Which method allows the researcher to obtain information on cluster membership of cases if the number of clusters is specified?

A) cluster centers
B) scree plot
C) icicle plot
D) both A and C
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
62
What suggested guidelines researchers can use when deciding on the number of clusters?
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
63
If you are performing cluster analysis on the same data using different distance measures and then comparing the results across measures to determine stability of the solutions,you are at which stage of the cluster analysis process?

A) interpreting and profiling the clusters
B) deciding on the number of clusters
C) assessing reliability and validity
D) selecting a clustering procedure
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
64
________ involves examining the cluster centroids.

A) Interpreting and profiling the clusters
B) Deciding on the number of clusters
C) Assessing reliability and validity
D) Selecting a clustering procedure
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
65
To use cluster analysis for clustering variables to identify homogeneous groups,the researcher could do all of the following except ________.

A) inserting communalities in the diagonal of the correlation matrix
B) using the correlation coefficient as a measure of similarity between variables
C) using the variables as the units of analysis
D) A and B
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
66
Which of the following is not a procedure to check the quality of clustering results?

A) Use the same method of clustering and compare the results.
B) Split the data randomly into halves.Perform clustering separately on each half.Compare cluster centroids across the two subsamples.
C) Perform cluster analysis on the same data using different distance measures.Compare the results across measures to determine the stability of the solutions.
D) Delete variables randomly.Perform clustering based on the reduced set of variables.Compare the results with those obtained by clustering based on the entire set of variables.
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
67
What are the steps in conducting cluster analysis (Figure 20.3 in the text)?
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
68
In SAS,the ________ program can be used for the hierarchical clustering of objects or cases.

A) CLUSTER ANALYSIS
B) FASTCLUS
C) HIERARCHICAL CLUSTER
D) VARCLUS
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
69
In SPSS,the main program for hierarchical clustering of objects or cases is ________.

A) CLUSTER ANALYSIS
B) FASTCLUS
C) HIERARCHICAL CLUSTER
D) VARCLUS
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
70
In non-hierarchical clustering,plotting the ratio of total within-group variance to between-group variance against the number of clusters is useful if you are ________.

A) interpreting and profiling the clusters
B) deciding on the number of clusters
C) assessing the validity of clustering
D) both B and C
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
71
Which is best to use when selecting a clustering procedure: hierarchical or nonhierarchical clustering?
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
72
Which method allows the researcher to obtain information on cluster membership of cases if the number of clusters is specified?

A) factor loading plot
B) scree plot
C) scattergram
D) icicle plot
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
73
Why should the clustering of variables be used?
Unlock Deck
Unlock for access to all 73 flashcards in this deck.
Unlock Deck
k this deck
locked card icon
Unlock Deck
Unlock for access to all 73 flashcards in this deck.