Deck 5: Exploring Data With Graphs
1
Which of the following would be the best way to decide whether the skew in the 'Wipe-It' example (question 6) is problematic for analysts at 'Wipe-It'?
A) See if the z-score is bigger than 1.96 or smaller than -1.96.
B) See if the skew is significant at p < .05.
C) Use the Kolmogorov-Smirnov test.
D) None of the above, because of the large sample size.
Answer: D) None of the above, because of the large sample size.
2
The Kolmogorov-Smirnov test can be used to test which of the following?
A) Whether data are normally distributed.
B) Whether group variances are equal.
C) Whether scores are measured at the interval level.
D) Whether group means differ.
Answer: A) Whether data are normally distributed.
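As a rough illustration of what the test measures, the sketch below computes the Kolmogorov-Smirnov statistic D, the largest gap between the sample's cumulative distribution and that of a normal curve with the same mean and standard deviation. The function name `ks_statistic_normal` and the scores are hypothetical. The cut-off 1.36/√n is the usual large-sample approximation for p < .05; because the mean and standard deviation are estimated from the same data, that cut-off is conservative here (the Lilliefors correction addresses this).

```python
from math import sqrt
from statistics import NormalDist, mean, stdev

def ks_statistic_normal(scores):
    """Largest gap between the empirical CDF and a fitted normal CDF."""
    xs = sorted(scores)
    n = len(xs)
    fitted = NormalDist(mean(xs), stdev(xs))
    d = 0.0
    for i, x in enumerate(xs, start=1):
        cdf = fitted.cdf(x)
        # The empirical CDF jumps from (i - 1) / n to i / n at x.
        d = max(d, i / n - cdf, cdf - (i - 1) / n)
    return d

# Hypothetical, strongly right-skewed scores (illustration only).
scores = [1] * 30 + [2] * 10 + [3] * 5 + [10] * 3 + [40] * 2
d = ks_statistic_normal(scores)
critical = 1.36 / sqrt(len(scores))  # approximate cut-off for p < .05
print(f"D = {d:.3f}, approximate critical value = {critical:.3f}")
print("Departs from normality" if d > critical else "No evidence against normality")
```

A D value above the cut-off corresponds to a significant result, that is, a distribution that differs from normal.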
3
Which of the following are assumptions underlying the use of most parametric tests (based on the normal distribution)?
A) The data should be normally distributed.
B) The samples being tested should have approximately equal variances.
C) The data should be (at least) interval level.
D) All of the above.
Answer: D) All of the above.
4
Which of the following does a box-whisker plot not display?
A) The range
B) The interquartile range
C) The lower quartile
D) The mean
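A box-whisker plot is drawn from a five-number summary of the data. The sketch below computes that summary with the Python standard library; the function name `five_number_summary` and the ratings are hypothetical, and plotting packages differ in how they calculate quartiles (many also stop the whiskers at 1.5 times the interquartile range and draw more extreme scores separately).

```python
from statistics import quantiles

def five_number_summary(values):
    """Minimum, lower quartile, median, upper quartile, maximum."""
    q1, q2, q3 = quantiles(values, n=4, method="inclusive")
    return min(values), q1, q2, q3, max(values)

ratings = [2, 3, 3, 4, 5, 5, 6, 7, 9]  # hypothetical ratings
low, q1, med, q3, high = five_number_summary(ratings)
print("range:", high - low)
print("interquartile range:", q3 - q1)
print("lower quartile:", q1, "median:", med)
```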
5
You need to plot values of one continuous variable against two others, whilst also differentiating groups of cases with different-coloured dots. What type of scatterplot should you use?
A) Simple 3-D scatterplot
B) Grouped 3-D scatterplot
C) Scatterplot matrix
D) Grouped scatterplot
6
Some data were collected about consumer loyalty towards a (relatively) new brand of toilet roll - 'Wipe-It' (using a scale from 1 = I'd never go back to 10 = I will always purchase this brand). The sample was 15,467 people. When looking at the distribution, a skew of 1.23 (SE = 0.65) was identified. The mean rating was 4.78. What is the z-score for the skew in these data?
A) 1.89
B) 0.53
C) -3.92
D) 3.36
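A skew value is converted to a z-score by dividing it by its standard error (z = skew / SE). A minimal sketch of that arithmetic, using the figures quoted in the card; the function name `skew_z_score` is hypothetical.

```python
def skew_z_score(skew, standard_error):
    """Standardise a skewness estimate by dividing it by its standard error."""
    return skew / standard_error

z = skew_z_score(1.23, 0.65)  # skew and SE as quoted in the card
print(round(z, 2))            # 1.89
# Conventional two-tailed cut-off for significance at p < .05.
print(abs(z) > 1.96)          # False
```

Note that the mean rating and the sample size play no part in this calculation.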
7
You are comparing customer lifetime value (CLV) in £s for two of your firm's products - Mug-R and Jug-S. What does acceptance of the assumption of homogeneity mean for the two groups in this context?
A) The variance is twice as big for Mug-R as for Jug-S.
B) Variances in the groups are approximately equal.
C) The variance across the groups is proportional to the means of those groups.
D) The variance for both Mug-R and Jug-S is equal to the interquartile range.
8
If a Kolmogorov-Smirnov test is conducted and the result is significant, what does this mean for the data sample?
A) The data sample is normally distributed.
B) The comparison used in the test is not valid.
C) The data sample is not normally distributed.
D) The test is wrong.
9
What does independence of data mean?
A) That we must never collect two sets of data from one person
B) That independent researchers must collect the data
C) That scores from one participant are free from influences from other participants
D) That scores in one condition are free from influences from other conditions
10
What is an outlier?
A) A set of data outside the data file.
B) A single score (e.g., participant response) that is very different from others.
C) A score derived from a participant who has lied.
D) A variable that cannot be quantified.
11
Which of the following tests whether variances are homogeneous?
A) Levene's test
B) Bartlett's test
C) Neither a nor b
D) Both a and b
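To make the idea of a homogeneity test concrete, here is a minimal sketch of Levene's W statistic, which is in effect a one-way ANOVA F-ratio computed on the absolute deviations of each score from its group mean. The function name `levene_w` and the two groups are hypothetical; the sketch stops at the statistic (it is normally compared with an F distribution on k - 1 and N - k degrees of freedom) and does not compute a p-value. Using the group median instead of the mean gives the Brown-Forsythe variant.

```python
from statistics import mean

def levene_w(*groups):
    """Levene's W: a one-way ANOVA F-ratio on absolute deviations from group means."""
    deviations = [[abs(x - mean(g)) for x in g] for g in groups]
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand = mean(z for zs in deviations for z in zs)
    # Variation of the group mean deviations around the grand mean deviation.
    between = sum(len(zs) * (mean(zs) - grand) ** 2 for zs in deviations)
    # Variation of individual deviations around their own group mean deviation.
    within = sum((z - mean(zs)) ** 2 for zs in deviations for z in zs)
    return (n_total - k) / (k - 1) * between / within

# Hypothetical scores for two groups with the same mean but different spread.
group_1 = [10, 12, 14, 16, 18]
group_2 = [2, 9, 14, 19, 26]
print(f"W = {levene_w(group_1, group_2):.2f}")
```

W is zero when the groups have identical spread and grows as their spreads diverge.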
12
Which of the following is not a property of a variance ratio?
A) It can be used to demonstrate homogeneity of variances.
B) It is one variance divided by another.
C) It is one variance multiplied by another.
D) It can show the effect of a treatment on several groups.
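For reference, a variance ratio can be sketched in a few lines. The version below follows the Hartley F-max convention of putting the larger variance on top, so the result is never below 1; the function name `variance_ratio` and the two groups are hypothetical, and how large a ratio is tolerable depends on the sample sizes involved.

```python
from statistics import variance

def variance_ratio(group_a, group_b):
    """Larger sample variance divided by the smaller one (always >= 1)."""
    var_a, var_b = variance(group_a), variance(group_b)
    return max(var_a, var_b) / min(var_a, var_b)

group_a = [4, 5, 6, 7, 8]    # hypothetical scores, variance 2.5
group_b = [2, 4, 6, 8, 10]   # hypothetical scores, variance 10
print(variance_ratio(group_a, group_b))  # 4.0
```

A ratio close to 1 indicates that the two variances are roughly equal.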
13
You have income data for your entire customer database. Some of your clients are far wealthier than others. Which of the following do outliers least affect?
A) The range
B) The mean
C) The median
D) The standard deviation
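One way to explore this is to compute each statistic with and without an extreme score and see how far it moves. The sketch below does that with the Python standard library; the income figures are invented for illustration.

```python
from statistics import mean, median, stdev

incomes = [28, 31, 33, 35, 38, 40, 42]  # hypothetical incomes, in £000s
with_outlier = incomes + [950]           # one far wealthier client added

for label, data in (("without outlier", incomes), ("with outlier", with_outlier)):
    print(label,
          "| range:", max(data) - min(data),
          "| mean:", round(mean(data), 1),
          "| median:", median(data),
          "| SD:", round(stdev(data), 1))
```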
14
Why are z-scores used to check for outliers?
A) They standardize scores for a known mean and standard deviation, allowing comparison.
B) They allow you to allocate letters for missing values.
C) A z-score is an outlier.
D) They standardize scores in order to convert them to values closer to the mean.
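A minimal sketch of outlier screening with z-scores: each score is expressed as its distance from the mean in standard deviation units, and scores beyond a chosen cut-off are flagged. The function name `flag_outliers`, the scores, and the cut-off of 3 are assumptions for illustration; other cut-offs are also in common use.

```python
from statistics import mean, stdev

def flag_outliers(scores, threshold=3.0):
    """Return (score, z) pairs whose absolute z-score exceeds the threshold."""
    m, s = mean(scores), stdev(scores)
    return [(x, (x - m) / s) for x in scores if abs((x - m) / s) > threshold]

# Hypothetical responses: nineteen typical scores and one extreme score.
scores = [4, 5, 5, 6, 5, 4, 6, 5, 5, 6, 4, 5, 6, 5, 4, 5, 6, 5, 5, 30]
for score, z in flag_outliers(scores):
    print(f"score {score} has z = {z:.2f}")
```

Because the extreme score itself inflates the mean and standard deviation, this check works best with reasonably large samples.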
15
Which of the following is not a transformation that can be used to correct skewed data?
A) Log transformation
B) Tangent transformation
C) Square root transformation
D) Reciprocal transformation
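The effect of a transformation on skew can be checked directly by computing skewness before and after. The sketch below does this for positively skewed, strictly positive scores; the helper `skewness` (population skewness, the mean cubed z-score) and the data are hypothetical. Log and reciprocal transformations need scores above zero, so a constant is usually added first when zeros or negative values are present, and the reciprocal also reverses the order of the scores.

```python
from math import log, sqrt
from statistics import mean, pstdev

def skewness(values):
    """Population skewness: the mean cubed z-score."""
    m, s = mean(values), pstdev(values)
    return mean(((x - m) / s) ** 3 for x in values)

raw = [1, 2, 2, 3, 3, 4, 5, 8, 15, 40]  # hypothetical positively skewed scores
transformed = {
    "raw": raw,
    "square root": [sqrt(x) for x in raw],
    "log": [log(x) for x in raw],
    "reciprocal": [1 / x for x in raw],
}
for name, values in transformed.items():
    print(f"{name:12s} skew = {skewness(values):.2f}")
```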
16
If the distribution of CLV for your two products (Q7) is multimodal, what does this mean?
A) It doesn't have a normal distribution.
B) The data have been entered incorrectly.
C) It will be a normal distribution.
D) It will have to be checked with a Levene's test.