Deck 17: Data Mining

1
The predicted value from a logistic regression will be:

A) between 0 and 1
B) between -1 and 1
C) less than 0
D) greater than 1
between 0 and 1
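A minimal sketch of why the prediction is bounded, assuming the standard logistic (sigmoid) link. The coefficients `b0` and `b1` are made-up values for illustration, not estimates from any dataset in this deck.

```python
import math

def logistic(z: float) -> float:
    """Map any real-valued score z to a probability strictly between 0 and 1."""
    # Two branches keep exp() from overflowing for large |z|.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

# Hypothetical coefficients, chosen only for illustration.
b0, b1 = -1.5, 0.8

# Scores range from very negative to very positive; predictions stay in (0, 1).
predictions = [logistic(b0 + b1 * x) for x in (-10, -1, 0, 1, 10)]
for p in predictions:
    print(round(p, 4))
```

However extreme the score `b0 + b1 * x` becomes, the printed values never reach 0 or 1.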
2
Segmentation is also known as clustering, and involves trying to group entities into similar clusters.
True
3
Clustering is considered a supervised data mining technique.
False
4
Megan is examining the likelihood of people riding the subway. The dependent variable takes on the value of 1 if the individual rides the subway and 0 otherwise. Therefore, she could use logistic regression to examine this question.
5
A facts table has:

A) few rows and many columns
B) many rows and many columns
C) many rows and few columns
D) few rows and few columns
6
Melody is a department store manager and wants to examine whether or not female shoppers are more likely than male shoppers to use a department credit card. "Female = 1" indicates the individual is a female. "Credit Card = 1" indicates the individual used a credit card to make the purchase. "Amount spent" is in dollars. Create a pivot table that illustrates something meaningful about the variables in the accompanying table. What does the pivot table indicate about spending habits?
7
The magnitude of the betas in a logistic regression cannot necessarily be used to determine which variables are the "most important." Explain.
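One common reason, offered as an illustration rather than as the deck's official answer: a coefficient's size depends on the units of its variable. The sketch below uses hypothetical coefficients (`b0`, `b1_dollars`) and a hypothetical purchase amount to show that rescaling a predictor changes the beta by a factor of 1,000 while leaving the prediction unchanged.

```python
import math

def predict(b0: float, b1: float, x: float) -> float:
    """Logistic prediction for a single predictor x."""
    return 1.0 / (1.0 + math.exp(-(b0 + b1 * x)))

# Hypothetical model with amount spent measured in dollars.
b0, b1_dollars = -2.0, 0.004

# The same model with amount spent measured in thousands of dollars:
# the coefficient is 1,000 times larger, but the model is unchanged.
b1_thousands = b1_dollars * 1000

amount_dollars = 750.0  # made-up purchase amount
p_dollars = predict(b0, b1_dollars, amount_dollars)
p_thousands = predict(b0, b1_thousands, amount_dollars / 1000)

print(b1_dollars, b1_thousands)
print(p_dollars, p_thousands)
```

Because the two fitted models are equivalent, comparing raw beta magnitudes across variables measured in different units says little about which variable matters more.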
8
If the regression coefficient estimate from a logistic regression is positive, the probability of the dependent variable taking on a value of 1:

A) decreases
B) approaches zero
C) increases
D) remains constant
9
Melody is a department store manager and wants to examine whether or not female shoppers are more likely than male shoppers to use a department credit card. "Female = 1" indicates the individual is a female. "Credit Card = 1" indicates the individual used a credit card to make the purchase. "Amount spent" is in dollars. Melody runs a logistic regression. If the estimate on the female variable is positive, what does this indicate about credit card usage?
10
The higher the "score" for a particular member in logistic regression, the:

A) higher the likelihood that member is in category 1
B) lower the likelihood that member is in category 1
C) higher the likelihood that member is in category 0
D) higher the likelihood that member is not in a category
11
Data mining is used to examine known, expected patterns and relationships among variables.
12
The testing set in data partitioning is the:

A) first subset of data, which usually contains 70% of the records
B) second subset of data, which usually contains 30% or less of the records
C) initial dataset from which subsets are created
D) first subset of data, which usually contains 30% of the records
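As a rough illustration of data partitioning, the sketch below splits a dataset into a training set and a testing set. It assumes the 70/30 convention mentioned in the options above; the `partition` helper, the seed, and the 1,000 stand-in records are hypothetical.

```python
import random

def partition(records, train_fraction=0.7, seed=0):
    """Shuffle a copy of the records and split it into (training, testing)."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)  # seeded so the split is repeatable
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

records = list(range(1000))  # stand-in for 1,000 rows of data
training, testing = partition(records)
print(len(training), len(testing))
```

Shuffling before cutting avoids a split that simply mirrors the order in which the records were stored.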
13
Which methodology is used to group products that customers purchase together?

A) market basket analysis
B) prediction
C) classification analysis
D) forecasting
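To make the co-purchase idea concrete, here is a minimal sketch of counting item pairs that appear in the same transaction, the core step of market basket analysis. The `baskets` data is invented for illustration, and "support" is used in its usual sense of the share of transactions containing both items.

```python
from collections import Counter
from itertools import combinations

# Hypothetical transactions, invented for illustration.
baskets = [
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "cereal"},
    {"bread", "butter", "cereal"},
]

# Count how many baskets contain each unordered pair of items.
pair_counts = Counter()
for basket in baskets:
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

# Support = share of baskets that contain both items of the pair.
support = {pair: n / len(baskets) for pair, n in pair_counts.items()}
top_pair, top_support = max(support.items(), key=lambda kv: kv[1])
print(top_pair, top_support)
```

In this toy data, bread and butter appear together in three of the four baskets, so that pair has the highest support.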
14
Suppose the odds of Team A winning are 5 to 1. Then, the odds ratio is:

A) 5/1
B) 1/5
C) 6/1
D) 1/6
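For reference, the relationship between odds and probability can be sketched as follows. This assumes "5 to 1" is quoted as odds in favor of winning; the helper names are hypothetical.

```python
from fractions import Fraction

def odds_to_probability(in_favor: int, against: int) -> Fraction:
    """Convert odds of 'in_favor to against' into a probability."""
    return Fraction(in_favor, in_favor + against)

def probability_to_odds(p: Fraction) -> Fraction:
    """Convert a probability p into odds, p / (1 - p)."""
    return p / (1 - p)

p_win = odds_to_probability(5, 1)
print(p_win)                       # 5/6
print(probability_to_odds(p_win))  # 5
```

Odds compare the chance of winning to the chance of losing, whereas probability compares the chance of winning to all outcomes.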
15
A data mart is typically smaller than a data warehouse.
16
Bridget has partitioned data into two subsets. The original file contains 300,000 observations. The subset she is currently working with has 60,000 observations. Which subset is she most likely to be using?

A) the training set
B) the original set
C) the testing set
D) the prediction set
17
Melody is a department store manager and wants to examine whether or not female shoppers are more likely than male shoppers to use a department credit card. "Female = 1" indicates the individual is a female. "Credit Card = 1" indicates the individual used a credit card to make the purchase. "Amount spent" is in dollars. Which variables would be the dependent and independent variables in Melody's model?
18
Mya is investigating the factors that impact soda consumption. She examines a host of variables that help explain the amount consumed. Which type of data mining methodology is she most likely to use?

A) market basket analysis
B) prediction
C) classification analysis
D) forecasting
19
In a facts table, a supermarket database is likely to have which item listed in rows?

A) the number of units sold
B) the revenue generated from a particular unit
C) the department in which the unit was purchased
D) the individual items purchased
20
Create a pivot table that illustrates something meaningful about the variables in the accompanying table.