Deck 1: Data Mining: a First View

Question
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
Develop a profile for credit card customers likely to carry an average monthly balance of more than $1000.00.
Question
Data used to build a data mining model.

A) validation data
B) training data
C) test data
D) hidden data
Question
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
Do meaningful attribute relationships exist in a database containing information about credit card customers?
Question
Supervised learning and unsupervised clustering both require at least one

A) hidden attribute.
B) output attribute.
C) input attribute.
D) categorical attribute.
Question
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
What is the average weekly salary of all female employees under forty years of age?
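Note: a question of this form can be expressed as a filter over stored records followed by an aggregate. The following is a minimal sketch, assuming a hypothetical in-memory employee table; the field names (`sex`, `age`, `weekly_salary`) and all values are invented for illustration and do not come from the deck.

```python
from statistics import mean

# Hypothetical employee records; names, fields, and values are made up.
employees = [
    {"name": "Ana", "sex": "F", "age": 34, "weekly_salary": 900.0},
    {"name": "Bea", "sex": "F", "age": 45, "weekly_salary": 1200.0},
    {"name": "Cal", "sex": "M", "age": 29, "weekly_salary": 800.0},
    {"name": "Dee", "sex": "F", "age": 38, "weekly_salary": 1100.0},
]


def average_weekly_salary(records, sex, max_age):
    """Average weekly salary of records with the given sex and age below max_age."""
    selected = [
        r["weekly_salary"]
        for r in records
        if r["sex"] == sex and r["age"] < max_age
    ]
    # Return None when no record matches, rather than raising on an empty list.
    return mean(selected) if selected else None


result = average_weekly_salary(employees, "F", 40)
print(result)
```

The criteria are stated in full in advance, so nothing has to be learned from the data; the same request could be written as a single SQL `SELECT AVG(...) ... WHERE ...` statement.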
Question
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
What attribute similarities group customers holding one or several insurance policies?
Question
Database query is used to uncover this type of knowledge.

A) deep
B) hidden
C) shallow
D) multidimensional
Question
If a customer is spending more than expected, the customer's intrinsic value is ________ their actual value.

A) greater than
B) less than
C) less than or equal to
D) equal to
Question
A person trained to interact with a human expert in order to capture their knowledge.

A) knowledge programmer
B) knowledge developer
C) knowledge engineer
D) knowledge extractor
Question
The process of forming general concept definitions from examples of concepts to be learned.

A) deduction
B) abduction
C) induction
D) conjunction
Question
Computers are best at learning

A) facts.
B) concepts.
C) procedures.
D) principles.
Question
Supervised learning differs from unsupervised clustering in that supervised learning requires

A) at least one input attribute.
B) input attributes to be categorical.
C) at least one output attribute.
D) output attributes to be categorical.
Question
A structure designed to store data for decision support.

A) operational database
B) flat file
C) decision tree
D) data warehouse
Question
Like the probabilistic view, the ________ view allows us to associate a probability of membership with each classification.

A) exemplar
B) deductive
C) classical
D) inductive
Question
Which of the following is a valid production rule for the decision tree below?

A) IF Business Appointment = No & Temp above 70 = No THEN Decision = wear slacks
B) IF Business Appointment = Yes & Temp above 70 = Yes THEN Decision = wear shorts
C) IF Temp above 70 = No THEN Decision = wear shorts
D) IF Business Appointment = No & Temp above 70 = No THEN Decision = wear jeans
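Note: a production rule corresponds to one root-to-leaf path of a decision tree, with the tests along the path joined by `&` and the leaf as the conclusion. The following is a minimal sketch of that mapping. The tree used here is hypothetical (attributes `Raining` and `Windy` are invented) and is deliberately unrelated to the tree in the question, whose figure is not reproduced in this deck.

```python
# Hypothetical decision tree, NOT the one from the question.
# Internal nodes are dicts; leaves are decision strings.
tree = {
    "attribute": "Raining",
    "branches": {
        "Yes": "take umbrella",
        "No": {
            "attribute": "Windy",
            "branches": {"Yes": "wear jacket", "No": "wear t-shirt"},
        },
    },
}


def extract_rules(node, conditions=()):
    """Return one IF ... THEN rule string per root-to-leaf path."""
    if isinstance(node, str):  # reached a leaf: emit the accumulated rule
        antecedent = " & ".join(f"{attr} = {value}" for attr, value in conditions)
        return [f"IF {antecedent} THEN Decision = {node}"]
    rules = []
    for value, child in node["branches"].items():
        # Extend the path with this node's test and recurse into the branch.
        rules.extend(extract_rules(child, conditions + ((node["attribute"], value),)))
    return rules


rules = extract_rules(tree)
for rule in rules:
    print(rule)
```

A candidate rule is valid for a tree only if its conditions follow one complete path and its conclusion matches the leaf at the end of that path.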
Question
A statement to be tested.

A) theory
B) procedure
C) principle
D) hypothesis
Question
Data mining is best described as the process of

A) identifying patterns in data.
B) deducing relationships in data.
C) representing data.
D) simulating trends in data.
Question
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
Determine the characteristics of a successful used car salesperson.
Question
A nearest neighbor approach is best used

A) with large-sized datasets.
B) when irrelevant attributes have been removed from the data.
C) when a generalized model of the data is desirable.
D) when an explanation of what has been found is of primary importance.
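Note: a nearest neighbor classifier stores the training instances and labels a new instance with the class of the closest stored one. The following is a minimal sketch, assuming numeric input attributes and Euclidean distance; the data and the function name `nearest_neighbor` are invented for illustration.

```python
import math

# Hypothetical labeled instances: (input attribute values, class label).
training = [
    ((1.0, 1.0), "low"),
    ((1.5, 2.0), "low"),
    ((5.0, 5.0), "high"),
    ((6.0, 5.5), "high"),
]


def nearest_neighbor(instance, examples):
    """Return the label of the stored example closest to instance."""
    # Every attribute contributes equally to the Euclidean distance.
    _, label = min(examples, key=lambda ex: math.dist(instance, ex[0]))
    return label


prediction = nearest_neighbor((5.2, 4.8), training)
print(prediction)
```

In this sketch every classification scans all stored instances, no generalized model is built, and each attribute carries equal weight in the distance.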
Question
Which of the following is not a characteristic of a data warehouse?

A) contains historical data
B) designed for decision support
C) stores data in normalized tables
D) promotes data redundancy
Question
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
Do single men play more golf than married men?
Question
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
Determine whether a credit card transaction is valid or fraudulent.
1
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
Develop a profile for credit card customers likely to carry an average monthly balance of more than $1000.00.
a
2
Data used to build a data mining model.

A) validation data
B) training data
C) test data
D) hidden data
B
3
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
Do meaningful attribute relationships exist in a database containing information about credit card customers?
b
4
Supervised learning and unsupervised clustering both require at least one

A) hidden attribute.
B) output attribute.
C) input attribute.
D) categorical attribute.
5
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
What is the average weekly salary of all female employees under forty years of age?
6
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
What attribute similarities group customers holding one or several insurance policies?
7
Database query is used to uncover this type of knowledge.

A) deep
B) hidden
C) shallow
D) multidimensional
8
If a customer is spending more than expected, the customer's intrinsic value is ________ their actual value.

A) greater than
B) less than
C) less than or equal to
D) equal to
9
A person trained to interact with a human expert in order to capture their knowledge.

A) knowledge programmer
B) knowledge developer
C) knowledge engineer
D) knowledge extractor
10
The process of forming general concept definitions from examples of concepts to be learned.

A) deduction
B) abduction
C) induction
D) conjunction
11
Computers are best at learning

A) facts.
B) concepts.
C) procedures.
D) principles.
12
Supervised learning differs from unsupervised clustering in that supervised learning requires

A) at least one input attribute.
B) input attributes to be categorical.
C) at least one output attribute.
D) output attributes to be categorical.
13
A structure designed to store data for decision support.

A) operational database
B) flat file
C) decision tree
D) data warehouse
14
Like the probabilistic view, the ________ view allows us to associate a probability of membership with each classification.

A) exemplar
B) deductive
C) classical
D) inductive
15
Which of the following is a valid production rule for the decision tree below?

A) IF Business Appointment = No & Temp above 70 = No THEN Decision = wear slacks
B) IF Business Appointment = Yes & Temp above 70 = Yes THEN Decision = wear shorts
C) IF Temp above 70 = No THEN Decision = wear shorts
D) IF Business Appointment = No & Temp above 70 = No THEN Decision = wear jeans
16
A statement to be tested.

A) theory
B) procedure
C) principle
D) hypothesis
17
Data mining is best described as the process of

A) identifying patterns in data.
B) deducing relationships in data.
C) representing data.
D) simulating trends in data.
18
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
Determine the characteristics of a successful used car salesperson.
19
A nearest neighbor approach is best used

A) with large-sized datasets.
B) when irrelevant attributes have been removed from the data.
C) when a generalized model of the data is desirable.
D) when an explanation of what has been found is of primary importance.
20
Which of the following is not a characteristic of a data warehouse?

A) contains historical data
B) designed for decision support
C) stores data in normalized tables
D) promotes data redundancy
21
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
Do single men play more golf than married men?
22
Determine which is the best approach for each problem.
a. supervised learning
b. unsupervised clustering
c. data query
Determine whether a credit card transaction is valid or fraudulent.