Deck 5: Knowledge Discovery in Databases
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Unlock Deck
Sign up to unlock the cards in this deck!
Unlock Deck
Unlock Deck
1/10
Play
Full screen (f)
Deck 5: Knowledge Discovery in Databases
1

This step of the KDD process model deals with noisy data.
A) Creating a target dataset
B) data preprocessing
C) data transformation
D) data mining
B
2
A data normalization technique for real-valued attributes that divides each numerical value by the same power of 10.
A) min-max normalization
B) z-score normalization
C) decimal scaling
D) decimal smoothing
A) min-max normalization
B) z-score normalization
C) decimal scaling
D) decimal smoothing
C
3

KDD has been described as the application of ___ to data mining.
A) the waterfall model
B) object-oriented programming
C) the scientific method
D) procedural intuition
C
4
The price of a 12 ounce box of cereal decreases from $3.50 to $3.00. What fraction is used to compute the percent decrease in the price of the cereal?
A) 1/3
B) 1/5
C) 1/6
D) 1/7
A) 1/3
B) 1/5
C) 1/6
D) 1/7
Unlock Deck
Unlock for access to all 10 flashcards in this deck.
Unlock Deck
k this deck
5
The relational database model is designed to
A) promote data redundancy.
B) minimize data redundancy.
C) eliminate the need for data transformations.
D) eliminate the need for data preprocessing.
A) promote data redundancy.
B) minimize data redundancy.
C) eliminate the need for data transformations.
D) eliminate the need for data preprocessing.
Unlock Deck
Unlock for access to all 10 flashcards in this deck.
Unlock Deck
k this deck
6

Attibutes may be eliminated from the target dataset during this step of the KDD process.
A) creating a target dataset
B) data preprocessing
C) data transformation
D) data mining
Unlock Deck
Unlock for access to all 10 flashcards in this deck.
Unlock Deck
k this deck
7

The choice of a data mining tool is made at this step of the KDD process.
A) goal identification
B) creating a target dataset
C) data preprocessing
D) data mining
Unlock Deck
Unlock for access to all 10 flashcards in this deck.
Unlock Deck
k this deck
8
This data transformation technique works well when minimum and maximum values for a real-valued attribute are known.
A) min-max normalization
B) decimal scaling
C) z-score normalization
D) logarithmic normalization
A) min-max normalization
B) decimal scaling
C) z-score normalization
D) logarithmic normalization
Unlock Deck
Unlock for access to all 10 flashcards in this deck.
Unlock Deck
k this deck
9
A common method used by some data mining techniques to deal with missing data items during the learning process.
A) replace missing real-valued data items with class means
B) discard records with missing data
C) replace missing attribute values with the values found within other similar instances
D) ignore missing attribute values
A) replace missing real-valued data items with class means
B) discard records with missing data
C) replace missing attribute values with the values found within other similar instances
D) ignore missing attribute values
Unlock Deck
Unlock for access to all 10 flashcards in this deck.
Unlock Deck
k this deck
10
This technique uses mean and standard deviation scores to transform real-valued attributes.
A) decimal scaling
B) min-max normalization
C) z-score normalization
D) logarithmic normalization
A) decimal scaling
B) min-max normalization
C) z-score normalization
D) logarithmic normalization
Unlock Deck
Unlock for access to all 10 flashcards in this deck.
Unlock Deck
k this deck