Services
Discover
Homeschooling
Ask a Question
Log in
Sign up
Filters
Done
Question type:
Essay
Multiple Choice
Short Answer
True False
Matching
Topic
Computing
Study Set
Decision Support and Business Intelligence
Quiz 7: Text and Web Mining
Path 4
Access For Free
Share
All types
Filters
Study Flashcards
Practice Exam
Learn
Question 41
Short Answer
________ is the grouping of similar documents without having a predefined set of categories.
Question 42
Short Answer
________ mining is the extraction of useful information from data generated through Web page visits and transactions.
Question 43
Short Answer
The ________ model,which is one where multiple sources of data describing the same population are integrated to increase the depth and richness of the resulting analysis,forms the framework of the Web site optimization ecosystem
Question 44
Short Answer
________ is the semi-automated process of extracting patterns from large amounts of unstructured data sources.
Question 45
Short Answer
________ words or noise words are words that are filtered out prior to or after processing of natural language data.
Question 46
Short Answer
In linguistics,a(n)________ is a large and structured set of texts prepared for the purpose of conducting knowledge discovery.
Question 47
Short Answer
At a very high level,the first of three consecutive tasks in the text mining process is to establish the ________,which is a list of organized documents.
Question 48
Short Answer
________ is a technique used to detect favorable and unfavorable opinions toward specific products and services using textual data sources,such as customer feedback in Web postings and the detection of unfavorable rumors.
Question 49
Short Answer
The term "stop-words" are used by text mining to ________ commonly used words.
Question 50
Short Answer
In the text mining process,the output of task two is a flat file called a ________ where the cells are populated with the term frequencies.
Question 51
Short Answer
________ applications focus on "who and how" questions by gathering and reporting direct feedback from site visitors,by benchmarking against other sites and offline channels,and by supporting predictive modeling of future visitor behavior.