A company wants to predict the sale prices of houses based on available historical sales data. The target variable in the company's dataset is the sale price. The features include parameters such as the lot size, living area measurements, non-living area measurements, number of bedrooms, number of bathrooms, year built, and postal code. The company wants to use multi-variable linear regression to predict house sale prices. Which step should a machine learning specialist take to remove features that are irrelevant for the analysis and reduce the model's complexity?
A) Plot a histogram of the features and compute their standard deviation. Remove features with high variance.
B) Plot a histogram of the features and compute their standard deviation. Remove features with low variance.
C) Build a heatmap showing the correlation of the dataset against itself. Remove features with low mutual correlation scores.
D) Run a correlation check of all features against the target variable. Remove features with low target variable correlation scores.
Correct Answer:
Verified
Q81: A data scientist has developed a machine
Q82: A data scientist needs to identify fraudulent
Q83: A logistics company needs a forecast model
Q84: A machine learning (ML) specialist wants to
Q85: A Machine Learning Specialist has created a
Q87: A large company has developed a BI
Q88: Machine Learning Specialist is working with a
Q89: A company that promotes healthy sleep patterns
Q90: A company is using Amazon Textract to
Q91: This graph shows the training and validation
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents