A Data Scientist is developing a binary classifier to predict whether a patient has a particular disease on a series of test results. The Data Scientist has data on 400 patients randomly selected from the population. The disease is seen in 3% of the population. Which cross-validation strategy should the Data Scientist adopt?
A) A k-fold cross-validation strategy with k=5
B) A stratified k-fold cross-validation strategy with k=5
C) A k-fold cross-validation strategy with k=5 and 3 repeats
D) An 80/20 stratified split between training and validation
Correct Answer:
Verified
Q97: A company's Machine Learning Specialist needs to
Q98: A machine learning specialist is running an
Q99: A company wants to classify user behavior
Q100: A data scientist uses an Amazon SageMaker
Q101: A Machine Learning Specialist prepared the following
Q103: The chief editor for a product catalog
Q104: A manufacturer of car engines collects data
Q105: An agricultural company is interested in using
Q106: A large consumer goods manufacturer has the
Q107: A Machine Learning Specialist is applying a
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents