A Data Science team is designing a dataset repository where it will store a large amount of training data commonly used in its machine learning models. As Data Scientists may create an arbitrary number of new datasets every day, the solution has to scale automatically and be cost-effective. Also, it must be possible to explore the data using SQL. Which storage scheme is MOST adapted to this scenario?
A) Store datasets as files in Amazon S3.
B) Store datasets as files in an Amazon EBS volume attached to an Amazon EC2 instance.
C) Store datasets as tables in a multi-node Amazon Redshift cluster.
D) Store datasets as global tables in Amazon DynamoDB.
Correct Answer:
Verified
Q56: A retail chain has been ingesting purchasing
Q57: A Machine Learning Specialist receives customer data
Q58: A Machine Learning Specialist is using an
Q59: A Machine Learning Specialist is configuring Amazon
Q60: An insurance company is developing a new
Q62: A Data Scientist is working on an
Q63: A Machine Learning Specialist must build out
Q64: A Machine Learning Specialist wants to determine
Q65: Given the following confusion matrix for a
Q66: A Machine Learning Specialist wants to bring
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents