A Data Science Team Is Designing a Dataset Repository Where

Question 61

Multiple Choice

A Data Science team is designing a dataset repository to store a large amount of training data commonly used in its machine learning models. Because Data Scientists may create an arbitrary number of new datasets every day, the solution must scale automatically and be cost-effective. It must also be possible to explore the data using SQL. Which storage scheme is MOST appropriate for this scenario?


A) Store datasets as files in Amazon S3.
B) Store datasets as files in an Amazon EBS volume attached to an Amazon EC2 instance.
C) Store datasets as tables in a multi-node Amazon Redshift cluster.
D) Store datasets as global tables in Amazon DynamoDB.

Correct Answer:

A) Store datasets as files in Amazon S3.