A retail chain has been ingesting purchasing records from its network of 20,000 stores to Amazon S3 using Amazon Kinesis Data Firehose. To support training an improved machine learning model, training records will require new but simple transformations, and some attributes will be combined. The model needs to be retrained daily. Given the large number of stores and the legacy data ingestion, which change will require the LEAST amount of development effort?
A) Require that the stores to switch to capturing their data locally on AWS Storage Gateway for loading into Amazon S3, then use AWS Glue to do the transformation.
B) Deploy an Amazon EMR cluster running Apache Spark with the transformation logic, and have the cluster run each day on the accumulating records in Amazon S3, outputting new/transformed records to Amazon S3.
C) Spin up a fleet of Amazon EC2 instances with the transformation logic, have them transform the data records accumulating on Amazon S3, and output the transformed records to Amazon S3.
D) Insert an Amazon Kinesis Data Analytics stream downstream of the Kinesis Data Firehose stream that transforms raw record attributes into simple transformed values using SQL.
Correct Answer:
Verified
Q51: A Machine Learning Specialist at a company
Q52: A Machine Learning Specialist is creating a
Q53: A gaming company has launched an online
Q54: A company wants to classify user behavior
Q55: A Machine Learning Specialist is required to
Q57: A Machine Learning Specialist receives customer data
Q58: A Machine Learning Specialist is using an
Q59: A Machine Learning Specialist is configuring Amazon
Q60: An insurance company is developing a new
Q61: A Data Science team is designing a
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents