A company has several teams, and each team has their own Amazon RDS database that totals 100 TB. The company is building a data query platform for Business Intelligence Analysts to generate a weekly business report. The new system must run ad-hoc SQL queries. What is the MOST cost-effective solution?
A) Create a new Amazon Redshift cluster. Create an AWS Glue ETL job to copy data from the RDS databases to the Amazon Redshift cluster. Use Amazon Redshift to run the query.
B) Create an Amazon EMR cluster with enough core nodes. Run an Apache Spark job to copy data from the RDS databases to a Hadoop Distributed File System (HDFS) . Use a local Apache Hive metastore to maintain the table definition. Use Spark SQL to run the query.
C) Use an AWS Glue ETL job to copy all the RDS databases to a single Amazon Aurora PostgreSQL database. Run SQL queries on the Aurora PostgreSQL database.
D) Use an AWS Glue crawler to crawl all the databases and create tables in the AWS Glue Data Catalog. Use an AWS Glue ETL job to load data from the RDS databases to Amazon S3, and use Amazon Athena to run the queries.
Correct Answer:
Verified
Q727: A company is planning to migrate an
Q728: A solutions architect is implementing infrastructure as
Q729: A solutions architect is designing the data
Q730: An AWS account owner has setup multiple
Q731: A financial company is using a high-performance
Q733: A company runs a popular public-facing ecommerce
Q734: A company has released a new version
Q735: During a security audit of a Service
Q736: A solutions architect needs to migrate 50
Q737: A company has an Amazon VPC that
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents