A company has collected more than 100 TB of log files in the last 24 months. The files are stored as raw text in a dedicated Amazon S3 bucket. Each object has a key of the form year-month-day_log_HHmmss.txt where HHmmss represents the time the log file was initially created. A table was created in Amazon Athena that points to the S3 bucket. One-time queries are run against a subset of columns in the table several times an hour. A data analyst must make changes to reduce the cost of running these queries. Management wants a solution with minimal maintenance overhead. Which combination of steps should the data analyst take to meet these requirements? (Choose three.)
A) Convert the log files to Apace Avro format.
B) Add a key prefix of the form date=year-month-day/ to the S3 objects to partition the data.
C) Convert the log files to Apache Parquet format.
D) Add a key prefix of the form year-month-day/ to the S3 objects to partition the data.
E) Drop and recreate the table with the PARTITIONED BY clause. Run the ALTER TABLE ADD PARTITION statement.
F) Drop and recreate the table with the PARTITIONED BY clause. Run the MSCK REPAIR TABLE statement.
Correct Answer:
Verified
Q50: An online retail company with millions of
Q51: A company uses Amazon Redshift as its
Q52: A financial company uses Apache Hive on
Q53: A company uses the Amazon Kinesis SDK
Q54: A technology company is creating a dashboard
Q56: A company has an application that ingests
Q57: A company's data analyst needs to ensure
Q58: A large university has adopted a strategic
Q59: A company that monitors weather conditions from
Q60: A university intends to use Amazon Kinesis
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents