A large company receives files from external parties in Amazon EC2 throughout the day. At the end of the day, the files are combined into a single file, compressed into a gzip file, and uploaded to Amazon S3. The total size of all the files is close to 100 GB daily. Once the files are uploaded to Amazon S3, an AWS Batch program executes a COPY command to load the files into an Amazon Redshift cluster. Which program modification will accelerate the COPY process?
A) Upload the individual files to Amazon S3 and run the COPY command as soon as the files become available.
B) Split the number of files so they are equal to a multiple of the number of slices in the Amazon Redshift cluster. Gzip and upload the files to Amazon S3. Run the COPY command on the files.
C) Split the number of files so they are equal to a multiple of the number of compute nodes in the Amazon Redshift cluster. Gzip and upload the files to Amazon S3. Run the COPY command on the files.
D) Apply sharding by breaking up the files so the distkey columns with the same values go to the same file. Gzip and upload the sharded files to Amazon S3. Run the COPY command on the files.
Correct Answer:
Verified
Q3: A data analyst is designing a solution
Q4: A global company has different sub-organizations, and
Q5: An airline has .csv-formatted data stored in
Q6: A banking company is currently using an
Q7: A company analyzes its data in an
Q9: An Amazon Redshift database contains sensitive user
Q10: A company developed a new elections reporting
Q11: A data analyst is using Amazon QuickSight
Q12: A mobile gaming company wants to capture
Q13: A team of data scientists plans to
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents