Your company uses a proprietary system to send inventory data every 6 hours to a data ingestion service in the cloud. Transmitted data includes a payload of several fields and the timestamp of the transmission. If there are any concerns about a transmission, the system re-transmits the data. How should you deduplicate the data most efficiency?
A) Assign global unique identifiers (GUID) to each data entry.
B) Compute the hash value of each data entry, and compare it with all historical data.
C) Store each data entry as the primary key in a separate database and apply an index.
D) Maintain a database table to store the hash value and other metadata for each data entry.
Correct Answer:
Verified
Q1: MJTelco Case Study Company Overview MJTelco is
Q2: Your company is streaming real-time sensor data
Q3: You are designing a basket abandonment system
Q5: Your company's customer and order databases are
Q6: Your company is in a highly regulated
Q7: You work for a car manufacturer and
Q8: Business owners at your company have given
Q9: You are working on a sensitive project
Q10: Your company is migrating their 30-node Apache
Q11: You have spent a few days loading
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents