Your company receives both batch- and stream-based event data. You want to process the data using Google Cloud Dataflow over a predictable time period. However, you realize that in some instances data can arrive late or out of order. How should you design your Cloud Dataflow pipeline to handle data that is late or out of order?
A) Set a single global window to capture all the data.
B) Set sliding windows to capture all the lagged data.
C) Use watermarks and timestamps to capture the lagged data.
D) Ensure every datasource type (stream or batch) has a timestamp, and use the timestamps to define the logic for lagged data.
Correct Answer:
Verified
Q19: You are building a model to make
Q20: You are creating a model to predict
Q21: Your company is loading comma-separated values (CSV)
Q22: You are designing the database schema for
Q23: An organization maintains a Google BigQuery dataset
Q25: You are deploying a new storage system
Q26: You have some data, which is shown
Q27: An online retailer has built their current
Q28: Your company produces 20,000 files every hour.
Q29: Your analytics team wants to build a
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents