To process input key-value pairs, your mapper needs to lead a 512 MB data file in memory. What is the best way to accomplish this?
A) Serialize the data file, insert in it the JobConf object, and read the data into memory in the configure method of the mapper.
B) Place the data file in the DistributedCache and read the data into memory in the map method of the mapper.
C) Place the data file in the DataCache and read the data into memory in the configure method of the mapper.
D) Place the data file in the DistributedCache and read the data into memory in the configure method of the mapper.
Correct Answer:
Verified
Q7: Which best describes how TextInputFormat processes input
Q8: On a cluster running MapReduce v1 (MRv1),
Q9: Which process describes the lifecycle of a
Q10: The Hadoop framework provides a mechanism for
Q11: What is the disadvantage of using multiple
Q13: You have the following key-value pairs as
Q14: You need to perform statistical analysis in
Q15: Identify the MapReduce v2 (MRv2 / YARN)
Q16: You have written a Mapper which invokes
Q17: All keys used for intermediate output from
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents