Introduction to MapReduce

Although the Hadoop framework is implemented in Java, MapReduce applications need not be written in: a) Java b) C c) C# d) None of the mentioned

a) Java. Hadoop Pipes is a SWIG-compatible C++ API for implementing MapReduce applications (it is not JNI based).

Point out the correct statement: a) MapReduce tries to place the data and the compute as close as possible b) Map Task in MapReduce is performed using the Mapper() function c) Reduce Task in MapReduce is performed using the Map() function d) All of the mentioned

a) MapReduce tries to place the data and the compute as close as possible. This property of MapReduce is called data locality.

__________ maps input key/value pairs to a set of intermediate key/value pairs. a) Mapper b) Reducer c) Both Mapper and Reducer d) None of the mentioned

a) Mapper. Maps are the individual tasks that transform input records into intermediate records.
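
As an illustration of this answer, a mapper can be sketched in plain Python as a function that takes one input key/value pair and yields intermediate key/value pairs. The word-count task and the name `map_record` are assumptions made for this example; they are not part of the Hadoop API.

```python
from typing import Iterator, Tuple


def map_record(offset: int, line: str) -> Iterator[Tuple[str, int]]:
    """Hypothetical mapper: turn one (offset, line) input pair into (word, 1) pairs."""
    for word in line.split():
        # Each word becomes an intermediate key with the count 1 as its value.
        yield (word.lower(), 1)


# One input record can produce zero, one, or many intermediate records.
pairs = list(map_record(0, "the quick the"))
```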

The ___________ part of MapReduce is responsible for processing one or more chunks of data and producing the output results. a) Maptask b) Mapper c) Task execution d) All of the mentioned

a) Maptask. A map task in MapReduce is performed using the Map() function.

The _________ function is responsible for consolidating the results produced by each of the Map() functions/tasks. a) Reduce b) Map c) Reducer d) All of the mentioned

a) Reduce. The Reduce function collates the work and resolves the results.
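
To make the consolidation step concrete, here is a minimal in-memory sketch in Python. It assumes a word-count job whose map output is already available as a list of (word, 1) pairs. The helper names `group_by_key` and `reduce_counts` are hypothetical, and the grouping step only imitates what the framework's shuffle does between map and reduce.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def group_by_key(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Stand-in for the shuffle: collect all values that share a key."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_counts(key: str, values: List[int]) -> Tuple[str, int]:
    """Hypothetical reducer: consolidate all values for one key into a total."""
    return (key, sum(values))


# Assumed map output from several map tasks.
intermediate = [("the", 1), ("quick", 1), ("the", 1)]
totals = dict(reduce_counts(k, v) for k, v in group_by_key(intermediate).items())
```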

The number of maps is usually driven by the total size of: a) inputs b) outputs c) tasks d) None of the mentioned

a) inputs. The total size of the inputs means the total number of blocks of the input files.
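
The relationship can be checked with a little arithmetic. The sketch below assumes one map task per block and that every file is split only on block boundaries. Real jobs may differ depending on the input format and split settings. The function name `estimate_map_tasks` is hypothetical.

```python
from typing import Iterable

MB = 1024 ** 2
TB = 1024 ** 4


def estimate_map_tasks(file_sizes_bytes: Iterable[int], block_size_bytes: int) -> int:
    """Rough estimate: each file contributes ceil(size / block size) map tasks."""
    total = 0
    for size in file_sizes_bytes:
        # Integer ceiling division; a partly filled last block still needs a map.
        total += (size + block_size_bytes - 1) // block_size_bytes
    return total


# Assumed example: a single 10 TB input with 128 MB blocks.
maps = estimate_map_tasks([10 * TB], 128 * MB)
```

Under these assumptions the 10 TB input yields 81,920 map tasks, since 10 * 1024 * 1024 MB divided by 128 MB is 81,920.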

________ is a utility which allows users to create and run jobs with any executables as the mapper and/or the reducer. a) Hadoop Strdata b) Hadoop Streaming c) Hadoop Stream d) None of the mentioned

b) Hadoop Streaming. Hadoop Streaming is one of the most important utilities in the Apache Hadoop distribution.
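
With Hadoop Streaming, the mapper executable reads input lines on standard input and writes tab-separated key/value lines to standard output. The Python sketch below shows the shape of such a mapper for an assumed word-count job. The function name `run_mapper` is hypothetical, and in-memory streams stand in for stdin and stdout so the example can be run without a cluster.

```python
import io
from typing import TextIO


def run_mapper(stdin: TextIO, stdout: TextIO) -> None:
    """Read text lines and emit one 'word<TAB>1' line per word."""
    for line in stdin:
        for word in line.split():
            stdout.write(f"{word}\t1\n")


# Demo with in-memory streams instead of the real stdin/stdout.
demo_out = io.StringIO()
run_mapper(io.StringIO("hello world\nhello\n"), demo_out)
```

As an actual Streaming executable, the script would call `run_mapper(sys.stdin, sys.stdout)` and be passed to the streaming job through its `-mapper` option.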

_________ is the default Partitioner for partitioning key space. a) HashPar b) Partitioner c) HashPartitioner d) None of the mentioned

c) HashPartitioner. The default partitioner in Hadoop is HashPartitioner, whose getPartition method assigns each key to a partition.
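
HashPartitioner computes the partition as the key's hash code, masked to a non-negative value, modulo the number of reduce tasks. The Python sketch below imitates that rule for string keys. The helper `java_string_hash` is a hypothetical re-implementation of Java's `String.hashCode()` and assumes keys contain only characters in the Basic Multilingual Plane. It is a simulation, not the Hadoop class itself.

```python
def java_string_hash(s: str) -> int:
    """Imitate Java's String.hashCode(): h = 31 * h + char, wrapped to signed 32 bits."""
    h = 0
    for ch in s:
        h = (31 * h + ord(ch)) & 0xFFFFFFFF
    # Reinterpret the unsigned 32-bit value as a signed int, as Java would.
    return h - 0x100000000 if h >= 0x80000000 else h


def get_partition(key: str, num_reduce_tasks: int) -> int:
    """Mask off the sign bit, then take the remainder by the number of reducers."""
    return (java_string_hash(key) & 0x7FFFFFFF) % num_reduce_tasks


# Equal keys always land in the same partition, so one reducer sees all their values.
partitions = {key: get_partition(key, 4) for key in ["a", "ab", "hadoop"]}
```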

A ________ node acts as the slave and is responsible for executing a task assigned to it by the JobTracker. a) MapReduce b) Mapper c) TaskTracker d) JobTracker

c) TaskTracker. The TaskTracker receives the information necessary to execute a task from the JobTracker, executes the task, and sends the results back to the JobTracker.

Point out the wrong statement: a) A MapReduce job usually splits the input data-set into independent chunks which are processed by the map tasks in a completely parallel manner b) The MapReduce framework operates exclusively on <key, value> pairs c) Applications typically implement the Mapper and Reducer interfaces to provide the map and reduce methods d) None of the mentioned

d) None of the mentioned. The MapReduce framework takes care of scheduling tasks, monitoring them, and re-executing failed tasks.

