GPU621/Apache Spark

81 bytes added, 16:55, 30 November 2020
=== Hadoop MapReduce ===
MapReduce is the processing component of the Hadoop ecosystem. It assigns data fragments from HDFS to separate map tasks across the cluster, processes the chunks in parallel, and then combines the intermediate pieces into the desired result.
[[File: MapReduce.PNG|thumb|upright=1.2|right|alt=Hadoop cluster|3.2 MapReduce]]
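The map/shuffle/reduce flow described above can be sketched in plain Python. This is an illustrative in-memory model of the programming pattern only, not the actual Hadoop Java API; the function names (`map_task`, `shuffle`, `reduce_task`) and the sample splits are invented for the example:

```python
from collections import defaultdict
from typing import Iterable, Iterator, Tuple

def map_task(split: str) -> Iterator[Tuple[str, int]]:
    # A map task emits one (key, value) pair per word in its data fragment.
    for word in split.split():
        yield (word, 1)

def shuffle(pairs: Iterable[Tuple[str, int]]) -> dict:
    # Between phases the framework groups intermediate values by key.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_task(key: str, values: list) -> Tuple[str, int]:
    # A reduce task combines all values for one key into a final result.
    return key, sum(values)

# Two hypothetical HDFS splits processed by separate map tasks.
splits = ["spark hadoop spark", "hadoop hdfs"]
intermediate = [pair for split in splits for pair in map_task(split)]
result = dict(reduce_task(k, v) for k, v in shuffle(intermediate).items())
# result == {"spark": 2, "hadoop": 2, "hdfs": 1}
```

In real Hadoop the map and reduce functions run as distributed tasks on the nodes holding the data, and the shuffle moves intermediate pairs over the network; the pattern, however, is exactly this word-count shape.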
== Applications ==