MapReduce - Misplaced Pages

This is an old revision of this page, as edited by MParaz (talk | contribs) at 10:27, 19 December 2005 (Nutch implementation). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Revision as of 10:27, 19 December 2005 by MParaz (talk | contribs) (Nutch implementation)(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

In Google's MapReduce programming model, parallel computations over large data sets are implemented by specifying a Map function that maps key-value pairs to new key-value pairs and a subsequent Reduce function that consolidates all mapped key-value pairs sharing the same keys to single key-value pairs.

MapReduce is often used in conjunction with Google File System, for greater parallelization.

Other Implementations

Nutch has an experimental implementation of MapReduce.

References

Dean, Jeffrey & Ghemawat, Sanjay (2004). "MapReduce: Simplified Data Processing on Large Clusters". Retrieved Apr. 6, 2005.

This software article is a stub. You can help Misplaced Pages by expanding it.

Categories: