Misplaced Pages

MapReduce

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.

This is an old revision of this page, as edited by MParaz (talk | contribs) at 10:27, 19 December 2005 (Nutch implementation). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Revision as of 10:27, 19 December 2005 by MParaz (talk | contribs) (Nutch implementation)(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

In Google's MapReduce programming model, parallel computations over large data sets are implemented by specifying a Map function that maps key-value pairs to new key-value pairs and a subsequent Reduce function that consolidates all mapped key-value pairs sharing the same keys to single key-value pairs.

MapReduce is often used in conjunction with Google File System, for greater parallelization.

Other Implementations

Nutch has an experimental implementation of MapReduce.

References

Stub icon

This software article is a stub. You can help Misplaced Pages by expanding it.

Categories: