Return Styles: Pseud0ch, Terminal, Valhalla, NES, Geocities, Blue Moon. Entire thread

Redpill me on Hadoop/Apache Spark

Name: Anonymous 2018-08-06 20:27

What the fuck is MapReduce?

Name: Anonymous 2018-08-07 1:22

Technically, it's Map-shuffle-Reduce

As such, a single-threaded implementation of MapReduce will usually not be faster than a traditional (non-MapReduce) implementation; any gains are usually only seen with multi-threaded implementations.[9] The use of this model is beneficial only when the optimized distributed shuffle operation (which reduces network communication cost) and fault tolerance features of the MapReduce framework come into play. Optimizing the communication cost is essential to a good MapReduce algorithm.[10]

Name: Anonymous 2018-08-07 2:11

>>2

still doesn't explain what the fuck it is

explain it like I'm retarded

Name: Anonymous 2018-08-07 3:55

>>1
Enterprise-grade Functional Programming

Name: Anonymous 2018-08-08 4:22

>>3
Some funny way of reordering a data processing step when there's heaps of data and many cores, for processing gains. Could just be a memoization-like trick (eg, direct data to the core with the memo/look-up) or something fancier
Supposed to not work as well for repeated access, like iterative learning

Newer Posts

Redpill me on Hadoop/Apache Spark

1 Name: Anonymous 2018-08-06 20:27

2 Name: Anonymous 2018-08-07 1:22

3 Name: Anonymous 2018-08-07 2:11

4 Name: Anonymous 2018-08-07 3:55

5 Name: Anonymous 2018-08-08 4:22

Name: Anonymous 2018-08-06 20:27

Name: Anonymous 2018-08-07 1:22

Name: Anonymous 2018-08-07 2:11

Name: Anonymous 2018-08-07 3:55

Name: Anonymous 2018-08-08 4:22