How to switch between two tables?

Can we join two tables using MapReduce? What are the requirements in this case: should one of the tables fit into memory, or should both of them fit into memory? Or is memory not a constraint at all?

  • Answer:

    Neither of the two tables has to fit in memory, although if they did, the join would be much faster. The cost when they don't fit is a lot of disk "spill": the MapReduce job writes its intermediate data to disk. As you might know, disk I/O is very slow compared to in-memory computation.
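To make the answer above concrete, here is a minimal sketch of a reduce-side join written in plain Python rather than against a real Hadoop API. The table layouts (`users`, `orders`), the tags `"U"` and `"O"`, and all function names are assumptions made up for illustration. The `shuffle` step here sorts in memory; in an actual MapReduce job that step is a disk-backed sort, which is where the spill comes from and why neither table has to fit in memory.

```python
from itertools import groupby
from operator import itemgetter


def map_users(record):
    # Hypothetical layout: (user_id, name). Emit the join key plus a tagged value.
    user_id, name = record
    yield user_id, ("U", name)


def map_orders(record):
    # Hypothetical layout: (order_id, user_id, amount).
    order_id, user_id, amount = record
    yield user_id, ("O", (order_id, amount))


def shuffle(pairs):
    # Stand-in for the framework's shuffle: sort by key, then group.
    # A real job does this with an external (disk-backed) sort.
    ordered = sorted(pairs, key=itemgetter(0))
    for key, group in groupby(ordered, key=itemgetter(0)):
        yield key, [value for _, value in group]


def reduce_join(key, values):
    # Split the values for one key by source table, then pair them up.
    # Only the rows for a single key are held in memory at once.
    names = [v for tag, v in values if tag == "U"]
    orders = [v for tag, v in values if tag == "O"]
    for name in names:
        for order_id, amount in orders:
            yield key, name, order_id, amount


def run_join(users, orders):
    # Drive the three phases: map both inputs, shuffle, reduce.
    mapped = []
    for record in users:
        mapped.extend(map_users(record))
    for record in orders:
        mapped.extend(map_orders(record))
    joined = []
    for key, values in shuffle(mapped):
        joined.extend(reduce_join(key, values))
    return joined


if __name__ == "__main__":
    users = [(1, "ann"), (2, "bob"), (3, "cy")]
    orders = [(100, 1, 9.5), (101, 2, 3.0), (102, 1, 1.25)]
    for row in run_join(users, orders):
        print(row)
```

This behaves like an inner join: user 3 has no orders, so it produces no output rows. If one table were small enough to fit in memory, each mapper could instead load it into a dictionary and look up matches directly, skipping the shuffle; that is the faster case the answer alludes to.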

Robin Verlangen at Quora
