Map-Side Merge Join for MapReduce
We provide an adaptation of the classical sort-merge join for MapReduce, which we introduce as an optimization for our RDFPath MapReduce Processor.
Related Publications
- Martin Przyjaciel-Zablocki, Alexander Schätzle, Eduard Skaley, Thomas Hornung, Georg Lausen:
  Map-Side Merge Joins for Scalable SPARQL BGP Processing [ .pdf ]
  Proc. of the 5th IEEE International Conference on Cloud Computing Technology and Science (CloudCom 2013). Bristol (UK).
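As background, the classical sort-merge join that this work adapts can be sketched in a few lines of Java. This is the textbook in-memory algorithm only, not the MapMerge implementation: the class name `SortMergeJoinSketch`, the method `mergeJoin`, and the sample rows are hypothetical, and both inputs are assumed to be already sorted by join key.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative sketch of a classical sort-merge join (hypothetical names,
// not taken from the MapMerge source code).
public class SortMergeJoinSketch {

    // Joins two lists of {key, value} rows on equal keys.
    // Assumption: both lists are sorted ascending by key (String order).
    static List<String[]> mergeJoin(List<String[]> left, List<String[]> right) {
        List<String[]> out = new ArrayList<String[]>();
        int i = 0;
        int j = 0;
        while (i < left.size() && j < right.size()) {
            int cmp = left.get(i)[0].compareTo(right.get(j)[0]);
            if (cmp < 0) {
                i++; // left key is smaller: advance left
            } else if (cmp > 0) {
                j++; // right key is smaller: advance right
            } else {
                // Equal keys: pair every left row of this key
                // with every right row of the same key.
                String key = left.get(i)[0];
                int runStart = j;
                while (i < left.size() && left.get(i)[0].equals(key)) {
                    j = runStart; // rewind to the start of the right run
                    while (j < right.size() && right.get(j)[0].equals(key)) {
                        out.add(new String[] {key, left.get(i)[1], right.get(j)[1]});
                        j++;
                    }
                    i++;
                }
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up sample rows, sorted by key.
        List<String[]> left = Arrays.asList(
            new String[] {"a", "x1"},
            new String[] {"b", "x2"},
            new String[] {"b", "x3"},
            new String[] {"d", "x4"});
        List<String[]> right = Arrays.asList(
            new String[] {"b", "y1"},
            new String[] {"b", "y2"},
            new String[] {"c", "y3"},
            new String[] {"d", "y4"});
        for (String[] row : mergeJoin(left, right)) {
            System.out.println(row[0] + " " + row[1] + " " + row[2]);
        }
    }
}
```

Because both inputs are pre-sorted, each side is scanned once apart from rewinding within a run of duplicate keys, so no hash table or shuffle-style regrouping is needed at join time.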
Requirements
- Hadoop. We recommend Cloudera's Distribution of Hadoop (CDH4); the implementation was tested with CDH 4.1.2.
- Java Version 6
- RDF data in N-Triples format
Downloads
Files | Description | |
---|---|---|
Merge Join v0.2 | Readme and binaries | Download |
SVN
You can also check out the source code of MapMerge from the following Subversion repository (username: anonymous, password: anonymous):
https://dbissvn.informatik.uni-freiburg.de/intern/Projekte/DiPoS/MapMerge/tags