PigSPARQL
Pig Latin is a high-level language for data analysis tasks, developed at Yahoo! Research. Pig Latin programs are automatically translated into MapReduce jobs and executed on a Hadoop cluster. PigSPARQL is a translation from SPARQL 1.0 to Pig Latin that makes it possible to execute SPARQL queries on large RDF graphs with MapReduce.
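To give a feel for what such a translation involves, here is a minimal Python sketch that turns a small basic graph pattern into Pig Latin text. It is an illustration only and not PigSPARQL's actual translator or output: the function names (`translate_pattern`, `translate_bgp`), the relation names (`triples`, `tp0`, `j1`, ...) and the assumption that the RDF data is already available as a relation with fields `(s, p, o)` are all hypothetical. The sketch also assumes that every triple pattern contains at least one variable and that consecutive patterns share a variable.

```python
def is_var(term):
    """SPARQL variables start with '?'."""
    return term.startswith("?")


def translate_pattern(index, pattern):
    """Translate one triple pattern into a FILTER plus a projection.

    Returns (alias, variable_names, pig_lines).
    """
    columns = ("s", "p", "o")
    # Bound positions become equality conditions in a FILTER.
    conditions = [f"{col} == '{term}'"
                  for col, term in zip(columns, pattern) if not is_var(term)]
    # Variable positions are kept and renamed after the variable.
    projections = [(col, term[1:])
                   for col, term in zip(columns, pattern) if is_var(term)]
    lines = []
    source = "triples"
    if conditions:
        lines.append(f"f{index} = FILTER triples BY {' AND '.join(conditions)};")
        source = f"f{index}"
    generate = ", ".join(f"{col} AS {name}" for col, name in projections)
    alias = f"tp{index}"
    lines.append(f"{alias} = FOREACH {source} GENERATE {generate};")
    return alias, [name for _, name in projections], lines


def translate_bgp(patterns):
    """Join the translated patterns one after another on shared variables."""
    # Assumption: loading the RDF data is handled elsewhere.
    lines = ["-- assumes a relation 'triples' with fields (s, p, o) is already loaded"]
    current_alias, current_vars = None, []
    for i, pattern in enumerate(patterns):
        alias, variables, pig = translate_pattern(i, pattern)
        lines.extend(pig)
        if current_alias is None:
            current_alias, current_vars = alias, variables
            continue
        shared = [v for v in current_vars if v in variables]
        if not shared:
            raise ValueError("this sketch only handles patterns that share a variable")
        keys = ", ".join(shared)
        lines.append(f"j{i} = JOIN {current_alias} BY ({keys}), {alias} BY ({keys});")
        # After a JOIN, Pig prefixes field names with the source alias,
        # so the result is projected back to plain variable names.
        new_vars = [v for v in variables if v not in current_vars]
        generate = ([f"{current_alias}::{v} AS {v}" for v in current_vars]
                    + [f"{alias}::{v} AS {v}" for v in new_vars])
        lines.append(f"bgp{i} = FOREACH j{i} GENERATE {', '.join(generate)};")
        current_alias, current_vars = f"bgp{i}", current_vars + new_vars
    return "\n".join(lines)


# SELECT * WHERE { ?person foaf:name ?name . ?person foaf:mbox ?mail }
query = [("?person", "foaf:name", "?name"),
         ("?person", "foaf:mbox", "?mail")]
print(translate_bgp(query))
```

In this sketch each triple pattern becomes a `FILTER` on its bound positions, and patterns that share a variable are combined with a `JOIN`. A real translation additionally has to cover the remaining SPARQL operators such as OPTIONAL, UNION and FILTER expressions.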
Related Publications
- Alexander Schätzle, Martin Przyjaciel-Zablocki, Thomas Hornung, Georg Lausen:
  PigSPARQL: A SPARQL Query Processing Baseline for Big Data [ .pdf ] [ Poster ]
  Proc. of the ISWC 2013 Posters & Demonstrations Track (ISWC 2013). Sydney (Australia).
- Alexander Schätzle, Martin Przyjaciel-Zablocki, Georg Lausen:
  PigSPARQL: Mapping SPARQL to Pig Latin [ .pdf ]
  3rd International Workshop on Semantic Web Information Management (SWIM 2011), in conjunction with the 2011 ACM International Conference on Management of Data (SIGMOD 2011). Athens (Greece).
- Alexander Schätzle, Martin Przyjaciel-Zablocki, Thomas Hornung, Georg Lausen:
  PigSPARQL: Übersetzung von SPARQL nach Pig Latin [ .pdf ]
  In Proc. 14th Conference on Database Systems for Business, Technology and Web (BTW 2011). Kaiserslautern (Germany).
Requirements
- Apache Hadoop. We recommend Cloudera's Distribution of Hadoop (CDH4); the implementation is tested with CDH 4.3.
- Apache Pig. We recommend the version shipped with CDH4.
- Java version 6
- RDF data in (extended) N-Triples format. Beyond the standard N-Triples syntax, PigSPARQL also supports the most commonly used prefixes as well as the prefixes used in the SP2Bench, LUBM and BSBM benchmarks.
Downloads
Version | Description | Download |
---|---|---|
PigSPARQL v2.0 | Readme, Sources and Binaries | PigSPARQL_v2.0.tar |
PigSPARQL v1.0 | Readme and Binaries | PigSPARQL_v1.0.tar |
GitHub
You can also check out the source code of PigSPARQL from the following GitHub repository: https://github.com/aschaetzle/PigSPARQL