PigSPARQL
Pig Latin is a high-level language developed at Yahoo! Research designed for data analysis tasks, which is automatically transformed into MapReduce jobs and executed in a Hadoop cluster.PigSPARQL is a translation from SPARQL 1.0 to Pig Latin, which allows to execute SPARQL queries on large RDF graphs with MapReduce.
Video
Related Publications
-
Alexander Schätzle, Martin Przyjaciel-Zablocki, Thomas Hornung, Georg Lausen:
PigSPARQL: A SPARQL Query Processing Baseline for Big Data [ .pdf ] [ Poster ]
Proc. of the ISWC 2013 Posters & Demonstrations Track (ISWC 2013). Sydney (Australia). -
Alexander Schätzle, Martin Przyjaciel-Zablocki, Georg Lausen:
PigSPARQL: Mapping SPARQL to Pig Latin [ .pdf ]
3th International Workshop on Semantic Web Information Management (SWIM 2011),
in conjunction with the 2011 ACM International Conference on Management of Data (SIGMOD 2011). Athens (Greece). -
Alexander Schätzle, Martin Przyjaciel-Zablocki, Thomas Hornung, Georg Lausen:
PigSPARQL: Übersetzung von SPARQL nach Pig Latin [ .pdf ]
In Proc. 14th conference on Database Systems for Business, Technology and Web (BTW 2011).
Kaiserslautern (Germany).
Requirements
- Apache Hadoop, we recommend Cloudera's Distribution of Hadoop CDH4, the implementation is tested with CDH 4.3
- Apache Pig, we recommend the version shipped with CDH4
- Java Version 6
- RDF data in (extended) NTriples format. Beyond the syntax of N-Triples it also supports the most commonly used Prefixes as well as the Prefixes used in the SP2Bench, LUBM and BSBM benchmarks.
Downloads
Version | Description | |
---|---|---|
PigSPARQL v2.0 | Readme, Sources and Binaries | ![]() |
PigSPARQL v1.0 | Readme and Binaries | ![]() |
GitHub
You can also checkout the source code of PigSPARQL from the following GitHub repository.https://github.com/aschaetzle/PigSPARQL
<< back