Databases and Information Systems


Pig Latin is a high-level language for data analysis tasks, developed at Yahoo! Research. Pig Latin programs are automatically translated into MapReduce jobs and executed on a Hadoop cluster.
PigSPARQL is a translation from SPARQL 1.0 to Pig Latin, which makes it possible to execute SPARQL queries on large RDF graphs with MapReduce.
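
To illustrate the general idea behind such a translation, the following is a minimal Python sketch. It is not the PigSPARQL implementation. It evaluates a SPARQL basic graph pattern as one selection on a triple table per triple pattern, followed by joins on shared variables, which are the kind of relational operations that Pig Latin offers as FILTER and JOIN. The sample data, the function names and the example query are illustrative assumptions.

```python
# Illustrative sketch only: evaluates a basic graph pattern over an in-memory
# list of triples using selection + join. All names and data are hypothetical.

def is_var(term):
    """A term is a variable if it starts with '?', as in SPARQL."""
    return term.startswith("?")

def match_pattern(triples, pattern):
    """Selection step: one variable binding (dict) per triple matching the pattern."""
    results = []
    for triple in triples:
        binding = {}
        ok = True
        for term, value in zip(pattern, triple):
            if is_var(term):
                # The same variable may occur twice in a pattern; values must agree.
                if binding.get(term, value) != value:
                    ok = False
                    break
                binding[term] = value
            elif term != value:
                ok = False
                break
        if ok:
            results.append(binding)
    return results

def join(left, right):
    """Join step: combine bindings that agree on all shared variables."""
    joined = []
    for l in left:
        for r in right:
            if all(l[v] == r[v] for v in l.keys() & r.keys()):
                joined.append({**l, **r})
    return joined

def evaluate_bgp(triples, patterns):
    """Evaluate a basic graph pattern by joining the matches of each triple pattern."""
    solutions = [{}]  # a single empty binding acts as the join identity
    for pattern in patterns:
        solutions = join(solutions, match_pattern(triples, pattern))
    return solutions

triples = [
    ("ex:alice", "foaf:knows", "ex:bob"),
    ("ex:bob", "foaf:name", '"Bob"'),
    ("ex:alice", "foaf:name", '"Alice"'),
]
# Corresponds to: SELECT * WHERE { ?x foaf:knows ?y . ?y foaf:name ?n }
bgp = [("?x", "foaf:knows", "?y"), ("?y", "foaf:name", "?n")]
solutions = evaluate_bgp(triples, bgp)
print(solutions)
```

In a MapReduce setting, each join of this kind would typically become one or more jobs over the full triple set rather than a nested loop in memory.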


Related Publications

  • Alexander Schätzle, Martin Przyjaciel-Zablocki, Thomas Hornung, Georg Lausen:
    PigSPARQL: A SPARQL Query Processing Baseline for Big Data.
    Proc. of the ISWC 2013 Posters & Demonstrations Track (ISWC 2013). Sydney (Australia).
  • Alexander Schätzle, Martin Przyjaciel-Zablocki, Georg Lausen:
    PigSPARQL: Mapping SPARQL to Pig Latin.
    3rd International Workshop on Semantic Web Information Management (SWIM 2011),
    in conjunction with the 2011 ACM International Conference on Management of Data (SIGMOD 2011). Athens (Greece).
  • Alexander Schätzle, Martin Przyjaciel-Zablocki, Thomas Hornung, Georg Lausen:
    PigSPARQL: Übersetzung von SPARQL nach PigLatin.
    In Proc. 14th conference on Database Systems for Business, Technology and Web (BTW 2011).
    Kaiserslautern (Germany).


Requirements

  • Apache Hadoop. We recommend Cloudera's Distribution of Hadoop (CDH4); the implementation was tested with CDH 4.3.
  • Apache Pig. We recommend the version shipped with CDH4.
  • Java 6
  • RDF data in (extended) N-Triples format. Beyond plain N-Triples syntax, PigSPARQL also supports the most commonly used prefixes as well as the prefixes used in the SP2Bench, LUBM and BSBM benchmarks.
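
As a rough illustration of what prefix support on top of N-Triples can mean, the Python sketch below expands prefixed names in a triple line to full IRIs. The prefix table, the function names and the simplified line syntax are assumptions made for this example; the prefixes that PigSPARQL actually predefines and the way it parses its input may differ.

```python
# Hypothetical prefix table; the prefixes PigSPARQL actually predefines may differ.
PREFIXES = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "foaf": "http://xmlns.com/foaf/0.1/",
}

def expand_term(term, prefixes=PREFIXES):
    """Expand a prefixed name such as foaf:name to a full IRI in angle brackets."""
    if term.startswith("<") or term.startswith('"') or term.startswith("_:"):
        return term  # already a full IRI, a literal, or a blank node
    prefix, sep, local = term.partition(":")
    if sep and prefix in prefixes:
        return "<" + prefixes[prefix] + local + ">"
    return term  # unknown prefix: leave the term unchanged

def expand_line(line):
    """Expand subject, predicate and object of one triple line ending in ' .'."""
    body = line.strip()
    if body.endswith("."):
        body = body[:-1].strip()
    # Split into at most three parts so that literals containing spaces stay intact.
    subject, predicate, obj = body.split(None, 2)
    return " ".join(expand_term(t) for t in (subject, predicate, obj)) + " ."

expanded = expand_line('<http://example.org/alice> foaf:name "Alice Smith" .')
print(expanded)
```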


Version          Description
PigSPARQL v2.0   Readme, Sources and Binaries (PigSPARQL_v2.0.tar)
PigSPARQL v1.0   Readme and Binaries (PigSPARQL_v1.0.tar)


You can also check out the source code of PigSPARQL from the following Subversion repository.
(username: anonymous, password: anonymous)
