passage-experiments

This repository contains the experiments in the paper "PASSAGE: Ensuring Completeness and Responsiveness of Public SPARQL Endpoints with SPARQL Continuation Queries"

📄 Abstract

Being able to query online public knowledge graphs such as Wikidata or DBpedia is extremely valuable. However, these queries can be interrupted due to the fair use policies enforced by SPARQL endpoint providers, leading to incomplete results. While these policies help maintain the responsiveness of public SPARQL endpoints, they compromise the completeness of query results, which limits the feasibility of various downstream tasks. Ideally, we should not have to choose between completeness and responsiveness. To address this issue, we introduce and formalize the concept of SPARQL continuation queries. When a SPARQL endpoint interrupts a query, it returns partial results along with a SPARQL continuation query to retrieve the remaining results. If the continuation query is also interrupted,the process repeats, generating further continuation queries until the complete results are obtained. In our experimentation, we show that our continuation server PASSAGE ensures completeness and responsiveness while delivering high performance.

🗂️ Organization of the Repository

./datasets: You can download different formats of the dataset from the git repo. We'll need JNL for Blazegraph and PASSAGE, TDB2 for Jena, and HDT for Sage experiments.
./selected_queries: Contains the queries used in the experiments, which are a subset of the WDBench benchmark that do not contain cartesian products. It contains two subdirectories: ./selected_queries/wdbench-multiple-tps and ./selected_queries/wdbench-opts. The first contains queries with multiple triple patterns, and the second contains queries with optional patterns. These queries take between 1 to 5 minutes to execute in Blazegraph under a single virtual CPU.
./passage: Contains the source code of the PASSAGE implementation. Please refer to the README.md file in this directory for more information.
./passage-communica: PASSAGE x Comunica. The extension of the Comunica smart client for supporting all SPARQL features. Please refer to the README.md file in this directory for more information.
./expe-blazegraph-baseline, ./expe-jena, ./expe-sage, ./expe-passage: Contains the scripts used to run the experiments for comparing the performance of Blazegraph, Jena, Sage, and PASSAGE.
Snakemake file: Contains the workflow for running the experiments.
./blazegraph-cli: Contains Blazegraph CLI commands for Blazegraph experiments. Refer to the README.md file in this directory for more information.

🧪 Steps to Reproduce the Experiments

📂 1. Clone the Repository

📥 2. Download and Set Up the Dataset

Download the dataset in all necessary formats(JNL for Blazegraph and PASSAGE, HDT for SaGe and TDB2 for Jena) and place it in the ./datasets directory.

🔧 3. Build Required Components

Blazegraph & PASSAGE: Refer to the README files in each directory to produce the required .jar files.
Jena: We use version 5.1.0 of Jena, which is already included in the ./expe-jena directory.
Sage: A Docker image has been created to set up Sage for the experiments.

🐋 4. Docker Images

We provide Docker images for each engine's experiments. You can find these in their respective directories.

🚀 5. Run All Experiments

You can reproduce all experiments by running the Snakemake file located in the root directory:

snakemake -p -s Snakefile -c1

⚙️ 6. Run Individual Experiments

To run experiments individually, configure and define the appropriate settings as described in the Snakefile.

📊 7. Analyze the Results

The analysis of the results is done in ./report_analysis.ipynb. You can find the figures and tables in the paper.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

passage-experiments

📄 Abstract

🗂️ Organization of the Repository

🧪 Steps to Reproduce the Experiments

📂 1. Clone the Repository

📥 2. Download and Set Up the Dataset

🔧 3. Build Required Components

🐋 4. Docker Images

🚀 5. Run All Experiments

⚙️ 6. Run Individual Experiments

📊 7. Analyze the Results

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
blazegraph-cli		blazegraph-cli
expe-blazegraph-baseline		expe-blazegraph-baseline
expe-jena		expe-jena
expe-passage		expe-passage
expe-sage		expe-sage
passage-comunica		passage-comunica
passage		passage
selected_queries		selected_queries
.gitignore		.gitignore
README.md		README.md
Snakefile		Snakefile
pom.xml		pom.xml
report_analysis.ipynb		report_analysis.ipynb

Folders and files

Latest commit

History

Repository files navigation

passage-experiments

📄 Abstract

🗂️ Organization of the Repository

🧪 Steps to Reproduce the Experiments

📂 1. Clone the Repository

📥 2. Download and Set Up the Dataset

🔧 3. Build Required Components

🐋 4. Docker Images

🚀 5. Run All Experiments

⚙️ 6. Run Individual Experiments

📊 7. Analyze the Results

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages