
SQL2SPARQL - Automatic SQL to SPARQL Converter

An independent, production-ready implementation of SQL to SPARQL conversion for direct RDF querying using familiar SQL syntax.

About This Implementation

This is an original implementation inspired by algorithms described in the academic literature, particularly the paper "SQL2SPARQL4RDF" (ABATAL et al., 2019). The code, architecture, and engineering decisions are entirely our own work.

Features

Complete SQL Support

  • SELECT queries with JOIN, WHERE, GROUP BY, HAVING, ORDER BY
  • Aggregate functions (COUNT, SUM, AVG, MIN, MAX)
  • INSERT and DELETE operations
  • UNION, INTERSECT, EXCEPT combinations

RDF Integration

  • Automatic schema extraction from RDF data
  • Support for multiple RDF formats (Turtle, RDF/XML, N3, JSON-LD)
  • Integration with popular RDF stores (AllegroGraph, Fuseki, Blazegraph, Virtuoso)
  • In-memory processing with RDFLib

Developer Friendly

  • Rich CLI interface with syntax highlighting
  • Python API for integration
  • Comprehensive test suite
  • Detailed logging and error messages

Installation

# Install from source
git clone https://github.com/phamthi1812/sql2sparql
cd sql2sparql
pip install -e .

# Or install with pip
pip install sql2sparql

Requirements

Quick Start

SPARQL Endpoint Setup

For testing with real datasets, you'll need a SPARQL endpoint. See SPARQL_ENDPOINT_SETUP.md for detailed instructions.

Command Line Usage

# Convert a simple SQL query
sql2sparql convert -s "SELECT name, email FROM client" -r data.ttl

# Extract schema from RDF data
sql2sparql extract-schema -r data.ttl -o schema.json

# Batch convert SQL queries
sql2sparql batch-convert -s queries.sql -r data.ttl -o output/

# Test connection to SPARQL endpoint
sql2sparql test-connection -e http://localhost:3030/sparql

# Show examples
sql2sparql examples

Python API Usage

from sql2sparql import SQL2SPARQLConverter, SchemaMapper, SPARQLExecutor
from sql2sparql.executors.sparql_executor import StoreType
from rdflib import Graph

# Load RDF data
graph = Graph()
graph.parse("data.ttl", format="turtle")

# Extract schema
schema_mapper = SchemaMapper(graph)
schema = schema_mapper.extract_schema()

# Create converter
converter = SQL2SPARQLConverter(schema_mapper)

# Convert SQL to SPARQL
sql_query = "SELECT name, email FROM client WHERE age > 25"
sparql_query = converter.convert(sql_query)
print(sparql_query)

# Execute SPARQL query
executor = SPARQLExecutor(store_type=StoreType.RDFLIB, graph=graph)
results = executor.execute_query(sparql_query)
for row in results:
    print(row)

Supported SQL Constructs

SELECT Queries

-- Simple SELECT
SELECT name, email FROM client

-- With WHERE clause
SELECT * FROM product WHERE price > 100

-- With JOIN
SELECT client.name, orders.date
FROM client, orders
WHERE client.id = orders.client_id

-- With aggregates
SELECT category, COUNT(*), AVG(price)
FROM product
GROUP BY category
HAVING COUNT(*) > 5

-- With ORDER BY and LIMIT
SELECT name, price FROM product
ORDER BY price DESC
LIMIT 10

INSERT Queries

INSERT INTO client (name, email, age)
VALUES ('John Doe', 'john@example.com', 30)

DELETE Queries

DELETE FROM client WHERE age < 18

Architecture

The system follows a modular architecture:

sql2sparql/
├── core/
│   ├── converter.py      # Main converter orchestrator
│   ├── schema_mapper.py  # RDF schema extraction
│   └── models.py         # Data models
├── parsers/
│   └── sql_parser.py     # SQL query parser
├── converters/
│   ├── select_converter.py      # SELECT clause converter
│   ├── where_converter.py       # WHERE clause converter
│   ├── group_having_converter.py # GROUP BY/HAVING converter
│   └── insert_delete_converter.py # INSERT/DELETE converter
├── executors/
│   └── sparql_executor.py       # SPARQL execution engine
└── cli/
    └── main.py                   # CLI interface

Conversion Algorithm

The conversion follows these steps:

  1. Schema Extraction: Analyze RDF data to extract a relational schema
  2. SQL Parsing: Parse SQL query into structured components
  3. Pattern Generation: Convert SQL elements to SPARQL triple patterns
  4. Query Construction: Assemble SPARQL query with proper syntax
  5. Execution: Run SPARQL on RDF store and return results

RDF Store Integration

AllegroGraph

executor = SPARQLExecutor(
    store_type=StoreType.ALLEGROGRAPH,
    endpoint="http://localhost:10035/repositories/myrepo/sparql",
    username="user",
    password="pass"
)

Apache Jena Fuseki

executor = SPARQLExecutor(
    store_type=StoreType.FUSEKI,
    endpoint="http://localhost:3030/dataset/sparql"
)

Blazegraph

executor = SPARQLExecutor(
    store_type=StoreType.BLAZEGRAPH,
    endpoint="http://localhost:9999/blazegraph/sparql"
)

Testing

Unit Tests

# Run all tests
pytest

# Run specific test file
pytest tests/test_converter.py

# Run with coverage
pytest --cov=sql2sparql

Integration Tests with Northwind Dataset

# Set up SPARQL endpoint first (see SPARQL_ENDPOINT_SETUP.md)
cd datasets

# Test basic functionality
python test_northwind.py

# Test GROUP BY queries
python test_simple_groupby.py

# Test complex aggregates
python test_complex_aggregates.py

References

This implementation was inspired by concepts from:

  • ABATAL et al. (2019). "SQL2SPARQL4RDF: Automatic SQL to SPARQL Conversion for RDF Querying". IJACSA, Vol. 10, No. 11.
