This Docker image is designed for RDF-related tasks in automated pipelines (ETL/ELT). It packages the Apache Jena stack, Node.js, and a comprehensive set of CLI tools for RDF processing, scripting, and reasoning.
It is particularly useful for:
- Automating RDF extraction, transformation, and loading (ETL).
- Running RDF validation and reasoning in CI/CD pipelines.
- Scripting complex RDF processing workflows using Node.js and specialized CLI tools.
The image contains a rich set of tools for working with RDF data:
- Node.js (v24): For JavaScript/TypeScript-based RDF processing and pipeline logic.
- Java (OpenJDK 21): Required for running Apache Jena and other JVM-based tools.
- Apache Jena(v6.x): A complete framework for building Semantic Web and Linked Data applications. Includes CLI tools like
riot,sparql, etc. - EYE Reasoning Engine(v11.x): A performant reasoning engine for N3 (Notation3).
- sophia-cli (
sop): A versatile CLI tool for RDF processing based on the Sophia toolkit.
- barnard59: A streaming RDF middleware for Node.js pipelines.
- serdi: A fast read/write library for Turtle and NTriples.
- raptor2-utils: Includes
rapperfor RDF parsing and serialization.
- Minio Client (
mc): For interacting with S3-compatible storage (useful for fetching/storing RDF dumps). - jq: A lightweight and flexible command-line JSON processor.
- s3cmd: Command line tool for S3.
The image is available on Docker Hub:
docker pull zazukoians/node-java-jenaYou can adjust and build the image according to your needs:
docker build -t zazukoians/node-java-jena .If you encounter any problems, please report issues on GitHub. Pull requests and suggestions are also welcome.