
Conversation

@Ja-Gk-00
Contributor

Added benchmarks to the test suite for measuring the performance of flat serialization/deserialization.

Ja-Gk-00 marked this pull request as draft on November 19, 2025 08:50
@codecov

codecov bot commented Nov 20, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.

Files with missing lines                      | Coverage Δ
pyjelly/integrations/generic/generic_sink.py | 100.00% <100.00%> (ø)

Ja-Gk-00 marked this pull request as ready for review on November 20, 2025 14:06
@Ja-Gk-00
Contributor Author

Ok, here's a snippet of some provisional results (they were executed on a relatively small file, so they may not be the most accurate; this is just to show that running the benchmarks works).
[image: bench_results]


def parse(self, input_file: IO[bytes]) -> None:
    from pyjelly.integrations.generic.parse import parse_jelly_to_graph
    from pyjelly.integrations.generic.parse import (
Member

Please don't make irrelevant changes in PRs. Please revert this and all other changes like it.

"mypy>=1.8; platform_python_implementation == 'CPython'",
"hatchling>=1.24",
"hatch-mypyc; platform_python_implementation == 'CPython'",
"mypy>=1.8; platform_python_implementation == 'CPython'",
Member

same

# version 3.1 required for python 3.14 support
ci = ["cibuildwheel>=3.1.0,<4 ; python_version >= '3.11'"]

# version 3.11 required for python 3.14 support
Member

what?


# version 3.11 required for python 3.14 support
ci = ['cibuildwheel>=3.1.0,<4 ; python_version >= "3.11"']
bench = ["pytest-benchmark>=5.2.1", "rdflib>=7.1.4"]
Member

Please rename to "benchmark" to make it clearer what this is.

Member

maybe just "benchmarks" instead of "benchmark_tests"?

"--in-jelly-quads",
type=str,
default=None,
help="optional Jelly quads file; if none, generated in-memory from nq slice.",
Member

What is an "nq slice"?

g.addoption("--iterations", type=int, default=1, help="iterations per round.")


def _slice_lines_to_bytes(path: Path, limit: int) -> bytes:
Member

There are no comments here or anywhere else, again. This makes the code rather hard to review.

Please make this code readable, and then I will review it again.
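
For illustration only, a commented version of a line-slicing helper might look roughly like the sketch below. The actual behaviour of the PR's _slice_lines_to_bytes is an assumption here; only the function name and signature come from the diff above.

# Hypothetical sketch (not the PR's implementation): read at most `limit`
# lines from an N-Triples/N-Quads file and return them as raw bytes, so the
# benchmarks operate on a bounded slice of a potentially large input file.
from pathlib import Path


def _slice_lines_to_bytes(path: Path, limit: int) -> bytes:
    lines: list[bytes] = []
    with path.open("rb") as handle:
        for index, line in enumerate(handle):
            if index >= limit:
                break
            lines.append(line)
    return b"".join(lines)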


@pytest.fixture(scope="session")
def nt_graph(nt_bytes_sliced: bytes) -> Graph:
    g = Graph()
Member

Why do you even use Graph? For buffering in-memory you must use an array of statements, otherwise you will get nonsensical results. Same with Dataset, of course.
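
As an illustration of this suggestion, a fixture that buffers statements as a plain list (rather than returning the Graph itself) could look something like the sketch below; the fixture name and body are assumptions, and only nt_bytes_sliced comes from the PR.

# Hedged sketch, not the PR's code: parse once during fixture setup and hand
# the benchmark a flat list of statements, so rdflib Graph indexing overhead
# stays out of the measured section.
import pytest
from rdflib import Graph


@pytest.fixture(scope="session")
def nt_statements(nt_bytes_sliced: bytes) -> list:
    g = Graph()
    g.parse(data=nt_bytes_sliced, format="nt")
    return list(g)  # materialize as a plain array of (s, p, o) tuples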

    pedantic_cfg: dict[str, int],
    limit_statements: int,
) -> None:
    benchmark.pedantic(parse_nt_bytes, args=(nt_bytes_sliced,), **pedantic_cfg)
Member

Here you are measuring not the parsing speed but the speed at which rdflib can insert statements into the Graph. This is meaningless. You must only iterate over the resulting triples/quads, nothing else.
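
A minimal sketch of the shape being asked for, assuming the parser under test exposes (or can be wrapped as) an iterator of statements; iter_statements below is a toy stand-in, not a real pyjelly API, and the test name is hypothetical.

# Hedged sketch: the benchmark body drains an iterator of statements and does
# nothing else, so the timing reflects parsing alone rather than Graph
# insertion. iter_statements is a trivial stand-in for the parser under test.
import io
from collections import deque
from typing import IO, Iterator


def iter_statements(stream: IO[bytes]) -> Iterator[tuple]:
    for line in stream:
        yield tuple(line.split(b" ", 2))  # placeholder for real parsing


def consume(raw: bytes) -> None:
    deque(iter_statements(io.BytesIO(raw)), maxlen=0)  # exhaust, don't store


def test_parse_only(benchmark, nt_bytes_sliced: bytes, pedantic_cfg: dict) -> None:
    benchmark.pedantic(consume, args=(nt_bytes_sliced,), **pedantic_cfg)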

@Ostrzyciel
Member

> Ok, here's a snippet of some provisional results (they were executed on a relatively small file, so they may not be the most accurate; this is just to show that running the benchmarks works). [image: bench_results]

These results don't make any sense.

After you fix the issues I pointed out, please run the benchmarks again until you get results that actually make sense.

Please do NOT put serialization and deserialization results in the same table; it makes the table hard to understand. Just run the two benchmarks separately.
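
One way to keep the two result tables apart, assuming pytest-benchmark is driving the runs, is to put the serialization and parsing benchmarks in separate benchmark groups (or simply run them in separate pytest invocations). The work functions below are trivial placeholders, not the PR's benchmarks.

# Hedged sketch: each pytest-benchmark group gets its own table in the report,
# so serialization and parsing results never end up interleaved.
import pytest


def _noop_serialize(statements: list) -> int:
    return len(statements)  # placeholder for the real serialization call


def _noop_parse(raw: bytes) -> int:
    return len(raw)  # placeholder for the real parsing call


@pytest.mark.benchmark(group="serialize")
def test_serialize(benchmark) -> None:
    benchmark(_noop_serialize, [("s", "p", "o")])


@pytest.mark.benchmark(group="parse")
def test_parse(benchmark) -> None:
    benchmark(_noop_parse, b"<s> <p> <o> .\n")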
