Durability (& reliability) versus performance

First I'd like to say I like the decision to strictly conform to a (modern) SQL standard. I think this might be one of the selling points - "it was tested against vsql" would have a huge meaning in the bright future I envision :wink:.

Here I'd like to discuss probably the biggest PITA any DB has to clearly decide and define. It's durability guarantees, i.e. a "fault model". This also determines the maximum achievable performance of the DB due to physical limitations.

So I always envisioned the following:

1. to have a DB API requiring one to specify on **per-request** basis whether it shall be treated as **best-effort** (i.e. no guarantee but presumably faster and with lower delays) or as **guaranteed** operation. See e.g. my rant https://github.com/redis/redis/issues/6200#issue-464233707 .

    Note that best effort **might** to lead incorrect data compared to the surrounding global context but it **must not** lead to internally incorrect data within a transaction - e.g. if in one transaction I fetch two different rows from two different tables it still **must** guarantee that the data (i.e. both rows) returned are exactly the data from the point in time when the transaction began (i.e. mutually correct).

2. Guarantee durability in the very maximum technically possible way POSIX allows us to and the knowledge of HW-originated errors allows us to. Namely the same *fault model* as [TigerBeetle](https://github.com/coilhq/tigerbeetle/blob/main/docs/DESIGN.md#fault-models ) has. SQLite3 allows almost for what TigerBeetle does (e.g. commit **both** the parent directory structure as well as the DB file itself and then fsync) but there are still some rough corners.

The first step would be to focus on calling `fsync` **and** waiting for it to finish (see below) after each transaction which has the potential to modify data.

Note also that Linux `fsync` is synchronous (and will stay so in the upcoming years) but on many other systems it's asynchronous with system-specific calls to wait for its return.

What do you think about implementing both points (1) and (2)?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Durability (& reliability) versus performance #88

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Durability (& reliability) versus performance #88

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions