TODO:

- [ ] better infrastructure, e.g. benchmark per Python version
- [ ] more cases, e.g. failed CAS, `get_and_set`