Add anndata-duckdb extension #327
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Adding anndata-duckdb
A DuckDB extension for reading AnnData (.h5ad) files, providing SQL access to single-cell genomics data.
Mandatory
Name of the tool: anndata-duckdb
Short description: DuckDB extension that provides SQL access to AnnData (.h5ad) files, enabling queries on obs, var, X matrix, dimensional reductions, layers, and unstructured metadata.
How does the package use scverse data structures: The extension reads AnnData HDF5 files directly, providing SQL table access to all standard AnnData components: observation metadata (obs), variable metadata (var), expression matrix (X), dimensional reductions (obsm/varm like PCA/UMAP), pairwise matrices (obsp/varp), alternative expression layers, and unstructured metadata (uns).
INSTALL anndata FROM community; LOAD anndata)Recommended
Note on installation: This is a DuckDB extension, not a Python package. The
installfield in meta.yaml listsduckdb(available on PyPI and conda-forge) since the schema doesn't have a DuckDB extension registry option. It can be used from python like any duckdb extension.Users can install the DuckDB CLI, then run
INSTALL anndata FROM community;from within DuckDB to install and load the extension from the DuckDB community extension repository.