processor-post-timeseries

A processor for converting NWB (Neurodata Without Borders) files into chunked timeseries data for the Pennsieve platform.

Overview

This processor reads electrical series data from NWB files and:

Extracts channel data with proper scaling (conversion factors, offsets)
Writes chunked binary files (gzip-compressed, big-endian float64)
Generates channel metadata files (JSON)
Optionally uploads the processed data to Pennsieve via the import API

Architecture

main.py - Entry point that orchestrates the processing pipeline.

reader.py - NWBElectricalSeriesReader reads NWB ElectricalSeries data, handles timestamps and sampling rates, applies conversion factors and offsets, and detects contiguous data chunks.

writer.py - TimeSeriesChunkWriter writes chunked binary data (.bin.gz) and channel metadata (.metadata.json) in big-endian format.

importer.py - Creates import manifests via Pennsieve API and uploads files to S3 via presigned URLs.

clients/ - API clients for Pennsieve:

AuthenticationClient - AWS Cognito authentication
ImportClient - Import manifest creation and file upload
TimeSeriesClient - Time series channel management
WorkflowClient - Analytic workflow instance management
BaseClient - Session management with auto-refresh

Setup

Prerequisites

Python 3.10+
Docker (for local runs)

Create Virtual Environment

make venv
source venv/bin/activate

Install Dependencies

make install

Development

Install Pre-commit Hooks

This installs git hooks that automatically lint and format code on commit.

make pre-commit

Run Tests

make test

Run Tests with Coverage

make test-cov

Run Linter

Runs ruff with auto-fix and formatting.

make lint

Running Locally

1. Configure Environment

Configure the environment file

Edit dev.env with your settings:

ENVIRONMENT=local
INPUT_DIR=/data/input
OUTPUT_DIR=/data/output
CHUNK_SIZE_MB=1
IMPORTER_ENABLED=false
...

2. Add Input File

Place your .nwb file in the data/input/ directory:

cp /path/to/your/file.nwb data/input/

3. Run the Processor

make run

This builds and runs the processor via Docker. Output files will be written to data/output/.

4. Clean Up

Remove input/output files:

make clean

Output Format

The processor generates two types of files per channel:

Binary Data Files

Pattern: channel-{index}_{start_us}_{end_us}.bin.gz
Format: Gzip-compressed big-endian float64 values
Example: channel-00001_1000000_2000000.bin.gz

Metadata Files

Pattern: channel-{index}.metadata.json
Contains: name, rate, start, end, unit, type, group, properties

Environment Variables

Variable	Description	Default
`ENVIRONMENT`	Runtime environment (`local` or `production`)	`local`
`INPUT_DIR`	Directory containing NWB files	-
`OUTPUT_DIR`	Directory for output files	-
`CHUNK_SIZE_MB`	Size of each data chunk in MB	`1`
`IMPORTER_ENABLED`	Enable Pennsieve upload	`false`
`PENNSIEVE_API_KEY`	Pennsieve API key	-
`PENNSIEVE_API_SECRET`	Pennsieve API secret	-
`PENNSIEVE_API_HOST`	Pennsieve API endpoint	`https://api.pennsieve.net`
`PENNSIEVE_API_HOST2`	Pennsieve API2 endpoint	`https://api2.pennsieve.net`
`INTEGRATION_ID`	Workflow instance ID	-

License

See LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
.github/workflows		.github/workflows
processor		processor
scripts		scripts
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
dev.env		dev.env
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements-test.txt		requirements-test.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

processor-post-timeseries

Overview

Architecture

Setup

Prerequisites

Create Virtual Environment

Install Dependencies

Development

Install Pre-commit Hooks

Run Tests

Run Tests with Coverage

Run Linter

Running Locally

1. Configure Environment

2. Add Input File

3. Run the Processor

4. Clean Up

Output Format

Binary Data Files

Metadata Files

Environment Variables

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

Pennsieve/processor-post-timeseries

Folders and files

Latest commit

History

Repository files navigation

processor-post-timeseries

Overview

Architecture

Setup

Prerequisites

Create Virtual Environment

Install Dependencies

Development

Install Pre-commit Hooks

Run Tests

Run Tests with Coverage

Run Linter

Running Locally

1. Configure Environment

2. Add Input File

3. Run the Processor

4. Clean Up

Output Format

Binary Data Files

Metadata Files

Environment Variables

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages