blech_clust

Python and R based code for clustering and sorting electrophysiology data recorded using the Intan RHD2132 chips. Originally written for cortical multi-electrode recordings in Don Katz's lab at Brandeis University.

📚 Full Documentation | 🚀 Getting Started | 📖 Tutorials | 🔧 API Reference

Features

Automated Spike Sorting: Complete pipeline from raw Intan data to sorted units
EMG Analysis: BSA/STFT frequency analysis and QDA-based gape detection
Quality Assessment: Built-in drift detection, unit similarity analysis, and dataset grading
Parallel Processing: Optimized for HPC environments
Comprehensive Documentation: Detailed guides, tutorials, and API reference

Quick Start

⚠️ Platform Support: blech_clust is primarily tested and supported on Linux. The make-based installation process works only on Linux systems. Windows users should install a tested version of Ubuntu via WSL (Windows Subsystem for Linux).

# Clone the repository
git clone https://github.com/katzlabbrandeis/blech_clust.git
cd blech_clust

# Install everything
make all

# Activate the environment
conda activate blech_clust

# Run the pipeline
python blech_exp_info.py /path/to/data
bash blech_autosort.sh /path/to/data

# Batch processing multiple directories
bash blech_autosort_batch.sh /path/to/dir1 /path/to/dir2 /path/to/dir3
# Or using a file with directory paths
bash blech_autosort_batch.sh directories.txt

For detailed instructions, see the Getting Started Guide.

Tested Platforms

blech_clust is regularly tested on the following platforms:

Linux Distribution	Python Versions
Ubuntu 20.04	3.8, 3.9, 3.10, 3.11
Ubuntu 22.04	3.8, 3.9, 3.10, 3.11
Ubuntu 24.04	3.8, 3.9, 3.10, 3.11

Note: While other Linux distributions may work, only the above combinations are actively tested in our CI pipeline.

Documentation

For comprehensive documentation, visit katzlabbrandeis.github.io/blech_clust

For building documentation locally:

pip install -r requirements/requirements-docs.txt
mkdocs serve

Pipeline Overview

Main Spike-Sorting Pipeline

The following diagram shows the complete operations workflow for the blech_clust pipeline:

Detailed Pipeline Steps

blech_exp_info.py - Pre-clustering step to annotate channels and save experimental parameters
blech_init.py - Initialize directories and prepare data for clustering
blech_common_avg_reference.py - Perform common average referencing
blech_run_process.sh - Parallel spike extraction and clustering
blech_post_process.py - Add selected units to HDF5 file
blech_units_plot.py - Plot waveforms of selected spikes
blech_make_arrays.py - Generate spike-train arrays
blech_run_QA.sh - Quality assurance checks
blech_units_characteristics.py - Analyze unit characteristics
blech_data_summary.py - Generate comprehensive dataset summary
grade_dataset.py - Grade dataset quality based on metrics

Nomnoml Schema

Copy and paste the following code into nomnoml.com to generate the complete workflow diagram:

Spike Sorting
[blech_exp_info] -> [blech_init]
[blech_init] -> [blech_common_average_reference]
[blech_common_average_reference] -> [bash blech_run_process.sh]
[bash blech_run_process.sh] -> [blech_post_process]
[blech_post_process] -> [blech_units_plot]
[blech_units_plot] -> [blech_make_arrays]
[blech_make_arrays] -> [bash blech_run_QA.sh]
[bash blech_run_QA.sh] -> [blech_unit_characteristics]
[blech_unit_characteristics] -> [blech_data_summary]
[blech_data_summary] -> [grade_dataset]

EMG shared
[blech_init] -> [blech_make_arrays]
[blech_make_arrays] -> [emg_filter]

BSA/STFT
[emg_filter] -> [emg_freq_setup]
[emg_freq_setup] -> [bash blech_emg_jetstream_parallel.sh]
[bash blech_emg_jetstream_parallel.sh] -> [emg_freq_post_process]
[emg_freq_post_process] -> [emg_freq_plot]

QDA (Jenn Li)
[emg_freq_setup] -> [get_gapes_Li]

EMG Analysis

Shared Steps:

Complete spike sorting through blech_make_arrays.py
emg_filter.py - Filter EMG signals

BSA/STFT Branch:

Bayesian Spectrum Analysis and Short-Time Fourier Transform for frequency analysis

QDA Branch:

Quadratic Discriminant Analysis for gape detection (based on Li et al.'s methodology)

See the Tutorials for detailed guides on using these features. See the Workflow Diagram for a visual representation of the pipeline.

Test Dataset

Test data available at: Google Drive

Contributing

We welcome contributions! Please read CONTRIBUTING.md for guidelines.

Community and Support

For general questions, discussions, and community support, please use our Discourse forum.

Discourse: For things other than feature requests or bugs, use the discourse forum to ask questions, share knowledge, and crowdsource solutions with other users.
GitHub Issues: For bug reports and feature requests, please open an issue on GitHub.

If you're unsure whether your question belongs on Discourse or GitHub, start with Discourse! The community can help determine if a GitHub issue is warranted.

Citation

If you use this code in your research, please cite:

@software{blech_clust_katz,
  author       = {Mahmood, Abuzar and Mukherjee, Narendra and
                  Stone, Bradly and Raymond, Martin and
                  Germaine, Hannah and Lin, Jian-You and
                  Mazzio, Christina and Katz, Donald},
  title        = {katzlabbrandeis/blech\_clust: v1.1.0},
  month        = apr,
  year         = 2025,
  publisher    = {Zenodo},
  version      = {1.1.0},
  doi          = {10.5281/zenodo.15175273},
  url          = {https://doi.org/10.5281/zenodo.15175273}
}

Acknowledgments

This work used ACCESS-allocated resources at Brandeis University through allocation BIO230103 from the Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support (ACCESS) program, supported by U.S. National Science Foundation grants #2138259, #2138286, #2138307, #2137603, and #2138296.

License

See LICENSE for details.

Visit the Katz Lab: katzlab.squarespace.com

Name		Name	Last commit message	Last commit date
Latest commit History 2,287 Commits
.devcontainer		.devcontainer
.github/workflows		.github/workflows
demos		demos
docs		docs
emg		emg
example_meta_files		example_meta_files
params		params
pipeline_testing		pipeline_testing
requirements		requirements
tests		tests
utils		utils
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
__init__.py		__init__.py
blech_autosort.sh		blech_autosort.sh
blech_autosort_batch.sh		blech_autosort_batch.sh
blech_clean_slate.py		blech_clean_slate.py
blech_clust_post.sh		blech_clust_post.sh
blech_clust_pre.sh		blech_clust_pre.sh
blech_common_avg_reference.py		blech_common_avg_reference.py
blech_exp_info.py		blech_exp_info.py
blech_init.py		blech_init.py
blech_make_arrays.py		blech_make_arrays.py
blech_post_process.py		blech_post_process.py
blech_process.py		blech_process.py
blech_run_QA.sh		blech_run_QA.sh
blech_run_process.sh		blech_run_process.sh
blech_units_characteristics.py		blech_units_characteristics.py
blech_units_plot.py		blech_units_plot.py
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

blech_clust

Features

Quick Start

Tested Platforms

Documentation

For building documentation locally:

Pipeline Overview

Main Spike-Sorting Pipeline

Detailed Pipeline Steps

EMG Analysis

Test Dataset

Contributing

Community and Support

Citation

Acknowledgments

License

About

Uh oh!

Releases 3

Packages

Languages

License

katzlabbrandeis/blech_clust

Folders and files

Latest commit

History

Repository files navigation

blech_clust

Features

Quick Start

Tested Platforms

Documentation

For building documentation locally:

Pipeline Overview

Main Spike-Sorting Pipeline

Detailed Pipeline Steps

EMG Analysis

Test Dataset

Contributing

Community and Support

Citation

Acknowledgments

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Languages

Packages