Volume Analyzer

A preprocessing tool that converts audio recordings into servo movement instructions for use in animatronic systems like Samuel the Raven. It transforms audio files (e.g., raven calls) into time-synchronized binary sequences, where 1 indicates a servo should move (e.g., mouth open) and 0 means no movement.

This tool allows the creation of multiple motion maps per audio file using different clustering thresholds, enabling randomized selection at runtime for more natural and expressive behavior.

What It Does

Loads MP3 audio using Librosa
Computes RMS energy to measure loudness over time
Applies thresholding to detect meaningful sound events
Groups events into "clusters" representing vocalizations
Converts clusters into binary open/close movement lists
Outputs multiple maps per file for varied expressiveness
(Optional) Generates development graphs to visualize audio features

Installation

git clone https://github.com/Anatw/volume-analyzer.git
cd volume-analyzer
pip install .

Basic Usage

from volume_analyzer import generate_clusters_for_servo_usage

clusters = generate_clusters_for_servo_usage(
    directory="raven_sounds",
    thresholds=[10, 70, 150],  # or use default
    print_clusters_details=False
)

This will return a nested dictionary structure:

{
  "head_pat5.mp3": {
    "0": [0, 0, 1, 1, ..., 0],
    "1": [0, 1, 1, 1, ..., 0],
    "2": [...],
  }
}

Each list contains 1 and 0 values—one per RMS frame.

Key Functions

`analysed_normalized_rms_dict()`

Loads and analyzes MP3 audio
Returns normalized RMS energy values per file

`exceeding_indexes_clusters()`

Applies thresholding to RMS data
Groups adjacent loud segments into clusters
Supports multiple distance_allowed_between_clusters settings

`generate_clusters_for_servo_usage()`

Converts clusters into binary movement maps (1=open, 0=closed)
Returns multiple maps per file with varying cluster sensitivities

Developer Graphs (Optional)

You can visualize RMS, volume, and power spectrograms during development:

from volume_analyzer import generate_graphs

generate_graphs(
    generate_rms=True,
    generate_power=True,
    generate_volume=True
)

Example Output

Used In

This tool was developed to support the animatronic character Samuel the Raven. It helps generate realistic, dynamic movement synced to pre-recorded raven calls.

License

MIT License

✍️ Author

Developed by Anat Wax Part of the Animatronic Menagerie

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
music_volume_analyser.py		music_volume_analyser.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Volume Analyzer

What It Does

Installation

Basic Usage

Key Functions

`analysed_normalized_rms_dict()`

`exceeding_indexes_clusters()`

`generate_clusters_for_servo_usage()`

Developer Graphs (Optional)

Example Output

Used In

License

✍️ Author

About

Uh oh!

Releases

Packages

Languages

License

Anatw/volume-analyzer

Folders and files

Latest commit

History

Repository files navigation

Volume Analyzer

What It Does

Installation

Basic Usage

Key Functions

analysed_normalized_rms_dict()

exceeding_indexes_clusters()

generate_clusters_for_servo_usage()

Developer Graphs (Optional)

Example Output

Used In

License

✍️ Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`analysed_normalized_rms_dict()`

`exceeding_indexes_clusters()`

`generate_clusters_for_servo_usage()`

Packages