Bringing cutting-edge audio transformations to life, one line of code at a time.
Welcome to the Voice Conversion repository. Here, we push the boundaries of speech processing and database interaction. This guide will help you get started with our setup. Note: Our environment now relies on Poetry (Python 3.10+). We will later switch to uv!
-
Install Poetry
- On macOS / Linux:
curl -sSL https://install.python-poetry.org | python3 - - On Windows (PowerShell):
(Invoke-WebRequest -Uri https://install.python-poetry.org -UseBasicParsing).Content | python - - For detailed instructions, see https://python-poetry.org/docs/.
- On macOS / Linux:
-
Configure Project
- Clone the repository: git clone https://github.com/YourUsername/voice-conversion.git
- Navigate to the project folder: cd voice-conversion
- Install dependencies via Poetry: poetry install
- Activate the Poetry shell (optional, but recommended for local development): poetry env activate (copy paste the output in the terminal)
Keep your local repository up-to-date:
git fetch origin
git merge origin/main
(Or, if you’re using branches, replace main with your target branch.)
(NOT YET FULLY INTEGRATED)
We currently rely on Google Cloud Firestore.
-
Install the gcloud CLI if you haven’t already.
-
Authenticate with:
gcloud auth application-default loginThis grants the necessary permissions to interact with our Firestore database.
-
Future improvements will refine this authentication step to a more permanent solution.
-
Updating Dependencies
If you add or remove libraries, simply editpyproject.tomland run:poetry updateOr, add the library directly: poetry add
Poetry will handle locking versions in
poetry.lock. -
Requirements File (Legacy)
If you still need arequirements.txtfor external use, you can generate one with:poetry export -f requirements.txt --output requirements.txtHowever, the recommended approach is to rely on Poetry’s lockfile.
- Enhanced gcloud Integration: We’ll streamline authentication for a more secure and automated experience.
- Expanded Testing: Additional unit and integration tests to ensure robust transformations.
- Performance Tuning: Profiling and optimization for large-scale voice conversion.
Any questions or suggestions?
Feel free to open an issue or reach out to the team.
Stay tuned for more updates as we refine the pipeline and push the limits of voice conversion technology!
© 2025 Voice Conversion Team – Divergence 2% Lab
