Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 8 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,14 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0

## [Unreleased]

## [v1.0.1] - 2026-01-07

### Changed
- Updated project links in web app and README

### Added
- Added best TIMIT model fine-tuned on Buckeye to model list in web app

## [v1.0.0] - 2025-12-16

### Added
Expand Down
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,11 +2,11 @@

Automatically transcribe audio into the International Phonetic Alphabet (IPA) and perform forced alignment. This toolkit includes a command line interface, comparison tools, and interactive web tool.

The AutoIPA project is a collaboration between Virginia Partridge of the UMass Center for Data Science and Artificial
The Wav2IPA project is a collaboration between Virginia Partridge of the UMass Center for Data Science and Artificial
Intelligence and Joe Pater of UMass Linguistics. Its goal is to make automated IPA transcription more useful
to linguists (and others!).
Please reach out to us if you have any questions or comments about our work or have related work to share!
More details are on our [project website](https://websites.umass.edu/comphon/autoipa-automated-ipa-transcription/).
More details are on our [project website](https://websites.umass.edu/comphon/wav2ipa-automated-ipa-transcription/).

If you use our software, please cite our AMP paper:

Expand Down
2 changes: 1 addition & 1 deletion pyproject.toml
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
[project]
name = "autoipaalign"
version = "1.0.0"
version = "1.0.1"
description = "Automatic IPA alignment workspace with CLI, comparison tools, and web interface"
requires-python = ">=3.10,<3.14"
readme = "README.md"
Expand Down
7 changes: 4 additions & 3 deletions src/autoipaalign/web/app.py
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@
TEXTGRID_DOWNLOAD_TEXT = "Download TextGrid file"
TEXTGRID_NAME_INPUT_LABEL = "TextGrid file name"

TITLE = "AutoIPA: Automated IPA transcription"
TITLE = "Wav2IPA: Automated IPA transcription"

INTRO_BLOCK = f"""# {TITLE}
Experiment with producing
Expand All @@ -26,7 +26,7 @@
Our first step was to fine-tune a Wav2Vec 2.0 model on the Buckeye corpus, which you can try out here.
Our next steps will be to extend our work to other varieties of English and other languages.
Please reach out to us if you have any questions or comments about our work or have related work to share!
More details are on our [project website](https://websites.umass.edu/comphon/autoipa-automated-ipa-transcription/).
More details are on our [project website](https://websites.umass.edu/comphon/wav2ipa-automated-ipa-transcription/).

If you use our software, please cite our AMP paper:
Partridge, Virginia, Joe Pater, Parth Bhangla, Ali Nirheche and Brandon Prickett. 2025/to appear. [AI-assisted analysis of phonological variation in English](https://docs.google.com/presentation/d/1IJrfokvX5T_fKkiFXmcYEgRI2ZRwgFU4zU1tNC-iYl0/edit?usp=sharing). Special session on Deep Phonology, AMP 2025, UC Berkeley. To appear in the Proceedings of AMP 2025.
Expand All @@ -52,6 +52,7 @@
"ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
"excalibur12/wav2vec2-large-lv60_phoneme-timit_english_timit-4k",
"excalibur12/wav2vec2-large-lv60_phoneme-timit_english_timit-4k_simplified",
"ginic/wav2vec2-large-lv60_phoneme-timit_english_timit-4k_buckeye-4k_bs32_3",
"ginic/full_dataset_train_1_wav2vec2-large-xlsr-53-buckeye-ipa",
"ginic/full_dataset_train_2_wav2vec2-large-xlsr-53-buckeye-ipa",
"ginic/full_dataset_train_3_wav2vec2-large-xlsr-53-buckeye-ipa",
Expand Down Expand Up @@ -240,7 +241,7 @@ def launch_demo():
VALID_MODELS,
value=DEFAULT_MODEL,
label="IPA transcription ASR model",
info="Select the model to use for prediction.",
info="Select the model to use for prediction. For details about each one, visit its model page on the HuggingFace Hub",
)

# Dropdown for transcription type selection
Expand Down
2 changes: 1 addition & 1 deletion uv.lock

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Loading