[wip] Decode WAV with C++ backend by Dan-Flores · Pull Request #1221 · meta-pytorch/torchcodec

Dan-Flores · 2026-02-04T20:17:25Z

To review:

First, see the changes to the public AudioDecoder class.
- This implementation changes the _audio_decoder.py Python file. To completely contain changes in C++ requires more complex changes, ex. implementing a Decoder class that SingleStreamDecoder and WavDecoder implement.
Read _is_uncompressed_wav, and read a function it dispatches to, like get_wav_metadata_from_file.
- Skim through the input type handler classes, WavFileReader and WavTensorReader
Read WavDecoder::WavDecoder init implementation, follow it to parseHeader.
Wonder at convertSamplesToFloat
- Using tensor operations here reduces the performance gains. This implementation does the conversion in a single pass.

pytorch-bot · 2026-02-04T20:17:29Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/meta-pytorch/torchcodec/1221

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

Lint/Mac jobs intermittently fail

❌ 1 New Failure

As of commit c01d370 with merge base 377c638 ():

NEW FAILURE - The following job has failed:

Lint / mypy (3.12) (gh)
Process completed with exit code 1.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Dan-Flores · 2026-02-05T05:19:26Z

src/torchcodec/decoders/_audio_decoder.py

+            self._desired_sample_rate = metadata["sampleRate"]
+            self._decoder = None  # type: ignore[assignment]
+            self.metadata = AudioStreamMetadata.from_json(metadata)
+            return


The AudioDecoder exposes the input audio's metadata to the user. Using a C++ backend without FFmpeg, we pass over the information necessary to create AudioStreamMetadata from the C++ side. This implementation uses JSON, but its possible to pass each field individually and construct an AudioStreamMetadata object here as well.

Dan-Flores · 2026-02-05T05:19:27Z

src/torchcodec/decoders/_audio_decoder.py

+                pts_seconds=0.0,
+                duration_seconds=metadata["durationSeconds"],
+                sample_rate=metadata["sampleRate"],
+            )


self._wav_source is only populated if WAV decoding was successful.

Dan-Flores · 2026-02-10T06:13:44Z

src/torchcodec/_core/WavDecoder.cpp

+  if (!checkFourCC(data + 8, "WAVE")) {
+    throw std::runtime_error("Missing WAVE format identifier");
+  }
+


Once we find the RIFF and WAVE signatures, we can look for the fmt chunk which contains metadata, and the data chunk which contains the actual samples. We find the data chunk now to store dataSize and dataOffset, which will be needed later for decoding.

Dan-Flores added 4 commits February 3, 2026 19:59

changes

414a353

renaming

d8bda2d

reuse mapToJson for metadata

70f774a

add comments

d57f06a

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Feb 4, 2026

Dan-Flores commented Feb 5, 2026

View reviewed changes

Dan-Flores added 3 commits February 9, 2026 16:01

refactor to not consume entire file upfront

05664a4

sample_format metadata map

055dfc0

add comment to convertSamplesToFloat

c01d370

Dan-Flores commented Feb 10, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[wip] Decode WAV with C++ backend#1221

[wip] Decode WAV with C++ backend#1221
Dan-Flores wants to merge 7 commits intometa-pytorch:mainfrom
Dan-Flores:wav-cpp

Dan-Flores commented Feb 4, 2026 •

edited

Loading

Uh oh!

pytorch-bot bot commented Feb 4, 2026 •

edited

Loading

Uh oh!

Dan-Flores Feb 5, 2026 •

edited

Loading

Uh oh!

Dan-Flores Feb 5, 2026 •

edited

Loading

Uh oh!

Dan-Flores Feb 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Dan-Flores commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/meta-pytorch/torchcodec/1221

❗ 1 Active SEVs

❌ 1 New Failure

Uh oh!

Dan-Flores Feb 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Dan-Flores Feb 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Dan-Flores Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Dan-Flores commented Feb 4, 2026 •

edited

Loading

pytorch-bot bot commented Feb 4, 2026 •

edited

Loading

Dan-Flores Feb 5, 2026 •

edited

Loading

Dan-Flores Feb 5, 2026 •

edited

Loading