Refactor order of getting metadata and adding a stream #1060

scotts · 2025-11-18T02:27:14Z

I've thought this was strange for a long time now - on main, in the public VideoDecoder and AudioDecoder, we add a stream before getting the metadata. This was not the originally intended order, as evidenced by some of the error checking we do:

torchcodec/src/torchcodec/decoders/_video_decoder.py

Lines 409 to 414 in 22bcf4d

    
    if stream_index is None:
        if (stream_index := container_metadata.best_video_stream_index) is None:
            raise ValueError(
                "The best video stream is unknown and there is no specified stream. "
                + ERROR_REPORTING_INSTRUCTIONS
            )

We should never hit that error condition, because we add the stream before we reach it. If the video file has no best video stream, the C++ layer would have thrown before we ever had a chance to reach this condition. I feel that it's more natural to do things in the order in this PR: first get the metadata from the file, then add the stream if the metadata is valid.
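A minimal sketch of that order, to make the control flow concrete. The names `ContainerMetadata`, `resolve_stream_index`, `open_video`, and the `add_stream` callback are hypothetical stand-ins for illustration, not the actual torchcodec code:

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ContainerMetadata:
    # Hypothetical stand-in for metadata read from the file headers.
    best_video_stream_index: Optional[int]
    num_streams: int


def resolve_stream_index(
    container_metadata: ContainerMetadata, stream_index: Optional[int] = None
) -> int:
    """Validate the requested stream against the metadata alone."""
    if stream_index is None:
        stream_index = container_metadata.best_video_stream_index
        if stream_index is None:
            # Reachable now, because no stream has been added yet.
            raise ValueError(
                "The best video stream is unknown and there is no specified stream."
            )
    if not 0 <= stream_index < container_metadata.num_streams:
        raise ValueError(f"stream_index={stream_index} is out of bounds.")
    return stream_index


def open_video(
    container_metadata: ContainerMetadata,
    add_stream: Callable[[int], None],
    stream_index: Optional[int] = None,
) -> int:
    # Step 1: validate using metadata that is already in hand.
    index = resolve_stream_index(container_metadata, stream_index)
    # Step 2: only add the stream once the metadata checks out.
    add_stream(index)
    return index
```

With this shape, a file without a best video stream fails in the Python-level check, and `add_stream` is never called for it.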

I'm doing this now because it should simplify the decoder-native transforms. We'll want to know a video stream's height and width when pre-processing the transforms, which happens before adding a stream, so we need that metadata first. In the C++ layer, this does mean that initializeDecoder() now reads header values through AVCodecParameters that it didn't read before.
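To illustrate why the dimensions are needed up front, here is a rough sketch that assumes a center-crop transform. The crop itself, `VideoStreamMetadata`, and `center_crop_origin` are all assumptions made for this example; the real decoder-native transforms may look quite different:

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class VideoStreamMetadata:
    # Hypothetical subset of the header-based stream metadata.
    height: int
    width: int


def center_crop_origin(
    metadata: VideoStreamMetadata, crop_height: int, crop_width: int
) -> Tuple[int, int]:
    """Return (top, left) of a centered crop, rejecting crops larger than the frame."""
    if crop_height > metadata.height or crop_width > metadata.width:
        raise ValueError(
            f"Crop {crop_height}x{crop_width} does not fit in "
            f"{metadata.height}x{metadata.width} frames."
        )
    top = (metadata.height - crop_height) // 2
    left = (metadata.width - crop_width) // 2
    return top, left
```

Neither the bounds check nor the crop origin can be computed without the stream's height and width, so a check like this can only run before the stream is added if the metadata is available first.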

NicolasHug · 2025-11-18T10:30:17Z

src/torchcodec/_core/SingleStreamDecoder.cpp

+  // This metadata was already set in initializeDecoder() from the
+  // AVCodecParameters that are part of the AVStream. But we consider the
+  // AVCodecContext to be more authoritative, so we use that for our decoding
+  // stream.


From what I understand, the AVCodecContext fields were set to those of the AVCodecParameters when we called avcodec_parameters_to_context just above in addStream:

torchcodec/src/torchcodec/_core/SingleStreamDecoder.cpp

Lines 462 to 463 in 22bcf4d

int retVal = avcodec_parameters_to_context(
    streamInfo.codecContext.get(), streamInfo.stream->codecpar);

I think it's best to remove the lines below and trust that avcodec_parameters_to_context is doing what we expect it to do. Right now, we are setting the streamMetadata in a lot of different places, which makes it harder to reason about.

Oh, good call! Yup, I'm happy to remove more code. :)

NicolasHug · 2025-11-18T14:48:45Z

src/torchcodec/_core/FFMPEGCommon.h


 int getNumChannels(const UniqueAVFrame& avFrame);
 int getNumChannels(const SharedAVCodecContext& avCodecContext);
+int getNumChannels(const AVCodecParameters* codecpar);


We may not need the one above anymore, I'm not sure.

We're using it in CpuDeviceInterface:

torchcodec/src/torchcodec/_core/CpuDeviceInterface.cpp

Line 325 in 22bcf4d

int srcNumChannels = getNumChannels(codecContext_);

And:

torchcodec/src/torchcodec/_core/CpuDeviceInterface.cpp

Line 416 in 22bcf4d

audioStreamOptions_.numChannels.value_or(getNumChannels(codecContext_));

Maybe we could instead go to the metadata, but I think it's better to get it through the codec context there. I think it's possible for the codec context to have more accurate info while decoding. At least I believe that is the case for video dimensions, as we can have variable-resolution streams. I'm unsure if that's also true for the number of audio channels. If you can say definitively that it's okay to use the header-based metadata in both of these places, I can make the change. But if you're unsure, let's handle that later; I can create an issue for follow-up.

NicolasHug · 2025-11-18T16:27:07Z

oh, I approved but I wonder if the docs failure is real 🤔

 File "/__w/_temp/conda_environment_19470203429/lib/python3.10/site-packages/torchcodec/decoders/_video_decoder.py", line 423, in _get_and_validate_stream_metadata

    raise ValueError(
ValueError: The minimum pts value in seconds is unknown. 
This should never happen. Please report an issue following the steps in


Refactor order of getting metadata and adding a stream

84fd7a5

meta-cla bot added the CLA Signed label Nov 18, 2025

scotts marked this pull request as ready for review November 18, 2025 03:00

NicolasHug reviewed Nov 18, 2025


Remove re-setting of metadata

8e0b756

NicolasHug approved these changes Nov 18, 2025


Merge branch 'main' of github.com:pytorch/torchcodec into refactor_me…

7832834

…tadata_order
