`forward_sliding`  behavior is confusing

The output shape of `forward_sliding` varies depending on the number of input frames T.
- When T > 2, it performs normal tracking and returns a flow of shape (B, T, 2, H, W) for each frame.
- When T == 2, it performs optical flow inference and returns a single flow map of shape (B, 2, H, W).

I assume this design is intended to optimize pair-wise optical flow inference. However, the behavior is confusing and inconsistent from a video-tracking perspective. It requires handling a special case when the sequence contains only two frames.

It would be clearer to separate the two purposes into distinct functions, such as `forward_sliding` and `forward_pair`. The `forward_sliding` function should consistently return the same number of frames as the input sequence.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`forward_sliding` behavior is confusing #15

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

forward_sliding behavior is confusing #15

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions

`forward_sliding` behavior is confusing #15