Replace `decord` with `torchcodec` #15022

hmellor · 2025-03-18T10:20:27Z

As discussed in Slack.

The documentation for torchvision.io.read_video states that PyTorch's video decoding capability will soon be centralised in torchcodec. Therefore, it makes sense to skip the intermediate step of using torchvision.

OpenCV was considered but it can only read videos using a path/url, which meant writing the bytes to disk, which was a deal breaker.

The main caveat of torchcodec at the moment is that it does not distribute ARM64 wheel for Linux, see pytorch/torchcodec#569.

FIX #15011

Signed-off-by: Harry Mellor <[email protected]>

github-actions · 2025-03-18T10:20:35Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Signed-off-by: Harry Mellor <[email protected]>

setup.py

Isotr0py

I'm fine to replace decord with torchcodec on x86 platform, but it's still meaningful to have some benchmark between torchcodec and opencv.

requirements/cpu.txt

requirements/cuda.txt

requirements/rocm-build.txt

Signed-off-by: Harry Mellor <[email protected]>

hmellor · 2025-03-19T13:42:25Z

but it's still meaningful to have some benchmark between torchcodec and opencv.

opencv can only decode videos directly from path/url. Unless we change our APIs this would mean writing the video data to disk first, which makes it a solution we cannot use anyway.

Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Isotr0py <[email protected]>

jeejeelee · 2025-03-19T14:42:09Z

maybe we can support opencv and torchcodec simultaneously

hmellor · 2025-03-19T14:47:23Z

@Isotr0py managed to get opencv working and added a way to have multiple video loaders in #15055, so yeah we could support both

Isotr0py

Otherwise LGTM!

Isotr0py · 2025-03-19T16:33:40Z

Performance comparison between `decord`, `opencv` and `torchcodec`

Tested on Intel(R) Xeon(R) Silver 4116 CPU @ 2.10GHz
Script: https://gist.github.com/Isotr0py/33c9712055b475df7784173f2a0f1de0

Num Frames	Decord	OpenCV	Torchcodec
30	2.171s	3.520s	4.291s
60	2.660s	6.717s	2.007s
120	2.007s	12.874s	2.727s
240	3.351s	24.864s	3.545s
300	3.545s	31.677s	4.237s

OpenCV has poor performance currently because we read frames one by one in iteration.

It's fine to replace decord with torchcodec, and leaving OpenCV as a fallback for aarch64 machine before torchcodec release aarch64 wheel.

Isotr0py · 2025-03-19T17:12:27Z

Well, I optimized the OpenCV implementation a bit and it's much faster now, but seems that the loaded frames have numberic difference with decord and torchcodec due to incorrect frame processing.

Script: https://gist.github.com/Isotr0py/33c9712055b475df7784173f2a0f1de0/revisions#diff-7cda49be7ad2d7b5f039fb97386b1095aa5fc85ef2e1c744514a5490de9df530

Num Frames	Decord	OpenCV	Torchcodec
30	2.697s	0.724s	4.178s
60	2.670s	0.857s	1.914s
120	3.156s	0.953s	2.450s
240	3.296s	1.621s	3.366s
300	3.454s	1.826s	3.865s

hmellor · 2025-03-19T20:08:32Z

Wow that's quite the improvement, in that case shall we just exclusively use opencv? I can't see a reason to support the other two if they are slower and do the same thing?

Isotr0py · 2025-03-20T15:20:00Z

shall we just exclusively use opencv?

I think we can use only OpenCV for video IO which already has best performance and more flexible requirements, but frames extracted from OpenCV would have numeric difference with decord due to compression standard and decoder implementation. (dmlc/decord#108)

hmellor · 2025-03-20T16:08:52Z

One advantage of torchcodec is that it works on GPU. I'm not sure if this is the case for OpenCV, but the dedicated video decoding hardware is probably enough of a reason to support both actually

Signed-off-by: Harry Mellor <[email protected]>

hmellor · 2025-03-20T16:56:51Z

Closing in favour of #15055

Replace decord with torchcodec

d7de81a

Signed-off-by: Harry Mellor <[email protected]>

mergify bot added documentation Improvements or additions to documentation ci/build multi-modality Related to multi-modality (#4194) labels Mar 18, 2025

hmellor added 5 commits March 18, 2025 12:31

Fix error

3e7e67d

Signed-off-by: Harry Mellor <[email protected]>

Add torchcodec as a requirement for cuda and cpu

6e6536f

Signed-off-by: Harry Mellor <[email protected]>

Fix docs build

f360494

Signed-off-by: Harry Mellor <[email protected]>

Output video as np array

2491890

Signed-off-by: Harry Mellor <[email protected]>

Fix VideoDecoder instantiation

e86b3c7

Signed-off-by: Harry Mellor <[email protected]>

hmellor marked this pull request as ready for review March 18, 2025 13:30

hmellor requested review from DarkLight1337 and ywang96 as code owners March 18, 2025 13:30

Add torchcodec to rocm requirements

38a4292

Signed-off-by: Harry Mellor <[email protected]>

DarkLight1337 reviewed Mar 18, 2025

View reviewed changes

setup.py Show resolved Hide resolved

DarkLight1337 requested a review from Isotr0py March 19, 2025 07:53

Isotr0py reviewed Mar 19, 2025

View reviewed changes

requirements/cpu.txt Outdated Show resolved Hide resolved

requirements/cuda.txt Outdated Show resolved Hide resolved

requirements/rocm-build.txt Outdated Show resolved Hide resolved

Respond to comment

fa63f52

Signed-off-by: Harry Mellor <[email protected]>

Apply suggestions from code review

7531292

Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Isotr0py <[email protected]>

Isotr0py approved these changes Mar 19, 2025

View reviewed changes

wangxiyuan mentioned this pull request Mar 20, 2025

use torchvision to read video instead of decord vllm-project/vllm-ascend#341

Closed

Respond to comment

1e35f44

Signed-off-by: Harry Mellor <[email protected]>

hmellor closed this Mar 20, 2025

hmellor deleted the remove-decord branch March 26, 2025 11:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Replace `decord` with `torchcodec` #15022

Replace `decord` with `torchcodec` #15022

Uh oh!

hmellor commented Mar 18, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Mar 18, 2025

Uh oh!

Uh oh!

Isotr0py left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hmellor commented Mar 19, 2025 •

edited

Loading

Uh oh!

jeejeelee commented Mar 19, 2025 •

edited

Loading

Uh oh!

hmellor commented Mar 19, 2025

Uh oh!

Isotr0py left a comment

Uh oh!

Isotr0py commented Mar 19, 2025 •

edited

Loading

Uh oh!

Isotr0py commented Mar 19, 2025

Uh oh!

hmellor commented Mar 19, 2025

Uh oh!

Isotr0py commented Mar 20, 2025

Uh oh!

hmellor commented Mar 20, 2025

Uh oh!

hmellor commented Mar 20, 2025

Uh oh!

Uh oh!

Uh oh!

Replace decord with torchcodec #15022

Replace decord with torchcodec #15022

Uh oh!

Conversation

hmellor commented Mar 18, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Mar 18, 2025

Uh oh!

Uh oh!

Isotr0py left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hmellor commented Mar 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jeejeelee commented Mar 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hmellor commented Mar 19, 2025

Uh oh!

Isotr0py left a comment

Choose a reason for hiding this comment

Uh oh!

Isotr0py commented Mar 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Performance comparison between decord, opencv and torchcodec

Uh oh!

Isotr0py commented Mar 19, 2025

Uh oh!

hmellor commented Mar 19, 2025

Uh oh!

Isotr0py commented Mar 20, 2025

Uh oh!

hmellor commented Mar 20, 2025

Uh oh!

hmellor commented Mar 20, 2025

Uh oh!

Uh oh!

Replace `decord` with `torchcodec` #15022

Replace `decord` with `torchcodec` #15022

hmellor commented Mar 18, 2025 •

edited by github-actions bot

Loading

Isotr0py left a comment •

edited

Loading

hmellor commented Mar 19, 2025 •

edited

Loading

jeejeelee commented Mar 19, 2025 •

edited

Loading

Isotr0py commented Mar 19, 2025 •

edited

Loading

Performance comparison between `decord`, `opencv` and `torchcodec`