datachain icon indicating copy to clipboard operation
datachain copied to clipboard

Parallelize video files processing

Open dreadatour opened this issue 9 months ago • 0 comments

It would be great if we can parallelize video files processing (splitting into frames/fragments). At least uploading files.

Example UDF:

def get_frames(file: VideoFile) -> Iterator[tuple[VideoFrame, ImageFile]]:
    for frame in file.get_frames(step=10):
        image = frame.save("gs://bucket-name/frames")
        yield frame, image

dreadatour avatar Apr 02 '25 03:04 dreadatour