datachain
datachain copied to clipboard
Parallelize video files processing
It would be great if we can parallelize video files processing (splitting into frames/fragments). At least uploading files.
Example UDF:
def get_frames(file: VideoFile) -> Iterator[tuple[VideoFrame, ImageFile]]:
for frame in file.get_frames(step=10):
image = frame.save("gs://bucket-name/frames")
yield frame, image