[Python] Add video extension type
After adding ImageURIArray, EncodedImageArray and FixedShapeImageTensorArray it is straightforward to add analogous types for video. Namely VideoURIArray, VideoEncodedArray and FixedShapeVideoTensorArray array.
For decoder see TFs decode_webp.
@wjones127 not sure if this should be closed.
Are there any docs on how to write these types to a lance dataset? Specifically I'm trying to create a video column that's some sort of image array type.
I'm experimenting with doing this instead of just storing the video as bytes to save on decoding time in my training loop.
Hi @tonyf. In general, you can write an Apache Arrow extension array, and these can be written and read from Lance. A good reference for this would be Rok's changes for the image extension types:
https://github.com/lancedb/lance/pull/1272/files
We are currently exploring enhancing Lance's multimodal capabilities and came across this issue. Could @rok @wjones127 share why it was not continued?