Pierre Ruyssen

Results 6 issues of Pierre Ruyssen

There is no way for now to express that a field should be a multidimensional array, for example a 4x4 matrix. An example of dataset with such a need: MatrixCity...

enhancement

Dataset with bounding boxes: https://github.com/mlcommons/croissant/blob/main/datasets/coco2014/metadata.json

enhancement

https://github.com/mlcommons/croissant/blob/main/docs/howto/bounding-boxes.md Example: https://github.com/mlcommons/croissant/blob/main/datasets/coco2014/metadata.json

documentation

https://github.com/mlcommons/croissant/blob/main/docs/howto/labels.md Example: https://github.com/mlcommons/croissant/blob/main/datasets/titanic/metadata.json Maybe we want to add a dataset with labels in the sense of classification.

documentation

There is already a howto about splits (https://github.com/mlcommons/croissant/blob/main/docs/howto/specify-splits.md) and an example (https://github.com/mlcommons/croissant/blob/main/datasets/coco2014/metadata.json). However we also want support for other types of partitions, namely dated partitions and languages (eg: wikipedia). Currently...

documentation
enhancement

_Originally posted by @benjelloun in https://github.com/mlcommons/datasets_format/pull/34#discussion_r1204103941_ Omar I assign to you to check whether external context files are supported by search engines. Thanks!