Li Tong

6 comments by Li Tong

Is there a general method? I currently implement an IterDataPipe that splits the dataset by index:

```python
@functional_datapipe("index_split")
class IndexSpliterIterDataPipe(IterDataPipe):
    def __init__(self, source_dp, start_idx, end_idx) -> None:
        super().__init__()
        self.source_dp =...
```
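For readers who want to try the idea without the datapipe machinery, here is a minimal plain-Python sketch of splitting by index. It is not the truncated class above and not a torchdata API: the function name `index_split` and the half-open range `[start_idx, end_idx)` are assumptions made for illustration.

```python
from itertools import islice
from typing import Iterable, Iterator, TypeVar

T = TypeVar("T")


def index_split(source: Iterable[T], start_idx: int, end_idx: int) -> Iterator[T]:
    """Yield the elements of `source` at positions in [start_idx, end_idx).

    Hypothetical helper: the half-open range is an assumption, not taken
    from the original comment.
    """
    if start_idx < 0 or end_idx < start_idx:
        raise ValueError("expected 0 <= start_idx <= end_idx")
    # islice still has to consume and discard the first start_idx elements,
    # so reaching a split that starts late in the stream costs time.
    return islice(source, start_idx, end_idx)


# Toy data standing in for a real dataset.
data = range(10)
val = list(index_split(data, 0, 2))
test = list(index_split(data, 2, 4))
train = list(index_split(data, 4, 10))
```

Each split re-iterates the source from the beginning, so nothing is buffered between splits, at the cost of skipping over the leading elements.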

In fact, splitting the dataset with demux is not advisable: because the training set, validation set, and test set are often very large, it is easy to exceed...

@ejguan It does take some time to skip elements. A little trick is to put the small datasets first, such as the validation set and test set,...
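A rough way to see why the ordering matters is to count how many elements each split has to skip before its first item. The helper `elements_skipped` and the split sizes below are made up for illustration and do not come from the original comment.

```python
def elements_skipped(split_sizes):
    """Map each split name to the number of elements skipped before it starts.

    `split_sizes` is a dict of split name -> size, in storage order
    (hypothetical layout; dict order is insertion order in Python 3.7+).
    """
    skipped, offset = {}, 0
    for name, size in split_sizes.items():
        skipped[name] = offset
        offset += size
    return skipped


# Illustrative sizes only.
small_first = elements_skipped({"val": 1_000, "test": 1_000, "train": 98_000})
large_first = elements_skipped({"train": 98_000, "val": 1_000, "test": 1_000})

total_small_first = sum(small_first.values())
total_large_first = sum(large_first.values())
```

With the small splits first, the training split skips only the validation and test elements; with the large split first, both small splits have to skip the whole training set.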

Alternatively, being able to import from a file would also work; there is too much to copy.

Elements of other types should also be handled by iterating over them. For example, when **output_hidden_states=True**, a tuple is returned:

```python
out = {}
for k, element...
```
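Since the snippet above is truncated, the following is only a sketch of the general pattern, not the original code: apply a per-element function to each output, recursing into tuples. The helper name `map_elements`, the placeholder `fn`, and the use of plain floats in place of tensors are all assumptions made for illustration.

```python
def map_elements(fn, element):
    """Apply `fn` to `element`, recursing into tuples and lists.

    Hypothetical helper: nested containers are returned as tuples.
    """
    if isinstance(element, (tuple, list)):
        return tuple(map_elements(fn, e) for e in element)
    return fn(element)


# Plain floats stand in for tensors; "hidden_states" mimics the tuple
# that appears when output_hidden_states=True.
outputs = {"logits": 1.0, "hidden_states": (2.0, 3.0)}

out = {}
for k, element in outputs.items():
    # `fn` is a placeholder for whatever per-tensor processing is needed.
    out[k] = map_elements(lambda x: x * 2, element)
```

In real use, `fn` would be the per-tensor operation and the values would be tensors rather than floats.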

Could the author please answer this? When installing via pip, how should one customize the Dataset and Evaluator?