pytorch-ie issues

docs for pie Concepts & Architecture

[Rendered](https://github.com/ChristophAlt/pytorch-ie/tree/docs_pie_concepts#-concepts--architecture)

ArneBinder

documentation

implement MultiModalSequenceTaggingTaskModule

ArneBinder

enhancement

Allow a dataset to be prepared from a dictionary instead of a collection of documents

1

The reason is that there may exist datasets that are so large they can't be iterated over in a reasonable time. For efficiency reasons we should allow the user to...

ChristophAlt

remove parameter is_training from encode_inputs

`encode_inputs` should not do anything depending on the state of `is_training`, respective code can live in `encode_targets`. This will ease separation of concerns and testing. To implement this it may...

ArneBinder

[WIP] fix training seq2seq

2

This was broken because pytorch-lightning tries to move the output of `TransformerSeq2SeqTaskModule.collate` to a device via `pytorch_lightning.core.datamodule.LightningDataModule.transfer_batch_to_device` that internally uses [`pytorch_lightning.utilities.apply_func.apply_to_collection`](https://pytorch-lightning.readthedocs.io/en/stable/api/pytorch_lightning.utilities.apply_func.html#pytorch_lightning.utilities.apply_func.apply_to_collection). This method fails if any part of the input...

ArneBinder

bug

Make readme ready for public usage

Until #183 is implemented, we need at least descriptions for the most relevant parts of PyTorch-IE in the readme to make it usable by the public. This may require the...

ArneBinder

documentation

[WIP] is_prepared attribute for taskmodule

1

Approach: The base taskmodule now has also a `_prepare()` method which should be overwritten in derived classes instead of `prepare()`. `prepare()` does now the following: 1. it checks, if the...

ArneBinder

[WIP] use dataset.map in pipeline

2

If `documents` of type `Dataset` is passed to the pipeline, use `documents.map` to add the predictions. In this case, a `Dataset` is returned instead of `Sequence[Document]`. Note: Builds on top...

ArneBinder

Add HF Dataset to PIE Dataset class conversion to methods of Dataset

For instance `dataset.train_test_split(...)` returns a HF Dataset, which then breaks serialization, deserialization logic. Not sure if there's a better solution but the quick fix would be to wrap the methods...

ChristophAlt

core

Add bigscience brat parser as packaged module

ChristophAlt

data

pytorch-ie
pytorch-ie copied to clipboard

Metadata

docs for pie Concepts & Architecture

implement MultiModalSequenceTaggingTaskModule

Allow a dataset to be prepared from a dictionary instead of a collection of documents

remove parameter is_training from encode_inputs

[WIP] fix training seq2seq

Make readme ready for public usage

[WIP] is_prepared attribute for taskmodule

[WIP] use dataset.map in pipeline

Add HF Dataset to PIE Dataset class conversion to methods of Dataset

Add bigscience brat parser as packaged module

← Metadata

Owner

Metadata

pytorch-ie pytorch-ie copied to clipboard

Metadata

← Metadata

Owner

Metadata

pytorch-ie
pytorch-ie copied to clipboard