fuel issues

Allow data server to use divide-and-conquer

3

@yaoli was interested in using a divide-and-conquer approach to preprocessing, as used in @dwf's ImageNet PR (https://github.com/bartvm/fuel/pull/68). With that code, I think it should be relatively easy to...

bartvm

enhancement

Correct TextFile dataset docstring.

The TextFile dataset docstring mentions that dictionary can be a path to a pickled dictionary file, but the code never loads a pickle.

memimo

Make ServerDataStream auto-negotiate sources, produces_examples and axis_labels

This is something that could be part of the handshake between the server and the client.

vdumoulin

enhancement

UTF-8 support for TextFile

1

Please add support for UTF-8 files in TextFile (i.e., use codecs.open and allow the user to pass in an encoding). If you like, I could submit this as a patch,...

scfrank
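
The request above can be illustrated with a minimal sketch. It is not Fuel's TextFile code: `read_sentences` and its `encoding` parameter are hypothetical names, and it uses `io.open` in place of the `codecs.open` call mentioned in the issue, since `io.open` accepts an `encoding` argument on both Python 2 and Python 3.

```python
import io
import os
import tempfile


def read_sentences(path, encoding="utf-8"):
    """Yield each line of a text file as a list of whitespace-separated tokens.

    Hypothetical helper for illustration; not part of Fuel.
    """
    # io.open decodes bytes to text while reading, using the given encoding.
    with io.open(path, "r", encoding=encoding) as handle:
        for line in handle:
            yield line.split()


# Round trip with non-ASCII text written as UTF-8 bytes.
with tempfile.NamedTemporaryFile(delete=False, suffix=".txt") as tmp:
    tmp.write(u"na\u00efve caf\u00e9\n".encode("utf-8"))
sentences = list(read_sentences(tmp.name))
os.remove(tmp.name)
```

Letting the caller pass the encoding keeps the default behaviour simple while still supporting files that are not UTF-8.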

Overridden Dataset getattr needs to deal with self.sources=None properly

Leads to confusing errors when accessing attributes prior to calling super().

dwf
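
One way this kind of problem might be guarded against is sketched below. `SketchDataset` and `Child` are hypothetical stand-ins, not Fuel's classes, and the assumption is that `sources` is still `None` when a subclass touches an attribute before the parent constructor has run.

```python
class SketchDataset(object):
    """Hypothetical stand-in for a dataset class; not Fuel's Dataset."""

    sources = None  # class-level default until __init__ assigns a tuple

    def __init__(self, sources):
        self.sources = tuple(sources)

    def __getattr__(self, name):
        # __getattr__ only runs when normal attribute lookup fails.
        # Treat sources=None as "no sources yet" so the caller sees a
        # plain AttributeError rather than a TypeError from `in None`.
        sources = self.sources or ()
        if name in sources:
            return "data for " + name
        raise AttributeError(name)


class Child(SketchDataset):
    def __init__(self):
        # Attribute access before the parent constructor has run.
        self.ready = hasattr(self, "features")
        super(Child, self).__init__(["features"])


child = Child()
```

Without the `or ()` guard, the `hasattr` call in `Child.__init__` would raise a `TypeError` on Python 3, because `name in None` is not a valid membership test.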

Add documentation on what a request iterator can return

Seems like integers or lists of integers, but I couldn't find this written down anywhere. Slice objects should eventually be supported but perhaps not by default due to the silent...

dwf

docs
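
The two request shapes mentioned in this issue (single integers and lists of integers) can be sketched with plain Python generators. These are hypothetical helpers written for illustration, not Fuel's iteration scheme classes.

```python
def example_requests(num_examples):
    """Yield one integer index per request (one example at a time)."""
    for index in range(num_examples):
        yield index


def batch_requests(num_examples, batch_size):
    """Yield one list of integer indices per request (one batch at a time)."""
    for start in range(0, num_examples, batch_size):
        stop = min(start + batch_size, num_examples)
        yield list(range(start, stop))


single = list(example_requests(3))
batched = list(batch_requests(5, 2))
```

Here the last batch is shorter than `batch_size` when the number of examples is not a multiple of it; whether that is desirable is a design choice this sketch does not settle.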

Type checking data through axis semantics

Following offline discussion with @lamblin and @vdumoulin, we agreed that the most important kind of type checking to perform in the data processing pipeline is probably the semantics of the...

bartvm

enhancement

Bug in Transformer.get_epoch_iterator

5

Here's a minimal example reproducing the bug: ``` python import numpy from fuel.datasets import IndexableDataset from fuel.streams import DataStream from fuel.schemes import SequentialScheme from fuel.transformers import Mapping def run(bug=True): constructor...

vdumoulin

bug

Ensure serializability MultiProcessing

2

bartvm

enhancement

Remove dependency on both PyTables and H5PY

Fuel currently depends on both PyTables and h5py, using them for different datasets. Ideally I think we should at least make PyTables an optional dependency.

bartvm

refactor
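
A common pattern for making a dependency optional is to guard the import and fail only when the feature is actually used. The sketch below assumes PyTables is importable as `tables`; `tables_available` and `require_tables` are hypothetical names, not part of Fuel.

```python
try:
    import tables  # PyTables, treated as optional in this sketch
    tables_available = True
except ImportError:
    tables = None
    tables_available = False


def require_tables():
    """Return the PyTables module, or raise if it is not installed."""
    if not tables_available:
        raise ImportError("PyTables is required for this dataset")
    return tables
```

Datasets that need PyTables would call `require_tables()` when constructed, so users of other datasets never hit the import.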
