Pablo

Results 90 comments of Pablo

Also, make RunnerContextManager fully immutable (will also help to persist it with ray.put) to avoid the cost of passing it around ever and over.

I think it's fine if @flyingImer picks this up ? : )

theres's very silly superficial stuff I wrote here: https://docs.google.com/document/d/1DcuKhCPnZezIvu9vFMsM4BRdBv0kgAWewOJqRbS42GI/edit# Specifically, I would say `read_datasource` may be a good point to look at - it spins up several Ray Tasks that...

Beam has a few utilities to convert to-from Beam and Arrow schemas (see [here](https://github.com/apache/beam/blob/master/sdks/python/apache_beam/typehints/arrow_type_compatibility.py)). A first step would be to write an `ArrowRecordBatchCoder`, which can be constructed with a Beam...

I'll take on this issue as well.

@valiantljk I will get some pointers to look at this : )

hmmm thanks for finding that. I'll try to debug this week...

This should clean up all Ray resouces after every test: https://github.com/ray-project/ray_beam_runner/pull/40/ Note there's the IMPULSE element that we were creating in a 'global' context.

Hello team. I'm interested in picking up this item. I'll try and do the relevant research and propose a minimum design in the next few days (realistically weeks:))

Hi! Thanks for filing the issue. Would you like to submit a pull request with the change so I can pull it in?