Distributed-CellProfiler issues

Try/except handling for DOWNLOAD_FILES

2

Add WORKSPACE_BUCKET

Adds WORKSPACE_BUCKET to the DCP configuration, allowing image files to be read from a different bucket than non-image files (e.g. pipeline, load_data.csv). Closes #160. Also adds another demo project that...

ErinWeisbart

Think about - exposing number of retries before dead messages in the config file

I think we can do this, and it would be nice for more "experimental" setups

bethac07

enhancement
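
One way the retry count could be exposed: SQS decides when a message is dead through the `maxReceiveCount` field of a queue's `RedrivePolicy`, so a config value could feed that field directly. The sketch below is an assumption about how this might look, not DCP's current code; `MAX_RECEIVE_COUNT`, `build_redrive_attributes`, and any ARN passed in are hypothetical.

```python
import json

# Hypothetical config value: how many times a message may be received
# before SQS moves it to the dead-letter queue.
MAX_RECEIVE_COUNT = 10


def build_redrive_attributes(dead_letter_arn, max_receive_count=MAX_RECEIVE_COUNT):
    """Return SQS queue attributes carrying a RedrivePolicy."""
    count = int(max_receive_count)
    if count < 1:
        raise ValueError("max_receive_count must be at least 1")
    policy = {
        "deadLetterTargetArn": dead_letter_arn,
        "maxReceiveCount": str(count),
    }
    # SQS expects the policy as a JSON string under the RedrivePolicy key.
    return {"RedrivePolicy": json.dumps(policy)}
```

The returned dictionary has the shape accepted by the `Attributes` argument of boto3's `create_queue` and `set_queue_attributes`, so a more "experimental" setup would only need to change the one config value.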

Plugin usage not pointing to the new plugin repo structure

We'll need some clever logic to detect whether it's the old or the new structure, and/or some non-clever `try`/`except` logic.

bethac07
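
The "detect the structure" option from this issue might be sketched as follows. It assumes the new plugin repo keeps its plugins in a subdirectory (named `active_plugins` here, which is an assumption to verify against the repo) while the old structure kept the `.py` files at the repo root; `find_plugin_dir` is a hypothetical helper, not existing DCP code.

```python
from pathlib import Path


def find_plugin_dir(repo_root, new_layout_subdir="active_plugins"):
    """Return the directory holding plugin .py files in a cloned plugin repo."""
    root = Path(repo_root)
    candidate = root / new_layout_subdir
    # New structure (assumed): plugins live in a subdirectory of the repo.
    if candidate.is_dir():
        return candidate
    # Old structure (assumed): plugin files sit directly in the repo root.
    if any(root.glob("*.py")):
        return root
    raise FileNotFoundError(f"No plugins found under {root}")
```

Checking for the subdirectory first means a new-structure clone is never mistaken for an old one, and an empty or mis-cloned repo raises an error instead of quietly running without plugins.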

Fix credential handling

3

Still a bit of a mess handling both roles and users. I believe what is currently pulled to run-worker.sh in `master` handles roles nicely but doesn't play well with user...

ErinWeisbart

read write from external bucket with profile

This works to read/write from an external bucket using a profile pointing to an assumed role. Docker image currently available at `erinweisbart/distributed-cellprofiler:readwrite_profile`. Have not yet tested to see if I broke anything...

ErinWeisbart

Return useful error if download files don't exist

When downloading files (`DOWNLOAD_FILES=True`) listed in the .csvs with [`s3.meta.client.download_file(AWS_BUCKET,prefix_on_bucket,new_file_name)`](https://github.com/DistributedScience/Distributed-CellProfiler/blob/7baffda87ae85f3778d513e2d69e549996ebc091/worker/cp-worker.py#L215), a file that isn't in S3 should produce a useful error message saying so. Currently the failure is silent.

ErinWeisbart
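
A minimal sketch of what a useful error for a missing download could look like. It assumes the client behaves like boto3's S3 client, where a missing object surfaces as a `ClientError` whose `response["Error"]["Code"]` is `"404"` (or `"NoSuchKey"` for some calls). `download_or_explain` and `MissingS3FileError` are hypothetical names, and the error code is read through `getattr` only so the sketch runs without botocore installed; real worker code would more likely catch `botocore.exceptions.ClientError` directly.

```python
import logging

logger = logging.getLogger(__name__)


class MissingS3FileError(FileNotFoundError):
    """Hypothetical error raised when a requested S3 object does not exist."""


def download_or_explain(client, bucket, key, dest):
    """Download one object, turning a missing-key failure into a readable error."""
    try:
        client.download_file(bucket, key, dest)
    except Exception as exc:
        # botocore's ClientError stores the service error in exc.response.
        response = getattr(exc, "response", None) or {}
        code = str(response.get("Error", {}).get("Code", ""))
        if code in ("404", "NoSuchKey", "NotFound"):
            message = f"s3://{bucket}/{key} does not exist; check the paths in your .csv"
            logger.error(message)
            raise MissingS3FileError(message) from exc
        # Anything else (permissions, throttling, ...) is re-raised unchanged.
        raise
    return dest
```

Raising a dedicated exception lets the caller decide whether to skip the job or stop, while the logged message names the exact bucket and key that were missing.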

Change anything re: plugins?

1

Given new plugin organization in CP-plugins repo, confirm that DCP doesn't need any corresponding changes.

ErinWeisbart

Enable configurable load_data.csv location in config

Currently, load_data.csv is read from SOURCE_BUCKET. For a use case like Cell Painting Gallery where you're reading public images, would be nice to be able to set load_data.csv location to...

ErinWeisbart

enhancement

Better error message for failed Transfer of CellProfiler logs to S3

2

```
Transfer of CellProfiler logs to S3 initiated
Traceback (most recent call last):
  File "run.py", line 786, in <module>
    monitor()
  File "run.py", line 763, in monitor
    export_logs(logs, loggroupId, starttime, bucketId)
  File...
```

ErinWeisbart

enhancement
