scoobi

A Scala productivity framework for Hadoop.

Results 24 scoobi issues

Bumps hadoop-client from 2.0.0-mr1-cdh4.0.1 to 2.7.0. [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=org.apache.hadoop:hadoop-client&package-manager=maven&previous-version=2.0.0-mr1-cdh4.0.1&new-version=2.7.0)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a...

dependencies

Bumps [xstream](https://github.com/x-stream/xstream) from 1.4.2 to 1.4.19. [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=com.thoughtworks.xstream:xstream&package-manager=maven&previous-version=1.4.2&new-version=1.4.19)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter...

dependencies

Bumps [junit](https://github.com/junit-team/junit4) from 4.7 to 4.13.1. Release notes Sourced from junit's releases. JUnit 4.13.1 Please refer to the release notes for details. JUnit 4.13 Please refer to the release notes...

dependencies

Scoobi has no mechanism for notifying users at the various steps of a workflow. I believe a notification mechanism would greatly improve monitoring and give users greater control. A Webhooks API is a...

1) [minor] Rename parameters to be more consistent (no functional change). 2) Add parallelization to the HDFS->S3 move operation for speedup. PTAL @etorreborre [](https://reviewable.io/reviews/nicta/scoobi/348)

I run a ScoobiApp locally using scoobi 0.8.4-cdh5 with the LocalJobRunner. The overwrite option is set to true. At the last step, I get the following error: [ERROR] OutputChannel -...

If the cluster is under a lot of load, client calls which directly interact with it can fail in weird ways. I have seen `LeaseExpiredException`, `ConnectException: Connection refused`, and others...

Double values are logged as floats, which is often confusing and sometimes misleading. I have created a pull request to fix this issue: https://github.com/NICTA/scoobi/pull/342

It always gets written to `java.io.tmpdir`

Apache Tez[1] enables executing DAGs on top of YARN. It offers significant performance gains over plain MapReduce. Hive 0.13[2] has seen an order-of-magnitude performance improvement by using Tez. Is there...