cascalog icon indicating copy to clipboard operation
cascalog copied to clipboard

Data processing on Hadoop without the hassle.

Results 40 cascalog issues
Sort by recently updated
recently updated
newest added

Shouldn't `hadoop-common` be used in place of `hadoop-core` in the dependencies of the various Cascalog projects, per this [stackoverflow question](http://stackoverflow.com/questions/28856096/differences-between-hadoop-common-hadoop-core-and-hadoop-client) or is there something else keeping it from being updated/changed...

I've been tasked to begin learning Hadoop and I'd like to do it in Clojure. This is one of the projects I've found, but most projects, including this one, haven't...

The sample file 'project.clj' still shows old Hadoop dependencies. The new and working ones (replace 2.7.3 version with your Hadoop version): ``` :profiles { :provided {:dependencies [[org.apache.hadoop/hadoop-common "2.7.3"] [org.apache.hadoop/hadoop-hdfs "2.7.3"]...

This used to fail since ClojureFlow did not support ISelectFields protocol.

I am having a problem in Lambda Architecture, Our data stored in HDFS is in fact based pail format using Thrift Serialization schemes and vertical partitioning. Is there any direct...

using [cascalog/midje-cascalog "2.1.1"] ``` clj (fact ( ?n ?n)) => (produces [[0] [1] [4]]])) ;; false, produces [[4] [4] [4]] (? ?n ?n)) ;; prints 0 1 4 ``` Also,...

The following input on cascalog.playground: ``` clojure (??- (

When using Clojure 1.7.0 warnings are emitted: ``` WARNING: some? already refers to: #'clojure.core/some? in namespace: jackknife.seq, being replaced by: #'jackknife.seq/some? Warning: protocol #'cascalog.cascading.flow/IRunnable is overwriting function run! WARNING: run!...

In memory platform was blowing up for an op that returned a single nil value, due to (jackknife.seq/collectify nil) => []

Cascading won't cache a subquery result if the subquery was a map-only job. Might be able to use the isSafe method on an operation to do this. See http://groups.google.com/group/cascalog-user/browse_thread/thread/40326ead37a09b23 for...

Improvement