marklogic-samplestack icon indicating copy to clipboard operation
marklogic-samplestack copied to clipboard

Larger extract, hosting and definition plan

Open grechaw opened this issue 11 years ago • 7 comments

This task is to specify and create the larger dataset, that we will make available to end users for scaling out their data size.

grechaw avatar May 08 '14 21:05 grechaw

This task is in progress. I'm 80% on loading the initial dataset (pre processsing) from Stack Overflow 2014-05 archive.

grechaw avatar Aug 21 '14 16:08 grechaw

Loaded whole of stack overflow, first draft.

grechaw avatar Sep 05 '14 20:09 grechaw

Sah-weet!

jmakeig avatar Sep 05 '14 20:09 jmakeig

Created a dataset for use in getting PR pushed along.

grechaw avatar Sep 18 '14 03:09 grechaw

The larger dataset needs to be created, but for EA-3 we still can stick to a .tgz distribution method and worry about a larger one after EA-3

grechaw avatar Oct 17 '14 20:10 grechaw

Maybe I'm just unclear on how this is different than #46, at least in terms of timing seems like they would both be 8.0-1?

popzip avatar Oct 17 '14 21:10 popzip

Kicking to you as PM issue, doesn't absolutely need solution for 8.0-1 at all, but good to consider

grechaw avatar Dec 03 '14 23:12 grechaw