wikihadoop
wikihadoop copied to clipboard
Generalize the splitter for non-Wikipedia XMLs
The splitting function essentially should work for more general usecases where you want to split an XML without splitting a certain element. We could provide a separate class for splitting.