Andy Jackson
Andy Jackson
The problem is, ours look like this: ``` os.path.join('webhdfs://server', '/file/path/on/hdfs.warc.gz') ``` which gives `/file/path/on/hdfs.warc.gz` but the old code gave `webhdfs://server/file/path/on/hdfs.warc.gz`.
Hmm, sorting is generally `O(n log(n))` at best. So, unless we can pre-prepare the data, or unless something else is going on (e.g. this is actually arising from some I/O...
Hah, fair enough! Definitely some room for improvement there... :-)
I can't speak for the others but I'm subscribed to this issue because I need to be able to run `repo2jupyter` behind a corporate web proxy. Here's how I do...
The error is a SAX parser error, and yes, it is bubbling up from one of the dependencies. Specifically: ``` $ curl http://www.danrw.de/schemas/contract/v1/danrw-contract-1.xsd #watch-discussion, #watch7-discussion, ytd-comments { display: none; }...
Okay, here's a relevant StackOverflow Q that has a solution: http://stackoverflow.com/questions/29696638/how-to-validate-xml-with-schema-urls-that-return-http-301
Just to make things more confusing, I downloaded it and it works fine for me! ``` Jhove (Rel. 1.14.0, 2016-10-06) Date: 2018-02-22 12:49:27 GMT RepresentationInformation: /Users/andy/Downloads/mets-test.xml ReportingModule: UTF8-hul, Rel. 1.6...
@bitsgalore I can't remember the details, but I've hit problems before with JHOVE giving really weird errors when remote XSDs have not been available. One of those times when I...
@bitsgalore in that case,`xmlstarlet val` FTW!
Hah! Thanks @marhop - JHOVE's behaviour still confusing me after all these years. You think I'd know by now. *EDIT:* Note that the times I've had trouble with JHOVE downloading...