Michael Sander
There are two pieces of work that need to be done: - Hard: Adapt this mapreduce library to work with the new Google datastore client libraries. I.e., remove references to...
Unless you plan on running 30,000 jobs in parallel, you don't need to shard it that much. I start getting timeouts like the one you mention at around 1000 shards.
I went ahead and built a small class to handle this. The `DatastoreMultiQuery` class takes as a constructor an array of query objects, and then emulates a unified Query class,...
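To make the idea concrete, here is a minimal sketch of a wrapper that presents several query objects as one. This is not the `DatastoreMultiQuery` class from the comment; the names `MultiQuerySketch` and `FakeQuery` are hypothetical, and it assumes each query object exposes a `fetch()` method that returns an iterable of results.

```python
from itertools import chain, islice


class MultiQuerySketch:
    """Hypothetical wrapper: treat several query objects as a single query."""

    def __init__(self, queries):
        # Assumption: every item has a fetch() method returning an iterable.
        self.queries = list(queries)

    def fetch(self, limit=None):
        # Lazily walk each underlying query in order, one after another.
        results = chain.from_iterable(q.fetch() for q in self.queries)
        # Apply an optional overall limit across all underlying queries.
        return islice(results, limit) if limit is not None else results


class FakeQuery:
    """Stand-in for a real query object, used only for this illustration."""

    def __init__(self, rows):
        self.rows = rows

    def fetch(self):
        return iter(self.rows)


combined = MultiQuerySketch([FakeQuery([1, 2]), FakeQuery([3]), FakeQuery([4, 5])])
all_rows = list(combined.fetch())
first_three = list(combined.fetch(limit=3))
```

Chaining lazily means no underlying query is run until its results are actually needed, which matters when an overall limit is reached early.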
I'm not using python3 yet, but this library is one of my blockers. Even if this is never ported to python3, could we put something in the Readme.md to warn...
Hi @euske, Just want to address your point "B" above. I am a heavy user and occasional bug fixer of pdfminer. We may not speak up much, but we DO...
This [is the PDF at issue](https://www.docketalarm.com/cases/New_Jersey_Bankruptcy_Court/2--16-bk-20392/VeraAnn_Castellano/docs/1.pdf) that causes the problem. I fixed this bug by monkeypatching the function `_add_annots` in `pdfquery.py`: ``` def _add_annots(self, layout, annots): """Adds annotations to the...
Looks like I need to do some fixing for Python 3. Please let me know if this PR is generally acceptable and I'll clean up the Python 3 issues.
Yes, I can confirm that `open` works and `os.open` does not work. I don't know why this is, but there is probably a good reason for it (perhaps it may...
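For readers unfamiliar with the two calls, the sketch below shows the general difference between them. It does not explain the specific failure reported here. The built-in `open` returns a file object, while `os.open` returns a raw integer file descriptor and takes low-level flags. On Windows, `os.open` uses text mode unless `os.O_BINARY` is passed. The file name `example.bin` is made up for the illustration.

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "example.bin")  # hypothetical file name
    with open(path, "wb") as f:
        f.write(b"line1\r\nline2")

    # Built-in open: returns a buffered file object.
    with open(path, "rb") as f:
        via_open = f.read()

    # os.open: returns an integer descriptor. O_BINARY exists only on
    # Windows, so fall back to 0 on other platforms.
    flags = os.O_RDONLY | getattr(os, "O_BINARY", 0)
    fd = os.open(path, flags)
    fd_is_int = isinstance(fd, int)

    # Wrap the descriptor to get a file object; closing it closes the fd.
    with os.fdopen(fd, "rb") as f:
        via_os_open = f.read()
```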
Good point. They're below and it looks like a mixed bag. `read_hot` and `read_random` take a major hit, but the other reads and deleting are improved. This could be due...
Also... I'm running on Windows 10. I would not be surprised if these numbers were different on other systems.