William Jones

Results 23 comments of William Jones

Thanks for the response! I've been modifying the `buffer_size` value from 1024 to 262144, in increments that multiply by 4 each time (so 1024, 4096, ...), and I'm still...
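A minimal sketch of such a sweep, for illustration only. It is not the benchmark from the comment: `buffer_sizes` and `time_read` are hypothetical helper names, and an in-memory `io.BytesIO` stands in for the remote object, so the absolute timings say nothing about S3.

```python
import io
import time


def buffer_sizes(start=1024, stop=262144, factor=4):
    """Yield buffer sizes from start to stop, multiplying by factor each step."""
    size = start
    while size <= stop:
        yield size
        size *= factor


def time_read(stream, buffer_size):
    """Read the whole stream in buffer_size chunks; return (seconds, bytes_read)."""
    started = time.perf_counter()
    total = 0
    while True:
        chunk = stream.read(buffer_size)
        if not chunk:
            break
        total += len(chunk)
    return time.perf_counter() - started, total


# 1 MiB of dummy data standing in for the real file (assumption).
payload = b"x" * (1024 * 1024)

results = {}
for size in buffer_sizes():
    elapsed, total = time_read(io.BytesIO(payload), size)
    results[size] = (elapsed, total)
    print(f"buffer_size={size:>7}: {elapsed:.6f}s for {total} bytes")
```

With the defaults, the generator covers five sizes: 1024, 4096, 16384, 65536 and 262144.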

Just checked that the default rate is much higher. I've now benchmarked much higher values, from 4 * 128 * 1024 in intervals increasing by 4x to 32 * 128...

By "default rate" I mean `DEFAULT_BUFFER_SIZE` that is defined here: https://github.com/RaRe-Technologies/smart_open/blob/master/smart_open/s3.py#L38

Running the integration-tests from the root directory gives me:
```
(venv) ➜ smart_open git:(master) ✗ pytest integration-tests/test_s3.py
ERROR: usage: pytest [options] [file_or_dir] [file_or_dir] [...]
pytest: error: unrecognized arguments: --reruns --reruns-delay...
```

I'm specifically reading from S3. I seem to be able to download the 330 MB file in 6 seconds using boto3's `get_object().read()`, but using `smart_open` this seems to take 159...

Additionally, using `get_object().iter_lines()` seems to iterate through the file in 8 seconds. I just want to check if I'm missing anything here!

I've noticed that actually decompressing the file takes up a large amount of time that I was not previously factoring. The following loop for the same file takes ~50 seconds:...
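One way to separate decompression cost from raw read cost is to time both over the same bytes. The sketch below is an assumption-laden illustration, not the loop from the comment: it assumes gzip compression, uses synthetic in-memory data instead of the real file, and `time_chunks` is a hypothetical helper name.

```python
import gzip
import io
import time

# Synthetic, highly compressible data standing in for the real file (assumption).
raw = b"chr1\t12345\tA\tG\n" * 200_000
compressed = gzip.compress(raw)


def time_chunks(stream, chunk_size=64 * 1024):
    """Read stream to exhaustion in chunks; return (seconds, bytes_read)."""
    started = time.perf_counter()
    total = 0
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        total += len(chunk)
    return time.perf_counter() - started, total


# Pass 1: read the compressed bytes without decompressing them.
read_seconds, read_bytes = time_chunks(io.BytesIO(compressed))

# Pass 2: read the same bytes through a gzip decompressor.
with gzip.GzipFile(fileobj=io.BytesIO(compressed), mode="rb") as gz:
    gunzip_seconds, gunzip_bytes = time_chunks(gz)

print(f"raw read:   {read_seconds:.4f}s for {read_bytes} compressed bytes")
print(f"decompress: {gunzip_seconds:.4f}s for {gunzip_bytes} decompressed bytes")
```

The difference between the two timings approximates the time spent in decompression rather than in I/O.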

Hi Heng - that's right I'm hitting: when executing the final `fermi2 assemble` command. Any ideas what this could be? I can post up the MAG files I'm using. `fm6_unitig:...

When navigating to my site for the *first* time via HTTP, my server returns an HTTP response. After I navigate to the site once via HTTPS, all future HTTP future...

The package accepts VCF files, but has not been tested on gVCF files! You will most likely run out of RAM if you try to load a gVCF. On 29...