jupyterlab-s3-browser
jupyterlab-s3-browser copied to clipboard
File upload function do not upload complete file
Describe the bug I am trying to upload csv file which contains million rows. Using the s3-browser functionality I can upload file - However not complete file is uploaded and about 50% of the data is lost. The s3-broswer do not show any logs or error message To Reproduce Steps to reproduce the behavior:
- Take fairly large csv file
- Click on upload button in s3 browser
- s3 browser Uploads the data.
- Check the number of rows in the file. Not all the rows are present since whole file is not uploaded
Expected behavior Large file should be uploaded completely to s3 endpoint
Screenshots
Desktop (please complete the following information):
- OS: Ubuntu 20.04.2 LTS
- Browser Chrome 99.0.4844.83
- Version of
jupyterlab-s3-browser
extension - 0.11.1
I am using standalone Minio version - minio/minio:RELEASE.2020-06-14T18-32-17Z
as my s3 endpoint
Thanks for reporting the issue! I have been able to reproduce it. I'm pretty busy at the moment but I should have time to investigate and fix it within the next couple of weeks.
Hi Reeve, thanks for info.
If it's not that complicated, you can leave some instructions on how to fix it. Maybe I can create draft pr.
On Mon, 4 Apr 2022, 16:00 James Reeve, @.***> wrote:
Thanks for reporting the issue! I have been able to reproduce it. I'm pretty busy at the moment but I should have time to investigate and fix it within the next couple of weeks.
— Reply to this email directly, view it on GitHub https://github.com/IBM/jupyterlab-s3-browser/issues/63#issuecomment-1087595044, or unsubscribe https://github.com/notifications/unsubscribe-auth/AEIVOMX6KPV2ON6TIWNE3H3VDLYV5ANCNFSM5R23JOKQ . You are receiving this because you are subscribed to this thread.Message ID: @.***>
Hi @mohittalele, thanks for your interest and sorry for the lack of updates on this issue. I'm guessing the issue is that the serverextension doesn't handle the case where jupyterlab breaks a file up into chunks for uploading. The extension is probably just overwriting the whole file with the latest chunk every time. Are you able to confirm that hypothesis? Specifically I'm thinking you'll see the size of the file stays the same throughout the upload (maybe 1MB? or however large the chunk size is) but the content will continue changing.
I don't see the upload button guys, where is it supposed to be? I see the small video in the readme, I don't see it on my jupyter installation