src
src copied to clipboard
Loading the document doesn't work
I have a problem when try to load any document from workspace/data. I followed all the steps that are in README. Do you know what could be the problem?
Hi @nikolicdanilo,
I am not exactly sure what the problem could be. Can you please check the following things?
- any error message from the terminal you run the python job index.py?
- are your documents placed under the directory workspace/data?
- does the document have headers "Document Title", "Abstract", "Year", "PDF Link"?
It would be helpful if you can provide more context of your problem. Thank you!
Zhe
Hi Zhe. I have the same issue so I prefer to write here instead of creating a new issue. The answer to your three questions are:
- In the terminal I see a warning: WARNING: This is a development server. Do not use it in a production deployment.
- Yes, I put data under workspace/data.
- Yes, my document has the mentioned headers plus "label".
I can load some of your sample csv files and some of my csv files but I have the problem with the main csv file I want to use.
I appreciate any help!
Hi @rmahdav, it is hard for me to diagnose without any error message.
I can help you fix this problem if you can send me (to [email protected]) a small sample of your data that can reproduce your loading error.
Thank you!
@rmahdav fixed. Please pull the repo and try loading your data. Note: if you see a pickle file named after your data under src/memory, please remove it before loading your data again.