src icon indicating copy to clipboard operation
src copied to clipboard

Loading the document doesn't work

Open nikolicdanilo opened this issue 5 years ago • 4 comments

I have a problem when try to load any document from workspace/data. I followed all the steps that are in README. Do you know what could be the problem?

nikolicdanilo avatar Jun 25 '19 12:06 nikolicdanilo

Hi @nikolicdanilo,

I am not exactly sure what the problem could be. Can you please check the following things?

  • any error message from the terminal you run the python job index.py?
  • are your documents placed under the directory workspace/data?
  • does the document have headers "Document Title", "Abstract", "Year", "PDF Link"?

It would be helpful if you can provide more context of your problem. Thank you!

Zhe

azhe825 avatar Jun 25 '19 16:06 azhe825

Hi Zhe. I have the same issue so I prefer to write here instead of creating a new issue. The answer to your three questions are:

  • In the terminal I see a warning: WARNING: This is a development server. Do not use it in a production deployment.
  • Yes, I put data under workspace/data.
  • Yes, my document has the mentioned headers plus "label".

I can load some of your sample csv files and some of my csv files but I have the problem with the main csv file I want to use.

I appreciate any help!

rmahdav avatar Mar 14 '20 19:03 rmahdav

Hi @rmahdav, it is hard for me to diagnose without any error message.

I can help you fix this problem if you can send me (to [email protected]) a small sample of your data that can reproduce your loading error.

Thank you!

azhe825 avatar Mar 14 '20 19:03 azhe825

@rmahdav fixed. Please pull the repo and try loading your data. Note: if you see a pickle file named after your data under src/memory, please remove it before loading your data again.

azhe825 avatar Mar 14 '20 23:03 azhe825