graphrag icon indicating copy to clipboard operation
graphrag copied to clipboard

Documentation request - add links to sample CSV / step to convert text to input CSV

Open erjadi opened this issue 1 year ago • 2 comments

Apologies if this is just my lack of understanding, but going through the getting started tutorial, there seems to be a step missing?

We download a book from project gutenberg in text format, and then we start the indexer. However the indexer expects CSV files in the input folder and we only have the book .txt file.

I checked the dulce.csv file that also in the repo to transform my input into something acceptable, but I think either:

  • an example CSV
  • a step to convert the txt file into CSV
  • a short description of the expected CSV format

would help people who are starting out.

erjadi avatar Apr 10 '24 07:04 erjadi

Totally agree here, in the same boat

chiragshah285 avatar Jul 02 '24 17:07 chiragshah285

For some time there was a bug in the config such that the Gutenberg txt wasn't working. This has been fixed. So I think this can probably be closed, but I'll give it a few days to see if folks chime in with a similar issue.

natoverse avatar Jul 25 '24 00:07 natoverse

I'll chime in here then @natoverse :-) A lot of the examples in the /examples folder expect a directory /examples/_sample_data to load stuff from. This directory is non-existant though. The code seems to expect csv-files which I couldn't find in the repository, so I assume the data is also missing and might be the same as what was mentioned above.

DOliana avatar Oct 28 '24 13:10 DOliana