About the enwiki-20230401

Open Toblame opened this issue 1 year ago • 5 comments

After downloading the data and setting up the environment, I ran this command:

python -m factscore.factscorer --input_path "/root/FNDLLM/test.jsonl" --model_name "retrieval+llama+npm" --use_atomic_facts --data_dir '/root/.cache/factscore/'

and got this error:

File "/root/anaconda3/envs/factstore/lib/python3.7/site-packages/factscore/retrieval.py", line 57, in build_db
    with open(data_path, "r") as f:
FileNotFoundError: [Errno 2] No such file or directory: '/root/.cache/factscore/enwiki-20230401.jsonl'

I couldn't find enwiki-20230401.jsonl in the downloaded data. Where is it?

Toblame avatar Feb 29 '24 09:02 Toblame

Hi @Toblame, thanks for your interest in our work. What command did you use to download the data?

The cache is stored by default in the folder where you ran the download command, see https://github.com/shmsw25/FActScore/blob/main/factscore/download_data.py#L119

Can you confirm that the other cache files are present in /root/.cache for you?
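
A quick way to check what is actually in the cache folder is sketched below. This is only an illustration, not part of FActScore: the helper name describe_cache is hypothetical, and the expected file name is taken from the paths mentioned in this thread.

```python
from pathlib import Path


def describe_cache(data_dir, expected=("enwiki-20230401.db",)):
    """Return (present, missing): file names found in data_dir and
    expected names that are absent. `expected` is an assumption based
    on the file names mentioned in this thread."""
    root = Path(data_dir).expanduser()
    # An absent or non-directory path is treated as an empty cache.
    present = sorted(p.name for p in root.iterdir()) if root.is_dir() else []
    missing = [name for name in expected if name not in present]
    return present, missing


if __name__ == "__main__":
    present, missing = describe_cache("/root/.cache/factscore")
    print("present:", present)
    print("missing:", missing)
```

If the expected file shows up as missing, the download most likely wrote to a different folder than the one passed via --data_dir.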

martiansideofthemoon avatar Mar 02 '24 10:03 martiansideofthemoon

Thank you, I have solved this problem. However, I ran into another problem: 'AssertionError: topic in your data (topic) is likely to be not a valid title in the DB.' This happens with both my own data and the FActScore labeled data.

Toblame avatar Mar 03 '24 07:03 Toblame

Hi @Toblame ,

How did you solve this problem? The download_data.py file only downloads an enwiki-20230401.db file; I cannot find a .jsonl file in the cache. TIA

tanay2001 avatar Mar 04 '24 03:03 tanay2001

Hi @Toblame ,

How did you solve this problem? The download_data.py file only downloads an enwiki-20230401.db file; I cannot find a .jsonl file in the cache. TIA

I just started over, checked the location of the cache files, and then ran the command again. However, I still hit the other problem mentioned above.

Toblame avatar Mar 05 '24 02:03 Toblame

Hi @Toblame,

Thank you, I have solved this problem. However, I ran into another problem: 'AssertionError: topic in your data (topic) is likely to be not a valid title in the DB.'

You are likely getting this error because you have set topic in some rows of the input JSONL file to the string "topic". For this to work, topic must be equal to some article title (like "Billy Conigliaro") which is present in the database.
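
To find the offending rows before running the scorer, something like the following sketch may help. It rests on assumptions that should be verified against your copy of the database: that enwiki-20230401.db is a SQLite file with a table named documents and a column named title, and that each line of the input JSONL has a "topic" key. The function name find_missing_topics is hypothetical and not part of FActScore.

```python
import json
import sqlite3


def find_missing_topics(jsonl_path, db_path, table="documents", column="title"):
    """Return the sorted list of topics in the JSONL file that have no
    matching title in the SQLite DB. Table and column names are
    assumptions; adjust them if your DB uses a different schema."""
    with open(jsonl_path, "r", encoding="utf-8") as f:
        topics = {json.loads(line)["topic"] for line in f if line.strip()}

    # Note: sqlite3.connect creates an empty file if db_path does not exist.
    conn = sqlite3.connect(db_path)
    try:
        missing = []
        for topic in sorted(topics):
            row = conn.execute(
                f"SELECT 1 FROM {table} WHERE {column} = ? LIMIT 1", (topic,)
            ).fetchone()
            if row is None:
                missing.append(topic)
    finally:
        conn.close()
    return missing


if __name__ == "__main__":
    print(find_missing_topics(
        "/root/FNDLLM/test.jsonl",
        "/root/.cache/factscore/enwiki-20230401.db",
    ))
```

Any topic this returns, such as the literal string "topic", would need to be replaced by an article title that exists in the database.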

martiansideofthemoon avatar Mar 09 '24 15:03 martiansideofthemoon