litstudy
litstudy copied to clipboard
Manipulate and save a DocumentSet object after loading.
Hello, I am wondering how possible is to manipulate (like in a pandas table) and save a loaded DocumentSet such as .bib, ieee_csv. Or also manipulate and save the data after doing a refinement (for example using refine_scupos).
Thank you!
Hi
I am wondering how possible is to manipulate (like in a pandas table)
Manipulating the documents themselves is not possible. You can, however, manipulate a DocumentSet
which contains a list of documents by, for example, calculating the intersection, union, or differen between sets (see DocumentSet
)
and save a loaded DocumentSet such as .bib, ieee_csv.
Saving a document set is not possible, but it is a highly requested feature. There are open issues for saving a document set as a Bibtex file or RIS file:
- #12
- #13
Of you interesting in looking into these, we welcome all relevant pull requests!
I was looking for this as well. A possible workaround might be to just Export the documentset to a csv and later Reimport it if needed. Or is there any other way to not lose my progress everytime I shut down my machine? I mean, there must be a database saved somewhere, or is all this data sitting in the memory?
Which fields are called from the api upon refine? Is it all the ones from the class litstudy.types.Document?
I was looking for this as well. A possible workaround might be to just Export the documentset to a csv and later Reimport it if needed. Or is there any other way to not lose my progress everytime I shut down my machine? I mean, there must be a database saved somewhere, or is all this data sitting in the memory?
Alternatively you could pickle the document set which takes less space than a csv. After that you can reload it whenever you would like to perform further analysis on the set. Just use these code snippets:
to save: with open("data.pickle", "wb") as f: pickle.dump(data, f, protocol=pickle.HIGHEST_PROTOCOL) to load: with open("data.pickle", "rb") as f: data = pickle.load(f)