bookwyrm csv import and export fixes

Adds shelved and published dates for books and their imported reviews. Provides option to create new (custom) shelves when importing books.

fixes #3004 fixes #2411

@bookwyrm-social/code-review I'm looking for some clues as to why my tests fail when run sequentially but pass when run individually.

Nov 25 '23 06:11 hughrun

I'm not sure how to handle the merge conflicts on this one

Jan 02 '24 18:01 mouse-reeve

I wasn't sure how to resolve the merge conflicts on this

Apr 24 '24 22:04 mouse-reeve

yeah I got distracted with other things, I need to come back to this.

Apr 25 '24 01:04 hughrun

There's something wrong with my tests here (test_create_new_shelf and test_handle_imported_book_review) - they pass when run in isolation but not when running the whole TestCase. Any ideas @bookwyrm-social/code-review ?

Jul 22 '24 02:07 hughrun

Hello @hughrun!

I took a look at the failures, here's the root cause (fwiw took a while to figure out!):

The __init__ method of BookwyrmBooksImporter is used to extend the row_mappings_guesses attribute:

https://github.com/bookwyrm-social/bookwyrm/blob/06d6360082b6164526dee75d8fb03bc5674c85ab/bookwyrm/importers/bookwyrm_import.py#L38-L40

But row_mappings_guesses is a class level attribute, not instance level; so what happens is that the pair ("shelf_name", ["shelf_name"]) is appended once per created object.

The tests fail when such __init__ runs more than once (more than one importer created), because the duplication results in shelf_name=None in the import item.

To fix the issue, you can remove the __init__ method in favor of, for example:

class BookwyrmBooksImporter(Importer):
    # ...
    row_mappings_guesses = Importer.row_mappings_guesses + [
        ("shelf_name", ["shelf_name"]),
    ]

to avoid the bogus duplication.

Jul 22 '24 03:07 dato

Ok a couple of things here:

I amended the test for retry jobs in bookwyrm/tests/importers/test_importer.py to order the jobs, because they were running in the reverse order when running multiple tests (but not a single test). My assumption is that this happened because pylint runs tests in parallel, and that ultimately it doesn't matter which order they run in, but it may have been indicating an actual problem - so some eyes on that would be helpful.
All tests pass locally, but there seems to be some kind of issue with the GitHub runners. Not sure what I can do about that.

@dato thanks for your advice.

Jul 27 '24 03:07 hughrun

I think this just needs a merge migration in order to be ready to go

Jul 27 '24 19:07 mouse-reeve

Ok there seems to be a conflict in the tests (causing some imports to finish out of order) because I'm using the same books as in generic.csv. Presumably this is because tests run in parallel? I'll adjust the test data to fix this.

Aug 07 '24 02:08 hughrun

@bookwyrm-social/code-review I think this is ready.

Aug 10 '24 06:08 hughrun

bookwyrm bookwyrm copied to clipboard

csv import and export fixes

bookwyrm
bookwyrm copied to clipboard