bigbang icon indicating copy to clipboard operation
bigbang copied to clipboard

Use default archive path in notebooks

Open sbenthall opened this issue 1 year ago • 2 comments

The way things are supposed to work now is this:

  • A configuration file fixes the archive directory relative to the installation path of BigBang
  • When collect-mail is ran, it stores the archive data in this directory
  • When an Archive object is loaded, it by default goes to this directory, so that path is determined by a short name.

The Examples notebooks were written before this was implemented, and so are all over the map with respect to how they choose the Archive path.

Moreover, now that we more explicitly support installation of BigBang via pip and not via git cloning, it's not clear to me that this way of configuring BigBang is stable.

We should try to figure out a more pain-free way of doing this. Checking in with @MridulS about what he did to get the dashboard data in one place would be a good idea.

sbenthall avatar May 06 '23 14:05 sbenthall

I need to rethink about this bit but we should definitely be revisiting the archive/config paths. That was a serious pain point in trying to package the dashboard. I'll add it to my TODO 😅

MridulS avatar May 08 '23 13:05 MridulS

A related thing that I've just ran into is that collect-mail (in populate_provenance()) requires the --archives option to point to a path within a git repo. I'm currently trying to download IETF etc. archives to an external hard drive (and I imagine, given the size of the mailing list data, that others might try to do the same) - so while address this path question it would perhaps be useful to include in an option for a non-git path.

laurenmarietta avatar Jul 19 '23 19:07 laurenmarietta