cummings.ee

Link to the scanned images from the poem pages

Open palewire opened this issue 3 years ago • 7 comments

palewire avatar Apr 20 '21 04:04 palewire

For XLI Poems, I took a look at getting them from HathiTrust. Their terms of use aren't super clear, but the fact that they obstruct downloading the full book is a small hint that they wouldn't approve.

Nevertheless, I tried a script to grab them and it didn't work. I could easily do the clicky-clicky through all 64 pages, if you would be comfortable using them from that source.

I'd imagined adding them in a structure like assets/img/pagescans/xli-poems/page_01.jpg

I hadn't yet thought about how to link them in.

Let me know if you want me to get the Hathi images and if so, your opinion about where to put them.

An alternative might be to link directly to the HathiTrust pages, eg p. 15 for "Songs 1".

JoeGermuska avatar May 08 '21 15:05 JoeGermuska

I think that linking to the Hathi images is good enough, especially if they'd prefer us not to download. If we get a scan of our own at a later date, we can link to that then.

palewire avatar May 08 '21 16:05 palewire

Adding the hyperlink to the poem's YAML file seems right. I'd just encourage you to structure it as an array so it can support cases where a poem runs over multiple pages.

palewire avatar May 08 '21 16:05 palewire

links to images, eg https://babel.hathitrust.org/cgi/imgsrv/image?id=uc1.%24b408727&attachment=1&size=ppi%3A300&format=image%2Fjpeg&seq=18&tracker=D1%3A (gets forced to a download when loaded in a browser)

or links to pages, eg https://babel.hathitrust.org/cgi/pt?id=uc1.$b408727&view=1up&seq=18

?
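
For comparison, here is a rough Python sketch of how the two URL styles relate: both carry the same `seq` parameter, so a page-view link can be rebuilt from a volume id and a sequence number. The helper names `page_view_url` and `seq_from_url` are made up for illustration, and the sketch assumes only the `id`, `view` and `seq` parameters visible in the links above; it is not based on any documented HathiTrust API.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Base of the page-viewer links quoted above.
PT_BASE = "https://babel.hathitrust.org/cgi/pt"


def page_view_url(volume_id: str, seq: int) -> str:
    """Build a page-viewer link (hypothetical helper, not a HathiTrust API)."""
    # safe="$" keeps the "$" in the volume id unescaped, matching the links above.
    query = urlencode({"id": volume_id, "view": "1up", "seq": seq}, safe="$")
    return f"{PT_BASE}?{query}"


def seq_from_url(url: str) -> int:
    """Read the seq parameter from either an image link or a page link."""
    return int(parse_qs(urlsplit(url).query)["seq"][0])


image_url = (
    "https://babel.hathitrust.org/cgi/imgsrv/image?id=uc1.%24b408727"
    "&attachment=1&size=ppi%3A300&format=image%2Fjpeg&seq=18&tracker=D1%3A"
)
print(page_view_url("uc1.$b408727", seq_from_url(image_url)))
```

So if we ever collect image links first, they could be converted to page links without re-clicking through the book.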

JoeGermuska avatar May 08 '21 16:05 JoeGermuska

I think I'd vote for the second one, unless you object.

palewire avatar May 08 '21 16:05 palewire

hm, yeah, links to pages makes sense. but if there's more than one, maybe you need more than just a URL... like a page number and/or a title (of the source book)?

here's how I started, for one that spans two pages:

page_scans:
    - 'https://babel.hathitrust.org/cgi/pt?id=uc1.$b408727&view=1up&seq=31'
    - 'https://babel.hathitrust.org/cgi/pt?id=uc1.$b408727&view=1up&seq=32'

but it could be

page_scans:
    - source: XLI Poems (The Dial Press, 1925) via Hathi Trust
      page: 25
      url: 'https://babel.hathitrust.org/cgi/pt?id=uc1.$b408727&view=1up&seq=31'
    - source: XLI Poems (The Dial Press, 1925) via Hathi Trust
      page: 26
      url: 'https://babel.hathitrust.org/cgi/pt?id=uc1.$b408727&view=1up&seq=32'

I guess it depends on how it will be used for display...
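
As a rough idea of the display side, here is a minimal Python sketch that accepts either shape of `page_scans` entry (a bare URL string or a mapping with `source`, `page` and `url`) and turns it into a link. The function names `normalize_scan` and `scan_link` and the label format are assumptions for illustration, not how the site actually renders anything; the entries are written as plain Python values rather than loaded from the YAML file.

```python
from html import escape


def normalize_scan(entry):
    """Return a dict with source/page/url for either entry shape."""
    if isinstance(entry, str):
        # Bare URL form: no source or page information available.
        return {"source": None, "page": None, "url": entry}
    return {
        "source": entry.get("source"),
        "page": entry.get("page"),
        "url": entry["url"],
    }


def scan_link(entry) -> str:
    """Render one entry as an HTML link (hypothetical label format)."""
    scan = normalize_scan(entry)
    if scan["page"] is not None and scan["source"]:
        label = f"p. {scan['page']}, {scan['source']}"
    elif scan["page"] is not None:
        label = f"p. {scan['page']}"
    else:
        label = "page scan"
    href = escape(scan["url"], quote=True)
    return f'<a href="{href}">{escape(label)}</a>'


page_scans = [
    "https://babel.hathitrust.org/cgi/pt?id=uc1.$b408727&view=1up&seq=31",
    {
        "source": "XLI Poems (The Dial Press, 1925) via Hathi Trust",
        "page": 26,
        "url": "https://babel.hathitrust.org/cgi/pt?id=uc1.$b408727&view=1up&seq=32",
    },
]
for entry in page_scans:
    print(scan_link(entry))
```

With the bare-URL form the label can only be generic, which is the main argument for carrying the page number and source alongside the URL.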

JoeGermuska avatar May 08 '21 17:05 JoeGermuska

I think your proposed change to the page_scans data makes a lot of sense. Will give us more flexibility.

palewire avatar May 11 '21 15:05 palewire