Sequence gives 0 results, even when the last item in sequence is $live?

Open jwest75674 opened this issue 5 years ago • 1 comments

Describe the bug

Edit: three-week review and cleanup.

At the bottom of this report is my config.yaml for reference.

While following the documentation on fallbacks via a "Sequence", I was surprised to get 0 results for requests that I know have results in collections included in this sequence.

Steps to reproduce the bug

I am not confident that this bug is reproducible, but I am hoping that my config can shed some light on the situation.
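
One way to narrow this down might be to query each collection for the same URL and compare the result counts. The following is a minimal sketch that only builds the query URLs. It assumes pywb is listening on localhost:8080 and serves a CDX query endpoint at /<collection>/cdx; both are assumptions about the deployment, and the helper name and the test URL are hypothetical.

```python
from urllib.parse import urlencode

# Collection names taken from the config.yaml in this report.
COLLECTIONS = ["daisychain", "homepages", "ca", "memento", "ia", "live"]


def cdx_query_url(collection, target_url, base="http://localhost:8080"):
    """Build a CDX query URL for one collection (hypothetical helper).

    The /<collection>/cdx path and the host/port are assumptions;
    adjust them to match the actual deployment.
    """
    query = urlencode({"url": target_url, "output": "json"})
    return f"{base}/{collection}/cdx?{query}"


if __name__ == "__main__":
    # Print one query URL per collection so the counts can be compared by hand.
    for name in COLLECTIONS:
        print(cdx_query_url(name, "http://example.com/"))
```

If the individual collections return rows for a URL while daisychain returns none, that would point at the sequence definition rather than at the underlying indexes.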

Expected behavior

Normal sequential failover functionality. If the first entry in the sequence does not contain a result, fail over to the next, and so on, until eventually pulling from $live.
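
The expected behavior can be sketched as follows. This is only an illustration of the failover order described above, not pywb's actual implementation; the function name and the stand-in lookups are hypothetical.

```python
def first_non_empty(sources, url):
    """Return (name, results) from the first source that has results.

    `sources` is an ordered list of (name, lookup) pairs, where each
    lookup takes a URL and returns a list of results. If every source
    comes back empty, (None, []) is returned.
    """
    for name, lookup in sources:
        results = lookup(url)
        if results:
            return name, results
    return None, []


# Stand-in lookups mirroring the order of the `daisychain` sequence.
sources = [
    ("homepages", lambda url: []),   # local index, no match
    ("ca", lambda url: []),          # local index, no match
    ("remote", lambda url: []),      # remote archives, no match
    ("live", lambda url: [url]),     # live web as the last resort
]

if __name__ == "__main__":
    print(first_non_empty(sources, "http://example.com/"))
```

Under this reading, a sequence ending in $live should only return 0 results if the live fetch itself fails.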

Environment

  • OS: Windows 10 client, Debian based Proxmox Host running on baremetal
  • Browser Chrome
  • Version 86

config.yaml

# pywb config file -- Added comments specifically for this bug report, not present in actual config.
# ========================================
#

collections:
    all: $all # Returns 0 results
    live: $live # Works
    ia: memento+https://web.archive.org/web/ # Works
    rhiz: memento+http://webenact.rhizome.org/all/ # 0 results for this example test domain
    apt:  memento+http://arquivo.pt/wayback/ # 0 results for this example test domain

    # Sequence
    daisychain: # 0 results
        sequence:
            -
              index: /mnt/commoncrawl/collections/homepages/indexes/
              resource: /mnt/commoncrawl/collections/homepages/archive/
              name: homepages

            -
              index: /mnt/commoncrawl/collections/ca/indexes
              resource: /mnt/commoncrawl/collections/ca/archive
              name: ca

            -
              index_group:
                  rhiz: memento+http://webenact.rhizome.org/all/
                  ia:   cdx+http://web.archive.org/cdx;/web
                  apt:  memento+http://arquivo.pt/wayback/

            -
              index: $live
              name: live

    homepages: # many results
        index_paths: /mnt/commoncrawl/collections/homepages/indexed_sorted/
        archive_paths: /mnt/commoncrawl/collections/homepages/archive/
    ca: #many results
        index_paths: /mnt/commoncrawl/collections/ca/indexes/
        archive_paths: /mnt/commoncrawl/collections/ca/archive/
    memento: # many results
        index_group:
            rhiz:  memento+http://webenact.rhizome.org/all/
            ia:    memento+http://web.archive.org/web/
            local: ./collections/



# Settings for each collection
use_js_obj_proxy: true

# Memento support, enable
enable_memento: true

# Replay content in an iframe
framed_replay: true

#timeout: 20 --> Disabled for testing

jwest75674 · Nov 11 '20 04:11

One month later check-in.

Still stuck.

jwest75674 · Dec 31 '20 04:12