heritrix3 icon indicating copy to clipboard operation
heritrix3 copied to clipboard

HTTP/2 protocol

Open kauka-1 opened this issue 3 years ago • 1 comments

Hello,

Heritrix doesn't harvest material from Web sites which require HTTP/2 protocol. Our installation has found some Web servers which don't accept HTTP/1.

kauka-1 avatar Mar 28 '22 07:03 kauka-1

Supporting HTTP/2 would likely involve writing a new FetchHTTP module on top of Apache HttpClient 5 or another HTTP client library. The current mechanism Heritrix uses for recording responses will not work for HTTP/2 and will need rethinking.

ato avatar Mar 28 '22 07:03 ato