GZinga
GZinga copied to clipboard
Bumps hadoop-common from 2.4.1 to 2.10.1. [](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a...
I tried to Gzinga a dictionary file with 100K entries. Obviously I wanted to use offsets to retrieve dictionary articles. The output file grew very quickly to 1Gb for just...
Awesome work. Would consider changing GZinga's license to Apache-based? So then it can be adopted across hadoop ecosystem. Thank you.
Proposed fix for issue#8.
Currently there is no way to instantiate a GZipInputStreamRandomAccess instance with loadMetadata = false and later parse the metadata. Use case - Applications may want to cache the metadata locally...
Suggestion for Issue#5
Calls to read() are slow (network letancy), when the implementation of SeekableInputStream is backed by a remote storage. IMHO ,Ideally we should be making more of block calls (read(buf,off,len)) whenever...