rucene
rucene copied to clipboard
Rust port of Lucene
The following code prints an arbitrary number, because the vec has already been dropped. I can also get aliasing &mut's to the same value by simply calling p.longs() multiple times,...
Hello fellow Rustacean, we (Rust group @sslab-gatech) found a memory-safety/soundness issue in this crate while scanning Rust code on crates.io for potential vulnerabilities. ## Issue Description https://github.com/zhihu/rucene/blob/5b55f842c2bb03beb96898d520e880c180c91adf/src/core/store/io/data_input.rs#L214-L220 `core::store::io::data_input::DataInput::read_string()` method creates...
Failing to build on latest nightly, but also when using `rustup run nightly-2019-10-28 cargo build`: ``` error[E0658]: `cfg(doctest)` is experimental and subject to change --> /Users/amooren/.cargo/registry/src/github.com-1ecc6299db9ec823/memoffset-0.5.6/src/lib.rs:74:7 | 74 | #[cfg(doctest)]...
https://github.com/tantivy-search/tantivy Is also a library to port Lucene in rust, did you compare the features between this project and tantivy?
This might be a silly question, does Rucene support Chinese character indexing and searching. I don't see any tokenizer under the https://github.com/zhihu/rucene/tree/master/src/core/analysis
用master分支build了一个索引,读取segments内容后发现版本是```6.4.18```? 这个版本对应兼容lucene哪个版本呢? ``` >> read(segments_1) header length: 35 lucene version: 6.4.18 version: 4 nameCounter: 1 segCount: 1 ... ``` 1. rucene生成的索引是```完全兼容```原生lucene的吗? 2. 有没有和原生lucene做对比的benchmark数据? 3. 有没有在分布式存储上build索引的测试数据(之前看过你们分享的ppt)? 4. merge segment的重IO操作rucene的表现怎么样? 尤其是在分布式存储上,有没有数据?
Context: I am adding rucene to https://github.com/tantivy-search/search-benchmark-game. It is a search benchmarking comparing Lucene, Tantivy, Bleve and now Rucene. Indexing works but I have to periodically commit to avoid getting...
The search benchmark consists in indexing all docs in wikipedia en. To level the field, we merge all segments down to a single segment. I was happy to see that...
Phrase query faills with a panic when running on 10_000 wikipedia docs See the following commented out code. https://github.com/tantivy-search/search-benchmark-game/blob/master/engines/rucene-0.1/src/bin/do_query.rs#L126-L128
just building example code and getting error below, ```bash error[E0599]: no method named `get_ref` found for union `MaybeUninit` in the current scope --> /home/oz-mint/.cargo/registry/src/github.com-1ecc6299db9ec823/rucene-0.1.1/src/core/search/query/spans/span_near.rs:509:45 | 509 | } else if...