rucene icon indicating copy to clipboard operation
rucene copied to clipboard

Rust port of Lucene

Results 12 rucene issues
Sort by recently updated
recently updated
newest added

The following code prints an arbitrary number, because the vec has already been dropped. I can also get aliasing &mut's to the same value by simply calling p.longs() multiple times,...

Hello fellow Rustacean, we (Rust group @sslab-gatech) found a memory-safety/soundness issue in this crate while scanning Rust code on crates.io for potential vulnerabilities. ## Issue Description https://github.com/zhihu/rucene/blob/5b55f842c2bb03beb96898d520e880c180c91adf/src/core/store/io/data_input.rs#L214-L220 `core::store::io::data_input::DataInput::read_string()` method creates...

Failing to build on latest nightly, but also when using `rustup run nightly-2019-10-28 cargo build`: ``` error[E0658]: `cfg(doctest)` is experimental and subject to change --> /Users/amooren/.cargo/registry/src/github.com-1ecc6299db9ec823/memoffset-0.5.6/src/lib.rs:74:7 | 74 | #[cfg(doctest)]...

https://github.com/tantivy-search/tantivy Is also a library to port Lucene in rust, did you compare the features between this project and tantivy?

This might be a silly question, does Rucene support Chinese character indexing and searching. I don't see any tokenizer under the https://github.com/zhihu/rucene/tree/master/src/core/analysis

用master分支build了一个索引,读取segments内容后发现版本是```6.4.18```? 这个版本对应兼容lucene哪个版本呢? ``` >> read(segments_1) header length: 35 lucene version: 6.4.18 version: 4 nameCounter: 1 segCount: 1 ... ``` 1. rucene生成的索引是```完全兼容```原生lucene的吗? 2. 有没有和原生lucene做对比的benchmark数据? 3. 有没有在分布式存储上build索引的测试数据(之前看过你们分享的ppt)? 4. merge segment的重IO操作rucene的表现怎么样? 尤其是在分布式存储上,有没有数据?

Context: I am adding rucene to https://github.com/tantivy-search/search-benchmark-game. It is a search benchmarking comparing Lucene, Tantivy, Bleve and now Rucene. Indexing works but I have to periodically commit to avoid getting...

The search benchmark consists in indexing all docs in wikipedia en. To level the field, we merge all segments down to a single segment. I was happy to see that...

Phrase query faills with a panic when running on 10_000 wikipedia docs See the following commented out code. https://github.com/tantivy-search/search-benchmark-game/blob/master/engines/rucene-0.1/src/bin/do_query.rs#L126-L128

just building example code and getting error below, ```bash error[E0599]: no method named `get_ref` found for union `MaybeUninit` in the current scope --> /home/oz-mint/.cargo/registry/src/github.com-1ecc6299db9ec823/rucene-0.1.1/src/core/search/query/spans/span_near.rs:509:45 | 509 | } else if...