rucene issues

Use after free / aliasing &mut's when using LongsPtr

The following code prints an arbitrary number, because the vec has already been dropped. I can also get aliasing &mut's to the same value by simply calling p.longs() multiple times,...

5225225

`Read` on uninitialized buffer may cause UB

Hello fellow Rustacean, we (Rust group @sslab-gatech) found a memory-safety/soundness issue in this crate while scanning Rust code on crates.io for potential vulnerabilities. ## Issue Description https://github.com/zhihu/rucene/blob/5b55f842c2bb03beb96898d520e880c180c91adf/src/core/store/io/data_input.rs#L214-L220 `core::store::io::data_input::DataInput::read_string()` method creates...

JOE1994

unable to build from source on toolchain recommendation

4

Failing to build on latest nightly, but also when using `rustup run nightly-2019-10-28 cargo build`: ``` error[E0658]: `cfg(doctest)` is experimental and subject to change --> /Users/amooren/.cargo/registry/src/github.com-1ecc6299db9ec823/memoffset-0.5.6/src/lib.rs:74:7 | 74 | #[cfg(doctest)]...

mooreniemi

How is it compare to tantivy

3

https://github.com/tantivy-search/tantivy Is also a library to port Lucene in rust, did you compare the features between this project and tantivy?

lynic

Does Rucene support Chinese character indexing and searching

1

This might be a silly question, does Rucene support Chinese character indexing and searching. I don't see any tokenizer under the https://github.com/zhihu/rucene/tree/master/src/core/analysis

qiqi-arch

master分支对应lucene哪个版本？

3

用master分支build了一个索引，读取segments内容后发现版本是```6.4.18```？这个版本对应兼容lucene哪个版本呢？ ``` >> read(segments_1) header length: 35 lucene version: 6.4.18 version: 4 nameCounter: 1 segCount: 1 ... ``` 1. rucene生成的索引是```完全兼容```原生lucene的吗？ 2. 有没有和原生lucene做对比的benchmark数据？ 3. 有没有在分布式存储上build索引的测试数据（之前看过你们分享的ppt）？ 4. merge segment的重IO操作rucene的表现怎么样？尤其是在分布式存储上，有没有数据？

wenma

Indexing too many document fails in one commit fails.

5

Context: I am adding rucene to https://github.com/tantivy-search/search-benchmark-game. It is a search benchmarking comparing Lucene, Tantivy, Bleve and now Rucene. Indexing works but I have to periodically commit to avoid getting...

fulmicoton

Index too large

5

The search benchmark consists in indexing all docs in wikipedia en. To level the field, we merge all segments down to a single segment. I was happy to see that...

fulmicoton

PhraseQuery do not work

2

Phrase query faills with a panic when running on 10_000 wikipedia docs See the following commented out code. https://github.com/tantivy-search/search-benchmark-game/blob/master/engines/rucene-0.1/src/bin/do_query.rs#L126-L128

fulmicoton

build error

2

just building example code and getting error below, ```bash error[E0599]: no method named `get_ref` found for union `MaybeUninit` in the current scope --> /home/oz-mint/.cargo/registry/src/github.com-1ecc6299db9ec823/rucene-0.1.1/src/core/search/query/spans/span_near.rs:509:45 | 509 | } else if...

ozkanpakdil

rucene
rucene copied to clipboard

Metadata

Use after free / aliasing &mut's when using LongsPtr

`Read` on uninitialized buffer may cause UB

unable to build from source on toolchain recommendation

How is it compare to tantivy

Does Rucene support Chinese character indexing and searching

master分支对应lucene哪个版本？

Indexing too many document fails in one commit fails.

Index too large

PhraseQuery do not work

build error

← Metadata

Owner

Metadata

rucene rucene copied to clipboard

Metadata

← Metadata

Owner

Metadata

rucene
rucene copied to clipboard