orama icon indicating copy to clipboard operation
orama copied to clipboard

Search Filter not working as expected

Open Leo310 opened this issue 1 year ago • 9 comments

Describe the bug

When I execute this code:

console.log('Filtering by sources', sources);
const results = await search(db, {
       where: {
            source: sources,
       },
       10000,
});
console.log('Got results', results);
}

I get the following console output: image As you can see the first hit in the results already has a different source attribute (value: "README.md") than the filter (value: "To Add.md"). The filter was completely ignored.

To Reproduce

  1. Initialize Orama as follows:
const recordManagerSchema = {
    id: 'string',
    source: 'string',
    updated_at: 'number',
} as const;

const db = await create({
            schema: recordManagerSchema,
            id: "testdb",
            components: {
                tokenizer: {
                    stemming: true,
                    stemmerSkipProperties: ['id', 'source'],
                },
            },
        });
  1. Insert some data into this db with different sources
  2. execute the code I already provided in the bug description

Expected behavior

As you stated in your docs: image I expect to only get hits that include 'To Add.md' in their source attribute.

Environment Info

OS: Mac 14.2.1
Orama: 2.0.1
Running inside Obsidian (electron app)

Affected areas

Search

Additional context

No response

Leo310 avatar Jan 17 '24 17:01 Leo310

Hi, would you be able to provide a repro via replit/similar tool?

micheleriva avatar Jan 18 '24 18:01 micheleriva

@micheleriva Here is the repro: https://replit.com/join/xnweszxeca-leodev310 I figured out that the filter doesn't match the exact entries of sources but on substrings of one entry.

I think in my previous example, "README.md" was a hit because it contained the ".md" substring. That's why every record was matching. Not sure if this is the intended behavior. If it is, is there another way to filter on exact matches?

Leo310 avatar Jan 19 '24 09:01 Leo310

The numeric filter also doesn't seem to work as expected: For this code:

const before = 1705758397008;
console.log('Searching for entries indexed before', before, 'in Oramadb', db.data.docs);
const results = await search(await this.db, { where: { indexed_at: { lt: before } }, 100 });
console.log('Results', results);

I get this output: image where you can see that the indexed_at values of entries 21 and 22 are smaller than the value of before (1705757860535 < 1705758397008). But I also get no hits even though I am using the "lt" filter on indexed_at.

Leo310 avatar Jan 20 '24 13:01 Leo310

@Leo310 did you find a solution to this? I am facing the same problem

AND-TomHarris avatar Jul 18 '24 21:07 AND-TomHarris

@AND-TomHarris Unfortunately not, I ended up using Dexie.js instead of Orama

Leo310 avatar Jul 19 '24 06:07 Leo310

+1, trying to filter for strings with spaces (e.g. "History and geography") gives back hits with substrings matching the query (e.g. every entry containing "and" in the filtered field).

Is there any workaround?

fenicento avatar Aug 01 '24 09:08 fenicento