Ash Vardanian

Results 72 issues of Ash Vardanian

Currently, the anti-diagonal approach for Levenshtein distance is only implemented for square matrices. It should be easy to generalize it to rectangular ones, opening opportunities for more aggressive vectorization. https://github.com/ashvardanian/StringZilla/blob/06423189e7a5b9e2882b471e98ec897d40734fc4/include/stringzilla/stringzilla.h#L2010-L2013

good first issue
core

StringZilla currently implements several SWAR search optimizations for needles of different length: `_sz_find_2byte_serial`, `_sz_find_3byte_serial`, `_sz_find_4byte_serial`. Those currently lack reverse-order variants and should be implemented to guarantee the same level of...

enhancement
good first issue

USearch implementation had 2 layers, the core HNSW structure implemented in `index.hpp` and the high-level wrapper for dense equidimensional vectors in `index_dense.hpp`. In this release, we've made the top layer...

Our GoLang binding for USearch is missing several APIs that are available in the C version. Let's add these to make the GoLang version just as powerful! Missing APIs: 1....

good first issue
help wanted

### Describe what you are looking for As a performance optimization, we can implement `merge` for existing indexes instead of moving (inserting) elements one by one. ### Can you contribute...

enhancement

### Describe what you are looking for Assuming hierarchical information, we can provide functions to embed/draw those graphs in low-dimensional spaces, like 2D. This should help great people trying to...

enhancement

### Describe what you are looking for We want the Paclet to pass tests in GitHub CI and get uploaded to the Marketplace. [Docs](https://resources.wolframcloud.com/PacletRepository/resources/Wolfram/PacletCICD/) ### Can you contribute to the...

enhancement

### Describe what you are looking for We currently have a Docker image with a few undocumented features. Those are highly dependent on UCall. To make the interface more generic,...

enhancement

This Pull Request extends USearch Molecules to use [COATI2 open-source models](https://github.com/terraytherapeutics/COATI) from [Terray Therapeutics](https://www.terraytx.com/). --- Closes #3

I was preparing some datasets for AI training and noticed that `datasets` by HuggingFace uses the conventional `open` mechanism to read the file and split it into chunks. I thought...