Sami Liedes

Results 27 issues of Sami Liedes

First of all, thank you for maintaining a well type annotated library. It's a joy to use! This bug is a minor issue when using spacy with `mypy --no-implicit-reexport`. I...

enhancement
types

I'm wondering if it's possible to use separate models for word and document embeddings with BERTopic. Does something break if I pass it an embedding model that treats words and...

Importing umap tended to take long; pynndescent is the culprit. This moves the substantial cost of importing pynndescent from import time to the first use. Among other things, this is...

In plain numpy, operations like `np.linalg.matrix_rank` can work on stacks of matrices: ``` In [51]: np.linalg.matrix_rank(np.random.randint(0, 2, size=(10, 8, 8))) Out[51]: array([7, 8, 8, 8, 7, 8, 8, 8, 7,...

feature-request
linear-algebra

Searched keywords: PERF402. #10322 is similar, but more complicated (and closed by the submitter). This code triggers PERF402 on ruff 0.3.5: ```python from typing import AsyncIterator async def some_async_iterable() ->...

rule

- [X] I've [run a search on the existing bug reports](https://help.github.com/en/github/searching-for-information-on-github/searching-issues-and-pull-requests) to make sure I'm not filing a duplicate (you can use the search box at the top left corner)...

bug
needs-triage

**Bug Report** On mypy 1.9.0, this code suggests, somewhat nonsensically, `__enter__` because `__aiter__` does not exist despite the reference to `__aiter__` being entirely implicit in the async for loop: ```python...

bug

### Preflight Checklist * [X] I agree to follow the [Code of Conduct](https://github.com/jgraph/drawio-desktop/blob/master/CODE_OF_CONDUCT.md) that this project adheres to. * [X] I have searched the issue tracker for a feature request...

I see that #131 and #134 added Azure support to sycamore. Apparently the demo does not support this yet; at least all the documentation just mentions setting `OPENAI_API_KEY`. I can...

### Describe the bug This code fails to load the dataset it just saved: ```python from datasets import load_dataset from transformers import AutoTokenizer MODEL = "google-bert/bert-base-cased" tokenizer = AutoTokenizer.from_pretrained(MODEL) dataset...