taher

Results: 51 comments of taher

Potentially this should be run on the GPU on M1, but that requires porting the CUDA kernels over to Metal.

@mirh How would that help?

The serif font issue isn't related to this PR. I've pinged @boocmp on this.

@Witsung I get the same error when I run the migrate command. Have you gotten around this issue yet?

Do the input tokens need to go through many layers to obtain their embeddings? My guess is that the input embeddings should be obtainable earlier, so that it is not...

Are the input embeddings obtained at the starting layer static and not contextual? In order for the embeddings to include context for each input token in the sequence, is it...
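
To illustrate what I mean by "static", here is a minimal sketch under generic transformer assumptions; the names and sizes are illustrative, not llama.cpp's actual tensors:

```python
# Illustrative sketch only, not llama.cpp's API: input embeddings are a
# static table lookup, while contextual embeddings require the layers.
import numpy as np

vocab_size, d_model = 32000, 4096          # hypothetical sizes
rng = np.random.default_rng(0)
tok_embeddings = rng.standard_normal((vocab_size, d_model)).astype(np.float32)

token_ids = [1, 529, 29871]                # example token ids (made up)

# Input embeddings: a pure row lookup, identical for a given token
# regardless of its position or its neighbours in the sequence.
input_embs = tok_embeddings[token_ids]     # shape (3, d_model)

# Contextual embeddings would only appear after running the transformer
# layers, where attention mixes information across positions, e.g.:
#   hidden = input_embs
#   for layer in transformer_layers:       # hypothetical
#       hidden = layer(hidden)             # attention + feed-forward
```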

How are you testing the correctness of the embeddings?

I'm curious whether normalizing reduces the dimensionality of the embedding space. If the goal of the consumer is only to compute cosine similarity from the embeddings, then I think it...
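
For reference, the relation I have in mind is just the standard definition, nothing specific to this PR:

$$
\cos(a, b) = \frac{a \cdot b}{\lVert a \rVert \, \lVert b \rVert} = \hat{a} \cdot \hat{b}, \qquad \hat{a} = \frac{a}{\lVert a \rVert}
$$

So for unit-normalized vectors, cosine similarity reduces to a plain dot product.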

It would be helpful to add an example of generating input embeddings, normalizing them, and computing cosine similarity in the example folder in this PR.
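
Something along these lines, for instance. This is only a rough sketch; the embedding vectors below are random stand-ins for whatever this PR ends up exposing, not output from an existing llama.cpp call:

```python
# Rough sketch of the requested example: obtain input embeddings,
# L2-normalize them, then compute cosine similarity.
import numpy as np

def l2_normalize(v: np.ndarray) -> np.ndarray:
    """Rescale a vector to unit length; the number of dimensions is unchanged."""
    norm = np.linalg.norm(v)
    return v / norm if norm > 0 else v

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity; for unit-length vectors this is just the dot product."""
    return float(np.dot(l2_normalize(a), l2_normalize(b)))

# Stand-ins for embeddings produced by this PR; replace with the real
# output once the embedding API is available.
emb_a = np.random.default_rng(1).standard_normal(4096)
emb_b = np.random.default_rng(2).standard_normal(4096)

print(cosine_similarity(emb_a, emb_b))
```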

If you are asking about applying rotary embeddings, that is done in the [llama.cpp](https://github.com/ggerganov/llama.cpp/blob/master/llama.cpp#L705) file and not during conversion.
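
For reference, the rotary step looks roughly like the following. This is a generic RoPE sketch under the usual formulation, not the exact llama.cpp kernel:

```python
# Generic rotary position embedding (RoPE) sketch: applied at inference
# time to the query/key vectors inside attention, not during conversion.
import numpy as np

def apply_rope(x: np.ndarray, position: int, base: float = 10000.0) -> np.ndarray:
    """Rotate consecutive dimension pairs of `x` by position-dependent angles."""
    d = x.shape[-1]
    out = x.copy()
    for i in range(0, d, 2):
        theta = position * (base ** (-i / d))
        cos_t, sin_t = np.cos(theta), np.sin(theta)
        x0, x1 = x[i], x[i + 1]
        out[i] = x0 * cos_t - x1 * sin_t
        out[i + 1] = x0 * sin_t + x1 * cos_t
    return out

q = np.random.default_rng(0).standard_normal(128).astype(np.float32)
# The same vector gets a different rotation at each position, which is
# how positional information enters the attention scores.
q_rotated = apply_rope(q, position=5)
```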