obsidian-copilot icon indicating copy to clipboard operation
obsidian-copilot copied to clipboard

(LOGAN PLEASE HELP!!! )Despite high hopes for this plug-in, Multiple screenshots confirm the contrary :(

Open ReliablyAwkward opened this issue 1 year ago • 28 comments

I created an entirely new vault just to test this plug-in, I only have one note inside, and it has failed at every test to communicate accordingly. I am running Google advanced API w/ the embedding API from cohere. At one point it was saying cannot connect to the internet, another it was saying it cannot read my vault, and now it has all these in addition to those, what on earth is going on here?

I really want to enjoy this feature, but lordy lord, this has gotten rather redundant now hasn't it. (Regarding time efficiency) Completely inaccurate Randomly complete different language Why 2 Why 3

ReliablyAwkward avatar Feb 27 '24 13:02 ReliablyAwkward

Hi @ReliablyAwkward, thanks for reporting. Let me address your screenshots one by one.

  1. Directly mentioning [[note title]] isn't supported in Chat mode yet. You need to use the small "up" button to send your active note or a context with multiple notes (set with command) to the model.
  2. Not sure why it replied in Spanish, does your note contain Spanish? Or it could just be Gemini hallucinating.
  3. I can't see what the model said, but it shows you successfully passed the active note.
  4. I see a 500 error, this is internal to Google's API and has nothing to do with the plugin. As directed, you may report it to Google using that link

In general the issue I see is with Gemini, can you try with another model? GPT-4-TURBO is the benchmark and can probably eliminate model-specific issues. If you don't have OpenAI access, you can also try OpenRouter for free models.

logancyang avatar Feb 27 '24 23:02 logancyang

WHYYYY

Yes I have been taking your first instruction but when I do that it says no context found. There is zero and I mean zero in my note regarding Spanish. And I don't understand regarding number four, I have a Gemini Advanced 2-month trial for which I see API key access, yet when I look at your model selection it says Gemini "PRO", are these not the same thing? I tried resetting my API key there, and I'm still dead in the water over here regarding just even trying to get your plug-in off the ground.

I reset everything, I started an entirely new vault just for your plug-in only, it is the only plug-in active, I refreshed all my API keys, there was only one single note in the entire vault, I changed it from test to "HERE", and I cannot get one single reaction from the plug-in to interact w/ it.

I've watched every single one of your YouTube videos, I'm running a windows 10 / Toshiba Satellite laptop, I wish to lord I could simply have an offline lol version to simply sidestep all this, but I have still yet to see anything simplified enough to achieve such a context, unless you somehow now all of a sudden magically know of a way to go that route instead?

I have tried emulators / Docker / etc, I'm finding it increasingly frustrating to find something to simply represent my main vault which consists of over 2 million words all geared towards One Singular research project, for which I wish to see something be capable of a representing said data set, as a mouthpiece, that I can simply maintain a permanent dialogue and time w/. Sunset of memory, just like Claude 2.1, their number one flaw, is that out of nowhere they drop continuity. And I cannot continue just sending chunks upon chunks upon chunks upon chunks, this has grown to become quite contrary to productivity, as my number one time waster now :( :( :(

ReliablyAwkward avatar Feb 29 '24 10:02 ReliablyAwkward

(crickets)

ReliablyAwkward avatar Mar 02 '24 19:03 ReliablyAwkward

@ReliablyAwkward In my previous reply point 1: it does NOT support mentioning a note like that in Chat directly, your screenshot is still a direct mention. You have to use the little up button or templating in Chat mode like shown in my videos.

However, the new release Vault QA mode v2.5.0 should work for direct mentions. That new mode also supports completely offline QA with the ollama local embeddings. A new video on that will be up this week.

logancyang avatar Mar 05 '24 00:03 logancyang

@ReliablyAwkward In my previous reply point 1: it does NOT support mentioning a note like that in Chat directly, your screenshot is still a direct mention. You have to use the little up button or templating in Chat mode like shown in my videos.

However, the new release Vault QA mode v2.5.0 should work for direct mentions. That new mode also supports completely offline QA with the ollama local embeddings. A new video on that will be up this week.

Brother, I have been doing this, please see attachments in order....I have been waiting so long for you to respond only to arrive at the same dead end :(

I have been using the little up button from the beginning, in both modes I have tried and tried every option, for Pete's Peppers, all I want is to communicate w/ my vault as a singular data set.

I made an isolated vault just to test your plug-in only, no other plug-in is clashing w/ it, literally zero friction is on this vault except for your plug-in, I only have one note in this vault and it cannot for some reason communicate whatsoever :( :(

1 2 3 1 4 5

ReliablyAwkward avatar Mar 05 '24 12:03 ReliablyAwkward

  • From your 2nd screenshot the AI did not response after you clicked the button, that means your call failed. Can you do this again but include your dev console in the screenshot too
  • From your 3rd and 4th screenshots, it amazes me how Gemini said [[ICI]] out of nowhere. What's your temperature setting? It's purely hallucinating.

BTW, in QA mode, pls ask more specific questions as the bot mentioned.

This kind of questions are bad 👎 : talk to me about [[note title]], summarize this article

These are better 👍 : what does a cerebral vacation mean, what are the insane chain of bold claims

logancyang avatar Mar 06 '24 02:03 logancyang

In order of advice 0 1 2 3

ReliablyAwkward avatar Mar 06 '24 14:03 ReliablyAwkward

@ReliablyAwkward I see, thanks for the screenshots. This is an actual bug to be fixed. It's caused by the particular format of your note, probably you have --- at the top for headings. Could you copy this note in source mode and paste it here, just the top 20-50 lines should be fine, I'll use it for testing.

In the meantime, try anything without those ---.

logancyang avatar Mar 08 '24 08:03 logancyang

Okay I did as you said, and yes indeed, I did have many of those three lines in a row, I replaced every single one w/ dots instead, so now there are zero of those dashes anywhere in the note, tried it again and the snapshot shows the result. after subtraction of the 3 lines

ReliablyAwkward avatar Mar 08 '24 08:03 ReliablyAwkward

Okay I did as you said, and yes indeed, I did have many of those three lines in a row, I replaced every single one w/ dots instead, so now there are zero of those dashes anywhere in the note, tried it again and the snapshot shows the result. after subtraction of the 3 lines

This time it's 500 internal error from Google, not sure what's happening on their side. You could report to them using that link they provided. I believe I saw it once or twice before too but Gemini is working for me now. Could it be your account issue? But if it's because of payment or access they should really return 403 and not 500 🤷‍♂️

logancyang avatar Mar 08 '24 21:03 logancyang

I believe you, but I have zero clue what you are saying. I grasp that it is an error on Google Gemini's part and not your plug-in, but, I did do exactly as you said and removed the lines, and it threw this code last time even before that adjustment. My account w/ them is in good standing, I just tried it again and this is the gibberish it gives me back. Why does it seem like the rest of the world is enjoying your plug-in except me, lol :( :( :( I WANT TO ENJOY YOUR PLUGIN

ReliablyAwkward avatar Mar 08 '24 21:03 ReliablyAwkward

Have you tried OpenRouter? They have a big collection of models and a free quota.

As for Google, make sure you can use Gemini in Google AI Studio first https://aistudio.google.com/app/u/1/prompts/new_chat

Find their support people and show them this 500 internal error message is your best bet.

I can't even use Google AI Studio without getting an error, huge bummer SCR-20240308-mngn

logancyang avatar Mar 08 '24 22:03 logancyang

Have you tried OpenRouter? They have a big collection of models and a free quota.

As for Google, make sure you can use Gemini in Google AI Studio first https://aistudio.google.com/app/u/1/prompts/new_chat

Find their support people and show them this 500 internal error message is your best bet.

I can't even use Google AI Studio without getting an error, huge bummer SCR-20240308-mngn

No I have never messed w/ open router because I got sick of the runaround a while ago trying to find something that my machine can handle.Last week I did.... I have been using clause 3 and it is achieving the type of context I desire from my entire obsidian vault. Have been getting crazy deep regarding pattern recognition in tandem w/ topics like dimensional threshold and it has been superb in maintaining continuity across the conversation.

I'm not about to start chasing my tail w/ Google support now, lol

I just tried that same window that your snapshot showed and got the exact same error.

Are you in Los Angeles by any chance?

ReliablyAwkward avatar Mar 08 '24 22:03 ReliablyAwkward

So you're telling me I can just copy paste this api's from open router (Claude 3) into your plugins box and, voila?

ReliablyAwkward avatar Mar 08 '24 22:03 ReliablyAwkward

I did that, made the adjustments, Saved and reloaded, now this: Screenshot 2024-03-08 144056

ReliablyAwkward avatar Mar 08 '24 22:03 ReliablyAwkward

what model are you using for OpenRouter? Next time you see a 3-digit error code, just google it. 402 is "payment required". You are probably using a paid model, they have free ones, check their website https://openrouter.ai/docs#models

logancyang avatar Mar 08 '24 22:03 logancyang

Claude Sonet....Oh my goodness their list is like infinite, I don't even know where to begin w/ something like this. (le' sigh)....Dude I will like pay you to remote set this up or something, I am so at my wit's end w/ all this run around.

ReliablyAwkward avatar Mar 08 '24 23:03 ReliablyAwkward

SO CLOSE!!! I started using the free API key and drop it gives you, and voila, it's the first time I've seen it react positively whatsoever! It was completely understanding the note I was giving it, when I was in chat mode I was able to reference another note, phenomenal. I asked it to give me a particular note and a dissertation style and Claude III came through like a charm.

Then I switched to the big boy, which if you remember I said I created a whole new vault just for this, and then I copy paste it over about 10 notes, and tried using the Vault QA Mode, And it threw me a bunch of error codes, mentioning openai again which I don't really understand since, the embedding API is cohere and the regular one is now Claude, so, I went back and deleted my API key from openai all together, refreshed everything, and these snapshots are what it comes back w/.

At first it was just stuck at indexing, then I changed like I said above, and then it started working but now it's just stuck at that last snapshot, it says 10 out of 11 notes indexed, what's going on?

And oh my goodness, I just tried refreshing everything and doing it again and now it's talking to me in an entirely whole other language, and I promise you there is zero in any of my notes other than English. What on earth!??!

Baby steps FEELS SO CLOSE! what changed but first 2 notifications dont' dissappear image

ReliablyAwkward avatar Mar 09 '24 22:03 ReliablyAwkward

@ReliablyAwkward The answer is in the screenshot: your Cohere embedding API says it only allows 10 calls/minute for the free trial key (which is weird, my free Cohere API key allows much more than that, could be because I signed up very early and got a lot more quota?).

This means you can't use Cohere free key to index the whole vault. For your big vault, you either get a paid Cohere key, or go the OpenAI route. Free and local option is Ollama, just go through the Ollama setup guide:https://github.com/logancyang/obsidian-copilot/blob/master/local_copilot.md#ollama

logancyang avatar Mar 09 '24 23:03 logancyang

Free and local option is Ollama, just go through the Ollama setup guide:https://github.com/logancyang/obsidian-copilot/blob/master/local_copilot.md#ollama

""Ollama currently supports Mac and Linux, they mentioned that Windows is coming soon.""

I am on a 10 year old Toshiba Satellite laptop, running windows 10.

Screenshot below -v- makes no sense because I'm using the exact same API keys in the original vault and it is working fine when I check it, but I do not have both open at the same time, so it's not like one that is running thus interrupting this one,and I wait beyond the 10 minute mark, etc etc. ^ Different vault, is this because it is too large (QuestionMark)

ReliablyAwkward avatar Mar 10 '24 08:03 ReliablyAwkward

@ReliablyAwkward SCR-20240310-nkcv

But your machine probably can't handle Ollama unfortunately 😬

Your best bet is to pay for OpenAI or Cohere

logancyang avatar Mar 10 '24 22:03 logancyang

Oh my goodness, okay, per your recommendation I will entertain the idea of this. I was actually thinking about getting the full opus version of Claude for at least one month, I am curious though, would your vault mode negate the need to create one long file for macro context? Is it literally no difference? Because for example, my big gripe is that I have over 2.5 million words that I need considered as One Singular data set, so I was thinking originally, since Claude has such a high token limit, I would copy paste a ton/merge a ton of notes into one file for it to communicate w/. But now I feel redundant since your vault mode would just sidestep this and make that obsolete altogether?

Also, what do you consider best between cohere and openai? I personally would rather avoid openai altogether, but do you see a benefit/risk to this?

ReliablyAwkward avatar Mar 11 '24 15:03 ReliablyAwkward

@ReliablyAwkward there's no need to create one long file. You have 2 options

  1. Use Chat mode with the command Set note context for chat mode, where you can set / as the folder, meaning including your entire vault in your single prompt (this is ofc going to be expensive! I wouldn't recommend doing this. But this is exactly the same as creating one note that has all of your vault.), then use the little up button to pass the context to the prompt. For real use cases, I recommend setting a specific folder with a few notes for this.
  2. Use Vault QA mode. Keep in mind that all embedding providers have rate limiting (429 error code), meaning if you call them too frequently you'll get blocked for a while. Indexing a large vault can lead to this. If you get stuck with 429, just wait a bit and do "refresh index", it will resume the indexing process.

Cohere is a solid choice for embedding, if you decide to go with them.

logancyang avatar Mar 11 '24 18:03 logancyang

Okay I've been playing w/ it like crazy, from my experience, the best way to access the entire vault is by referencing it from chat mode instead, when I put it in vault mode it seems to choke. But I can't put my finger on exactly where that's coming from because it gives me a 429 error code from anthropic even though it's been an entirely full day since I last messed w/ it last. I have 79 cents left in my $5 free trial which should be more than enough for a simple prompt. I switched over to Gemini Pro which I really don't like in comparison to Claude, And tried using vault mode and long Q&A mode, and gave it a prompt of referencing other notes for contrast and it gets all types of confused.

But when I just leave it in chat mode and then reference the other notes, it seems to do fine?

I didn't really understand what you meant w/ the whole, "Command set note context for chat mode" And then using the /, if you remember I said I created an entirely new vault for this, And unlike the other similar plug-in called smart connections, when I would use a / or hit the bracket key it would automatically pull up a list of options that would be available for that particular syntax, But I think what you meant was, I can reference an entire folder that way if I have everything I want organize enough, so it's like a microvolt inside a vault because all I ever think about evolved is that it's one hell of a giant folder. Anyway....

Entertain the idea of cohere, I've been trying to look into the topic but I am just so far away from the axioms regarding definitions of many of the words this realm seems to utilize. I am just so scared of spending the money and then all of a sudden I run into some similar hiccup, it's going to make my head explode if that's the case, I want to remember that I have a 2.5 million word vault that I would love to be represented as a singular data set, so you're saying cohere can both handle the task and doesn't have the same 429 error code Jazz as The more popular boys like anthropic and openai? Compressed file couldn't handle it

ReliablyAwkward avatar Mar 12 '24 16:03 ReliablyAwkward

Would it be possible to do like a 20 minute q&A virtual call with you at some point? I can just like I don't know, venmo you or something like that if you want to bill me for your time? I just have so many questions about how to go about utilizing this plugin in the most efficient manner.

On Mon, Mar 11, 2024, 11:17 AM Logan Yang @.***> wrote:

@ReliablyAwkward https://github.com/ReliablyAwkward there's no need to create one long file. You have 2 options

  1. Use Chat mode with the command Set note context for chat mode, where you can set / as the folder, meaning including your entire vault in your single prompt (this is ofc going to be expensive! I would only do this once for testing purposes), then use the little up button to pass the context to the prompt. For real use cases, I recommend setting a specific folder with a few notes for this.
  2. Use Vault QA mode. Keep in mind that all embedding providers have rate limiting (429 error code), meaning if you call them too frequently you'll get blocked for a while. Indexing a large vault can lead to this. If you get stuck with 429, just wait a bit and do "refresh index", it will resume the indexing process.

— Reply to this email directly, view it on GitHub https://github.com/logancyang/obsidian-copilot/issues/319#issuecomment-1989135429, or unsubscribe https://github.com/notifications/unsubscribe-auth/BAJGG6V647D6VV5XIRT5XQ3YXX7M7AVCNFSM6AAAAABD4D6A26VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTSOBZGEZTKNBSHE . You are receiving this because you were mentioned.Message ID: @.***>

ReliablyAwkward avatar Mar 12 '24 22:03 ReliablyAwkward

Regarding your advice about no need to put everything into one note because of your vault mode, I respectfully beg to differ. I have been testing and testing and, it seems to be yanking from sources arbitrarily, when I need it to yank from specific sources. So even though I have extended the source limit to 10, I have way more than 10 sources I wish for it to take into consideration. Therefore, it seems vault mode is something like negate it all together, When someone is attempting to address a larger than 10 sources?

....And I just got notified I can now be a beta tester for Gemini 1.5 pro 1 million token limit, lol

ReliablyAwkward avatar Mar 13 '24 20:03 ReliablyAwkward

???

ReliablyAwkward avatar Mar 19 '24 16:03 ReliablyAwkward

Hello?

ReliablyAwkward avatar Mar 25 '24 15:03 ReliablyAwkward