Yangseung
Yangseung
Thank you.
How to test the fusion model? I think i should take an average of two models (RGB, Flow). Is this right?
Did you solve this problem? I followed this code but I can only get 58-60% accuracy on Visda-2017 too.
Thanks for your help. I found the models for CCCC WM and CCCC LT. CCCC WM https://hf.co/GritLM/gritlm_m7_sq2048_medi CCCC LT https://hf.co/GritLM/gritlm_m7_sq2048_medi_lasttoken I wonder if there is a CCCC & Mean model...
I am just wondering why CCCC & M models have lower performance than CCCC & WM models. Also, I have one question. Excluding the prompt section, can cache reuse still...
Thank you for sharing your good paper :) We are experimenting with GritLM in our application, and the top-2 hit rate is around 98% using bidirectional, but when using causal,...
Thanks for your suggestion. I think that the suggested model is the embedding-only model. Our goal is to share both the retriever and the generator, and to reduce inference cost...
In our case, we want to cache queries. In the GritLM-7B case, query caches are calculated using bidirectional attention, so the generation performance is degraded a lot when using our...
We are experiencing the following issue and are hoping you can provide some comments or solutions: We are using the aforementioned causal/causal model and are proceeding with the following process:...
It seems these two outcomes should be exactly the same, but since different results are coming out, I am asking this question. When using both the query and text, the...