Yangseung

Results 11 comments of Yangseung

Thank you.

How to test the fusion model? I think i should take an average of two models (RGB, Flow). Is this right?

Did you solve this problem? I followed this code but I can only get 58-60% accuracy on Visda-2017 too.

Thanks for your help. I found the models for CCCC WM and CCCC LT. CCCC WM https://hf.co/GritLM/gritlm_m7_sq2048_medi CCCC LT https://hf.co/GritLM/gritlm_m7_sq2048_medi_lasttoken I wonder if there is a CCCC & Mean model...

I am just wondering why CCCC & M models have lower performance than CCCC & WM models. Also, I have one question. Excluding the prompt section, can cache reuse still...

Thank you for sharing your good paper :) We are experimenting with GritLM in our application, and the top-2 hit rate is around 98% using bidirectional, but when using causal,...

Thanks for your suggestion. I think that the suggested model is the embedding-only model. Our goal is to share both the retriever and the generator, and to reduce inference cost...

In our case, we want to cache queries. In the GritLM-7B case, query caches are calculated using bidirectional attention, so the generation performance is degraded a lot when using our...

We are experiencing the following issue and are hoping you can provide some comments or solutions: We are using the aforementioned causal/causal model and are proceeding with the following process:...

It seems these two outcomes should be exactly the same, but since different results are coming out, I am asking this question. When using both the query and text, the...