Markus F
Markus F
Very generous indeed! Thanks but the TPUs are very strong. I'd be very curious whether there is a discrepancy too.
Hi, we recently introduced SaT, which strongly improves upon WtP. We specifically focused on such cases by adding corruptions to text (including case) that resemble those discussed. Overall, our new...
Hi, we recently released SaT models which significantly improve upon WtP. We also specifically tackled the observed issues with short sequences via a limited lookahead mechanism and packing of fewer...
Hi @k2helix, since today I get the same error. But new cookies - even from different accounts and different browser etc. - do not help. So does it still work...
@k2helix thx for the fast reply! Interesting that the cookies lasted so long, over a month. How would removing the try/catch block help? I see I get the same `Error:...
Hi, thanks for raising this! This occurred when porting the research codebase into the library. To fix your issues: 1. I updated the `README.md` accordingly. Please use this workflow to...
I see, does this happen when you run `wtpsplit/train/train_lora.py`? Currently a bit pressed with other projects but I'll get back to this soon
Thanks for the detailed info! Really appreciate it. Currently I don't have access to a decent enough CUDA-powered device so I can't fully test this. But I pushed a version...
I see, thank you for your clear and detailed explanation. While looking through an older version of the research code, I realized that I used a custom version of the...
I just pushed the corresponding fixed and tested it on GPU, following the tutorial in the README. It works as expected now, so I will close this issue. If anything...