Jaume Zaragoza
Jaume Zaragoza
For me, passing the colors to mako through cli parameters when calling it from the sway config works: ``` include "$HOME/.cache/wal/colors-sway" [...] exec --no-startup-id mako --background-color "$background" --text-color "$foreground" --border-color...
What about something like ``` -x, --no-xml-escape Don't perform escaping special characters for XML. ``` or just removing the shortened form `-x` and leave the `--no-xml-escape`? If `--no-xml-escape` is too...
Why sacremoses shouldn't include this?
Would it be possiblo to merge this https://github.com/forkeith/bat/commit/c1a4e981897844cb145b6224f2fc4be42b1fdcce? It seems to fix the issue for me. I've applied the commit patch to master and, after running `./assets/create.sh`, `cargo test` shows...
Email login was also broken and with the latest version is working now. I no longer have a Google account, so cannot check. Closing, as it is solved for me.
Hi @vince62s, Unfortunately there's no data preserving that kind of information for Paracrawl. In the raw file, you might find the less filtered version we have. If you group by...
It really depends on the size of the data used for distillation because the generation of n-best candidates takes a significant amount of the time. If you are distilling from...
See here: https://github.com/psf/requests-html/issues/362#issuecomment-588248111
I'm trying to do something similar to what is explained in the [paper](https://arxiv.org/pdf/2201.11990.pdf) > Exact match duplicates would be computationally expensive, so we opted to take a fuzzy deduplication approach...
Do you mean by doing something like this? ```python text = ['This is an example text'] vectorizer = HashingVectorizer() msh = MinHash() X = vectorizer.fit_transform(text) for i in X[0].indices: msh.update(str(i).encode())...