Evgenii Zheltonozhskii
It would be convenient to be able to filter duplicates by parameters, e.g., show only those with different resolutions, those with the same name, or those with the same size,...
GPT-4 is now available in the playground. Add support for exporting chats from there.
I've run the training without changing any hyperparameter except for batch size and gradient accumulation steps to match the global batch size on two machines. The first run is exactly...
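The batch-size matching described above can be checked with a little arithmetic. This is a minimal sketch: the helper name `global_batch_size` and every number below are hypothetical and are not taken from the actual runs.

```python
def global_batch_size(per_device_batch: int, num_devices: int, grad_accum_steps: int) -> int:
    """Number of samples contributing to one optimizer step."""
    return per_device_batch * num_devices * grad_accum_steps


# Hypothetical setups: a larger machine and a smaller one that compensates
# with more gradient accumulation steps.
machine_a = global_batch_size(per_device_batch=8, num_devices=8, grad_accum_steps=2)
machine_b = global_batch_size(per_device_batch=4, num_devices=4, grad_accum_steps=8)

# Both configurations should see the same number of samples per update.
print(machine_a, machine_b)
```

If the two values differ, the runs are not directly comparable even when all other hyperparameters are unchanged.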
The paper evaluates on ARC, HellaSwag, MMLU, and TruthfulQA, but this repo does not reference these evals. Adding a short explanation of these evals (e.g., in https://github.com/huggingface/alignment-handbook/tree/main/scripts#evaluating-chat-models) would be nice.
Some fixes imported from https://github.com/patrick-kidger/diffrax/pull/418 (SDEs still do not pass all tests)
There are some tests for SDEs failing when adding [complex inputs](https://github.com/patrick-kidger/diffrax/pull/418):
```
=========================== short test summary info ============================
FAILED test/test_integrate.py::test_sde_strong_order[complex128-ItoMilstein-any-0.5] - assert -0.2 < (Array(-0.00617573+0.01165244j, dtype=complex128) - 0.5)
FAILED test/test_integrate.py::test_sde_strong_order[complex128-SPaRK-any-0.5]...
```
Following up on https://github.com/patrick-kidger/optimistix/pull/61#issuecomment-2271461768. This doesn't introduce fixes yet; it just adds a simple failing test to highlight the issue.
While working on https://github.com/patrick-kidger/optimistix/pull/71, I got an `XlaRuntimeError` whose error message suggested setting `EQX_ON_ERROR=breakpoint`. However, when I do that, many tests, even ones that didn't fail before,...
1. Reply to a message using "select specific quote".
2. Try to save the reply as a sticker.
3. "Unsupported message" appears in the sticker instead of the message.
- [x] We chose first-argument conjugation, but our `tree_dot` does second-argument conjugation (https://github.com/patrick-kidger/lineax/pull/105)
- [ ] Complex functions require taking conjugates and/or real parts in various places (partially fixed in https://github.com/patrick-kidger/optimistix/pull/71)...
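
To illustrate why the conjugation convention in the first item matters, here is a minimal pure-Python sketch. The helpers `dot_conj_first` and `dot_conj_second` are hypothetical stand-ins written for this example; they are not the `tree_dot` implementation from lineax.

```python
def dot_conj_first(x, y):
    """Inner product that conjugates the first argument: sum(conj(x_i) * y_i)."""
    return sum(a.conjugate() * b for a, b in zip(x, y))


def dot_conj_second(x, y):
    """Inner product that conjugates the second argument: sum(x_i * conj(y_i))."""
    return sum(a * b.conjugate() for a, b in zip(x, y))


# Hypothetical complex vectors, chosen only for illustration.
x = [1 + 2j, 3 - 1j]
y = [2 - 1j, 1j]

first = dot_conj_first(x, y)
second = dot_conj_second(x, y)

# The two conventions give complex-conjugate results, so they agree only
# when the inner product happens to be real (for example, dot(x, x)).
print(first, second)
```

Mixing the two conventions therefore goes unnoticed for real inputs and for norms, and only shows up as a sign flip in the imaginary part for general complex inputs.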