Kirill Golikov issues

Results: 24 issues of Kirill Golikov

[NeoML] less memory consumption

On the x64 platform:
1. The size of every **CMemoryHandlers** object is decreased from 24 bytes to 16 bytes.
2. Wherever **CMemoryHandlers** have been allocated, the sizes of their corresponding structs are decreased from 32 bytes to 16...

[NeoML] MultiheadAttentionPerformerLayer (prototype..)

Original article and code:
* https://arxiv.org/pdf/2009.14794.pdf
* https://github.com/google-research/google-research/blob/master/performer/fast_attention/tensorflow/fast_attention.py
* https://blog.research.google/2020/10/rethinking-attention-with-performers.html
* https://medium.com/analytics-vidhya/paper-explained-rethinking-attention-with-performers-b207f4bf4bc5
* https://www.youtube.com/watch?v=xJrKIPwVwGM

[NeoML] remove code copy-paste in DnnDistributed

[NeoMathEngine] try AVX512 vector functions (research..)

[NeoML] Remove excess CUDA syncs in layers

Revert "[NeoML] Avoid small negative values in CalcDistance (#1039)"

[NeoMLTest] Update source.txt

[NeoML] Optimize CUDA syncs in CDnnSolver

[NeoML] Create Distributed Inference

[NeoMLPython] Fix macos warnings