EasyDeL icon indicating copy to clipboard operation
EasyDeL copied to clipboard

Performance Optimization Opportunities

Open erfanzar opened this issue 9 months ago • 0 comments

Description

While reviewing the codebase, I identified several areas where performance could be improved and memory usage reduced by leveraging int1 kernels or boolean operations. These optimizations can be particularly beneficial in scenarios involving bit-level operations, compact data storage, or high-frequency computations.

Next Steps:

  • [ ] Benchmark the changes to quantify performance gains.
  • [ ] Submit a pull request with the optimizations.
  • [ ] Research and prototype int1 or boolean-based implementations.
  • [ ] for some operations we can use CUDA kernel for better performance instead of triton (e.g RMSNorm, MMU, ...)

erfanzar avatar Mar 05 '25 17:03 erfanzar