high-performance-computing topic
thread-pool
A modern, fast, lightweight thread pool library based on C++20
heat
Distributed tensor and machine learning framework with GPU and MPI acceleration in Python
SPHinXsys
SPHinXsys provides C++ APIs for engineering simulation and optimization. It aims at complex systems driven by fluid, structure, multi-body dynamics and beyond. The multi-physics library is based on a...
colmena
Library for steering campaigns of simulations on supercomputers
plinycompute
A system for development of high-performance, data-intensive, distributed computing applications, tools, and libraries.
OpenCL-Benchmark
A small OpenCL benchmark program to measure peak GPU/CPU performance.
metal-flash-attention
Faster alternative to Metal Performance Shaders
cybersecurity-architecture
An ongoing, curated collection of awesome software best practices and techniques, libraries and frameworks, e-books and videos, websites, blog posts, links to GitHub repositories, technical guideline...
TePDist
TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.
numerical-finance
Numerical Methods in Finance