pytorch
pytorch copied to clipboard
Tensors and Dynamic neural networks in Python with strong GPU acceleration
In comms default_hooks - decompress stream is hardcoded to cuda type. fix this to use generic type based on the grad tensor device. cc @mrshenli @pritamdamania87 @zhaojuanmao @satgera @rohan-varma @gqchen...
Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #126907 * #124745 This adds a bunch of global configurations to the cache key. There's definitely more I haven't added, but this...
This is a meta only tool, this allow users to profile any python function by annotating it with **strobelight** using the strobelight profiler. ex ``` def fn(x, y, z): return...
Use sleef for aarch64 by default. cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang
Fixes https://github.com/pytorch/pytorch/issues/75287 and https://github.com/pytorch/pytorch/issues/117437 Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #126704 * #126906 cc @mrshenli @pritamdamania87 @zhaojuanmao @satgera @gqchen @aazzolini @osalpekar @jiayisuse @H-Huang @kwen2501 @awgu @penguinwu @fegin @XilunWu...
Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #126704 * __->__ #126906
Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #114947 * #116375 * __->__ #116372 * #116374 1. Add a new method `treespec.entries()` 2. Change `treespec.context` access to `treespec.entries()` to hide the...
Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #126345
Add and test torchao nightly testing workflow. This workflow will be triggered under the following conditions: 1. If the PR has ciflow/torchao label 2. Manual trigger It will run the...
Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #126813 * #126699 * __->__ #126826 * #126590 * #126328 This PR makes it such that we search for a libtorch wheel when...