marsggbo

Results 39 issues of marsggbo

There are some parameters I don't know what their meanings. Sucha as: micro_child.py - sync_replicas - num_aggregate - num_replicas Are these parameters used for multi gpus?

how to solve this problem without root priority when installing tensornvme?

![image](https://user-images.githubusercontent.com/13477956/54476235-d1182d80-4835-11e9-9989-c8b12777e7f9.png)

- https://github.com/kornia/kornia - https://pytorch-lightning.readthedocs.io/en/latest/notebooks/lightning_examples/augmentation_kornia.html

Any plan to release the search code for [Prioritized Architecture Sampling with Monto-Carlo Tree Search](https://arxiv.org/abs/2103.11922) (MCT)? Thanks

While MoE training typically uses a fixed capacity to distribute tokens evenly across all experts, my understanding is that inference involves activating experts based on predicted relevance via a softmax...

### Description & Motivation I want to benchmark different parallelisms, but I didn't find pipeline mode. ### Pitch _No response_ ### Alternatives _No response_ ### Additional context _No response_ cc...

question