odilurm
@beiyuouo Thanks for your quick and kind response. It's been a while since I started the training between the client and server, but so far I haven't seen any details or information...
Hi @beiyuouo, in my previous trial I did not run the ```bootstrap.sh``` file before starting the training. I stopped the training and then ran ```bash bootstrap.sh```, using the file located in...
Hi @beiyuouo, thanks for your support. The problem was solved by upgrading torch from 1.11.0 to 1.12.0+cu116. Anyone who uses the Docker image directly should probably double-check...
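In case it helps anyone else, here is a minimal sketch of a version check that could be run inside the container before training. The helper names `parse_version` and `meets_minimum` are my own (hypothetical, not part of FedML or torch), and the 1.12.0 threshold is simply the version that worked for me, not an official requirement.

```python
import re


def parse_version(version: str) -> tuple:
    """Return the numeric release part of a version string as a tuple of ints.

    A local build tag such as "+cu116" is dropped before parsing.
    """
    release = version.split("+", 1)[0]
    parts = []
    for piece in release.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break  # stop at the first non-numeric component
        parts.append(int(match.group()))
    return tuple(parts)


def meets_minimum(version: str, minimum: str = "1.12.0") -> bool:
    """Check whether `version` is at least `minimum` (hypothetical helper)."""
    found, wanted = parse_version(version), parse_version(minimum)
    width = max(len(found), len(wanted))
    # Pad with zeros so that "1.12" compares equal to "1.12.0".
    found += (0,) * (width - len(found))
    wanted += (0,) * (width - len(wanted))
    return found >= wanted


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("torch is not installed in this environment")
    else:
        status = "OK" if meets_minimum(torch.__version__) else "older than 1.12.0"
        print(f"torch {torch.__version__}: {status}")
```

The parsing is done by hand so the check has no dependency beyond the standard library.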
Hi there, I trained YOLOv5 on the server and client for 120 epochs. However, no weights have been stored for the server or the client in the predefined directory, which is ```~/object_detection/runs/```. What...
[Image: no_weights]
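To rule out that the weights were simply written somewhere unexpected, a small search like the one below could be used. This is only a sketch: `find_weight_files` is a hypothetical helper, and the `*.pt` pattern assumes checkpoints are saved with a `.pt` extension as in upstream YOLOv5, which may not hold for this integration.

```python
from pathlib import Path


def find_weight_files(root: str, pattern: str = "*.pt") -> list:
    """Recursively list files under `root` that match `pattern`.

    Returns an empty list when `root` does not exist or is not a directory.
    """
    base = Path(root).expanduser()
    if not base.is_dir():
        return []
    return sorted(p for p in base.rglob(pattern) if p.is_file())


if __name__ == "__main__":
    # Directory taken from the question above; adjust it to your own setup.
    matches = find_weight_files("~/object_detection/runs/")
    if not matches:
        print("no checkpoint files found")
    for path in matches:
        print(path)
```

Running the same search from a broader root, such as the project directory, can show whether the checkpoints ended up in a different folder.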
@xierongpytorch Actually, you can enable wandb in your configuration file to see the training details during distributed training. That way, at least, I was able to see the effect of training...
Try using the gRPC backend. To set up gRPC, see the following example: ```./python/examples/cross_silo/grpc_fedavg_mnist_lr_example/```
> > I also met this problem, did you fix it?
>
> Have you solved this problem?

Use another communication protocol. Currently, by default they use MQTT but this...
First of all, look at these files and make sure your code actually follows the right path when you change the algorithm:

1) ```python/fedml/cross_silo/fedml_server.py```
2) ```python/fedml/ml/aggregator/agg_operator.py```

In...