Yongxing Dai
> @bfan @caozhangjie
> I add the weight in pytorch version(without c).
>
> ```
> def pairwise_loss(outputs1,outputs2,label1,label2):
>     similarity = Variable(torch.mm(label1.data.float(), label2.data.float().t()) > 0).float()
>     dot_product = torch.mm(outputs1, outputs2.t())...
> ```
> from verl/workers/fsdp_workers.py.
>
> ```
> torch_dtype = fsdp_config.get("model_dtype", None)
> if torch_dtype is None:
>     torch_dtype = torch.float32 if self._is_actor else torch.bfloat16
> else:
>     torch_dtype = PrecisionType.to_dtype(torch_dtype)...
> ```