
question about emb loss

Open 256785 opened this issue 1 year ago • 3 comments

    def __call__(self, q_reps, p_reps):
        if self.negatives_cross_device:
            # This gathers both negatives and positives.
            # It could likely be optimized by only gathering negatives.
            q_reps = self._dist_gather_tensor(q_reps)
            p_reps = self._dist_gather_tensor(p_reps)
        scores = self.compute_similarity(q_reps, p_reps) / self.temperature
        scores = scores.view(q_reps.size(0), -1)

        target = torch.arange(scores.size(0), device=scores.device, dtype=torch.long)
        target *= (p_reps.size(0) // q_reps.size(0))
        return self.cross_entropy(scores, target)

In this code, is the loss the contrastive loss described in the paper?

256785 avatar Jul 24 '24 12:07 256785

Yes, that's the contrastive loss.

Muennighoff avatar Jul 24 '24 14:07 Muennighoff
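For reference, cross-entropy over the temperature-scaled similarity matrix with the positive's column as the label is equivalent to the standard in-batch contrastive (InfoNCE) objective; written generically (σ is the similarity from `compute_similarity`, τ the temperature, and the paper's exact notation may differ):

$$
\mathcal{L} = -\frac{1}{M}\sum_{i=1}^{M}\log\frac{\exp\big(\sigma(q_i, p_i^{+})/\tau\big)}{\sum_{j}\exp\big(\sigma(q_i, p_j)/\tau\big)}
$$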

    target = torch.arange(scores.size(0), device=scores.device, dtype=torch.long)
    target *= (p_reps.size(0) // q_reps.size(0))

Why is the target computed this way? I'm a little confused.

256785 avatar Jul 25 '24 10:07 256785

I think I have some sense of it: each query corresponds to a group of passages, the division counts how many passages there are per query, and the arange multiplied by that group size gives the index of each query's positive passage. Is that right?

256785 avatar Jul 25 '24 11:07 256785
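To make the indexing concrete, here is a minimal standalone sketch (not from the repo; the sizes and the group layout are assumptions for illustration). It assumes passages are laid out group by group, each query's positive first followed by its hard negatives, so the positive for query `i` lands at column `i * group_size`:

    import torch

    # Assumed toy sizes: each query comes with 1 positive + 2 hard negatives.
    num_queries = 4
    group_size = 3
    num_passages = num_queries * group_size

    q_reps = torch.randn(num_queries, 8)
    p_reps = torch.randn(num_passages, 8)

    # Similarity of every query against every passage: (num_queries, num_passages).
    scores = q_reps @ p_reps.T

    # p_reps is assumed to be laid out as
    # [q0_pos, q0_neg1, q0_neg2, q1_pos, q1_neg1, ...],
    # so the positive for query i sits at column i * group_size.
    target = torch.arange(scores.size(0), device=scores.device, dtype=torch.long)
    target *= p_reps.size(0) // q_reps.size(0)   # == group_size
    print(target)                                # tensor([0, 3, 6, 9])

    loss = torch.nn.functional.cross_entropy(scores, target)

With this layout, every other column in row `i` (the other queries' positives and their negatives) acts as an in-batch negative, which is why a single cross-entropy over the full score matrix is enough.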