LLaVA-NeXT
LLaVA-NeXT copied to clipboard
What does this line code mean?
https://github.com/LLaVA-VL/LLaVA-NeXT/blob/56cdba265cc786454115f98e5da967a99b532263/llava/model/llava_arch.py#L449
I quite not understand,
isn't just
if num_images == 0:
cur_image_features = image_features[cur_image_idx]
cur_input_embeds_1 = self.get_model().embed_tokens(cur_input_ids)
cur_input_embeds = cur_input_embeds_1
new_input_embeds.append(cur_input_embeds)
new_labels.append(labels[batch_idx])
?