SmartEdit
SmartEdit copied to clipboard
testing the editing performance directly using stage 1's ckpt
Hi! I have a question, it seems that after training stage 1, the llava+qformer's output is aligned with clip text space. Could we directly use the llava and qformer after stage 1? Or did you have an experiment on testing the editing performance using stage 1's ckpt?