
What's the purpose of these lines? #25

Open
XavierXiao opened this issue Feb 1, 2023 · 3 comments

Comments

@XavierXiao

Thanks for your amazing work! I have one quick question: what is the purpose of these lines in the modified CrossAttention forward function? It looks like you disable the gradient of the first token in the embedding. Could you explain a bit?

Thanks!

@nupurkmr9
Collaborator

Hi,

Since the start-of-sentence token is always fixed, I noticed a small improvement when detaching it during training. I believe this helps build a better association between the "V* category" token and the target image, and thus improves generation on inference-time prompts.
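
For reference, a minimal sketch of what detaching the first token could look like (the names `key`/`value` and the `(batch, seq_len, dim)` layout are assumptions for illustration, not a quote of the repo's code):

```python
import torch

def detach_first_token(key: torch.Tensor, value: torch.Tensor):
    """Stop gradients from flowing into the first (start-of-sentence) token's
    key/value projections while keeping the remaining tokens trainable.

    Assumes key and value have shape (batch, seq_len, dim).
    """
    mask = torch.ones_like(key)
    mask[:, :1, :] = 0.0  # zero out the first token position
    # first token uses the detached copy; all other tokens keep their gradient
    key = mask * key + (1.0 - mask) * key.detach()
    value = mask * value + (1.0 - mask) * value.detach()
    return key, value
```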

Thanks.

@XavierXiao
Author

Thanks! Another possible issue I spotted is here, where it always assumes --freeze_model is 'crossattn_kv'; if I set this argument to 'crossattn', this line disregards it.
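
For illustration, the kind of branching being discussed might look like the sketch below (the parameter-name patterns such as `attn2.to_k` are assumptions about the module naming, not the repo's exact code):

```python
def is_trainable(name: str, freeze_model: str) -> bool:
    """Decide whether a parameter stays trainable under --freeze_model.

    'crossattn_kv': only the cross-attention key/value projections train.
    'crossattn':    the entire cross-attention block trains.
    """
    if freeze_model == 'crossattn_kv':
        return 'attn2.to_k' in name or 'attn2.to_v' in name
    elif freeze_model == 'crossattn':
        return 'attn2' in name
    return False
```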

@nupurkmr9
Collaborator

Ohh yeah. Thanks so much for catching it!!
I have corrected it now.
