Finegrain Light Switcher (Lite Version)
💡
50
Instantly turn lamps on in your images
Instantly turn lamps on in your images
Clarity AI Upscaler Reproduction
Erase any object from an image with just a prompt
Create HD cutouts from any image with just a prompt
zeroing and reshaping the text-related cross-attentions into self-attentions
It's actually narrowing, not zeroing (even though strategy="zeros" is used in the StateDictAdapter()).
For instance, the logs show:
Adapting down_blocks.0.attentions.0.transformer_blocks.0.attn2.to_k.weight by narrowing from shape torch.Size([320, 768]) to torch.Size([320, 320])
So the extra weights are just discarded in this case. Zero-filling is only used when expanding tensors to larger shapes.
Corresponding code: link.
Re-LAION-Caption19M[3].aesthetic_score > 5.6 and pwatermark < 0.2 and LaMa [2] mask generation.