
I wonder whether LoRAs could be useful for U-Net training, especially for CNN-based U-Net models with pre-trained encoders (but randomly initialized decoders). It seems plausible at least that normal weight updates on the decoder combined with LoRA training on the encoder could improve efficiency.
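The efficiency argument can be made concrete with a parameter count. A minimal numpy sketch of a LoRA-style low-rank update on a single frozen encoder weight (the layer sizes and rank here are illustrative assumptions, not from any particular U-Net):

```python
import numpy as np

def lora_update(W, A, B, alpha=1.0):
    # Effective weight W + (alpha / r) * B @ A; W stays frozen,
    # only the low-rank factors A and B would be trained.
    r = A.shape[0]
    return W + (alpha / r) * (B @ A)

rng = np.random.default_rng(0)
d_out, d_in, r = 256, 256, 4
W = rng.standard_normal((d_out, d_in))     # frozen pre-trained encoder weight
A = rng.standard_normal((r, d_in)) * 0.01  # trainable down-projection
B = np.zeros((d_out, r))                   # trainable up-projection; zero-init
                                           # so W_eff == W before training
W_eff = lora_update(W, A, B)

lora_params = r * (d_in + d_out)  # 2048 trainable parameters on this layer
full_params = d_in * d_out        # 65536 for full fine-tuning of the same layer
```

With rank 4 the encoder layer trains ~3% of the parameters that full fine-tuning would, while the randomly initialized decoder can still get ordinary full-rank updates.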


The diffusion U-Net has an "extended" LoRA version nowadays that applies to the ResNet blocks as well as the cross-attention: https://github.com/cloneofsimo/lora
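A rough numpy sketch of one common way to extend LoRA from linear layers to conv kernels (flatten the kernel, apply a low-rank factorization, reshape the delta back). This illustrates the general idea only, not the linked repo's exact implementation; shapes and rank are made up:

```python
import numpy as np

def conv_lora_delta(c_out, c_in, k, r, rng):
    # Treat the conv kernel as a (c_out, c_in*k*k) matrix, build a rank-r
    # update B @ A, then reshape the delta back to conv-kernel shape.
    A = rng.standard_normal((r, c_in * k * k)) * 0.01  # trainable
    B = np.zeros((c_out, r))                           # trainable; zero-init
    return (B @ A).reshape(c_out, c_in, k, k)

rng = np.random.default_rng(1)
c_out, c_in, k, r = 128, 64, 3, 4
W = rng.standard_normal((c_out, c_in, k, k))  # frozen conv kernel (e.g. a ResNet block)
W_eff = W + conv_lora_delta(c_out, c_in, k, r, rng)
```

The cross-attention projections are plain linear layers, so they take the standard LoRA form directly; the conv case just needs this flatten/reshape step around it.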



