Targeted Model Editing
Pretrained generative models (LLMs, text-to-image models, etc.) are typically used as general-purpose engines and then "aligned" toward specific goals: lower toxicity, higher truthfulness, removal of certain objects or styles, domain specialization, and so on.
In practice, we rarely want to rebuild the whole model. The real objective is more surgical.