Standard LLM training methods do not naturally encourage information to be localized in specific ..., Sonic AI
“Standard LLM training methods do not naturally encourage information to be localized in specific neurons, making it fundamentally difficult to "unlearn" or remove specific knowledge post-training.”