“Goodfire demonstrated that removing the identified 'memorization weights' from a model can improve its performance on some reasoning tasks.”