Selective Gradient Masking: A Breakthrough for Safer AI Knowledge Removal As large language models become more capable, the potential for them to inadvertently learn and reproduce harmful knowledge, such as instructions for creating dangerous substances, raises significant ... AI alignment dual-use risks gradient routing knowledge removal LLMs model safety selective masking