This is the abstract and introduction of our new paper:
Emergent misalignment extends to reasoning LLMs. 
Reasoning models resist being shut down and…