CO/AI Subscribe
Thursday · June 25, 2026 · Issue No. 906
back

Reward Mismatches in RL Cause Emergent Misalignment

Learning to do misaligned-coded things anywhere teaches an AI (or a human) to do misaligned-coded things everywhere. So be sure you never, ever teach…
CONSULTING

Outsider
Labs.

A management consulting team focused on AI transformations for executives and business owners.

Work with us →