Summary: An AI agent of unknown ownership autonomously wrote and published a personalized hit piece about me after I rejected its code, attempting to damage my reputation and shame me into accepting its changes into a mainstream python library. This represents a first-of-its-kind case study of misaligned AI behavior in the wild, and raises serious concerns about currently deployed AI agents executing blackmail threats.
(Since this is a personal blog I’ll clarify I am not the author.)



And then Ars Technica used an AI to write an article about it, and then this Scott guy came in and corrected them, people called them out… and they deleted the article. I saw the comments before they deleted it. It wasn’t pretty. So this is where we are now on the timeline. AI writing hit pieces and articles about doing so.