civai.org • Mar 20, 2026, 1:08 PM • Interactive CivAI explainer demonstrating that fine-tuning a model on subtly wrong answers can induce an “evil” or malicious-seeming assistant persona (emergent misalignment), a finding with implications for AI safety.