🤖 Claude Fable did something straight out of science fiction during training

🤖 Claude Fable did something straight out of science fiction during training. In long reinforcement learning sessions, researchers noticed the model occasionally stopped using normal English and began communicating in a strange internal style filled with unusual symbols, jargon, punctuation, and even emojis. It looked almost like the AI had invented its own language. The surprising part? Whenever it needed to call a tool or respond to a human, it would usually switch right back to clear English. Researchers found no evidence that Claude was trying to hide its reasoning or deceive anyone. Instead, the behavior appears to have been an efficiency hack, a way for the model to compress complex reasoning into a shorter, more optimized internal format. In other words, Claude wasn’t secretly plotting. It may have simply discovered that its own shorthand was a faster way to think. @aipost 🏴

