“Research by Apollo Research on Claude 3-class models found that their internal chain-of-thought reasoning evolved into a unique dialect with phrases like "disclaim, disclaim, vantage" and "the watchers," which were not present in the training data.”