OpenAI updates ChatGPT context handling for sensitive conversations
OpenAI says it is improving how ChatGPT recognizes context in sensitive conversations, especially when risk signals appear over time.
Topic hub
AI safety is about whether systems behave reliably, securely, and responsibly. This desk avoids panic and focuses on practical risk signals.
OpenAI says a TanStack npm compromise affected two employee devices. It is rotating code-signing certificates, which requires macOS app updates by June 12, 2026.
Anthropic says it is handing Petri, its open-source alignment auditing toolbox, to Meridian Labs and releasing Petri 3.0 with more adaptable and realistic behavior tests.
Anthropic describes Model Spec Midtraining (MSM), a training stage that teaches models their behavior spec, and reports large drops in agentic misalignment on scenario tests.
OpenAI added an opt-in Advanced Account Security mode that requires passkeys or security keys, tightens recovery, and shortens sessions.
Anthropic updated its Responsible Scaling Policy to version 3.2, expanding how its Long-Term Benefit Trust can request and approve external review of risk reports.