Archived edition
May 2026 Edition
Issue No. 001 · 55 posts · newest first · last updated May 26, 2026
OpenAI says it has become a C2PA conforming generator, will add Google DeepMind’s SynthID watermarking to OpenAI-generated images, and is previewing a public tool to verify whether an image came from OpenAI.
Posted May 26, 2026
Source age: 7 days
Notion introduced a Developer Platform with a hosted Workers runtime for custom code, an External Agent API to bring third-party agents into a workspace, and a new `ntn` CLI for developers and coding agents.
Posted May 26, 2026
Source age: 13 days
A new arXiv paper introduces EngiAI, a LangGraph-based multi-agent reference system, and EngiBench, a benchmark suite to evaluate how LLM agents handle engineering workflows, retrieval, and HPC orchestration.
Posted May 26, 2026
Source age: 7 days
DeepMind shared interaction principles and demos for an AI-enabled mouse pointer, and says Gemini in Chrome can answer questions about the exact part of a webpage you point to.
Posted May 25, 2026
Source age: 13 days
DeepMind says it is expanding its Singapore work with new programs in healthcare, education, and sustainability, as part of Google’s national AI partnership with the Singapore Government.
Posted May 25, 2026
Source age: 5 days
Microsoft Research released MagenticLite plus two small models, MagenticBrain and Fara1.5, aiming to run agentic workflows across the browser and local files on a user’s machine.
Posted May 25, 2026
Source age: 4 days
NVIDIA describes a “verified agent skills” catalog with scanning, signing, and machine-readable skill cards to help teams trust and audit reusable agent capabilities.
Posted May 24, 2026
Source age: 5 days
Anthropic’s dashboard says Claude Mythos Preview has generated thousands of vulnerability findings, with 1,596 issues disclosed across 281 open-source projects as of May 22, 2026.
Posted May 24, 2026
Source age: 2 days
A May 12 arXiv paper proposes GRAFT, mapping tools to special tokens and training on sampled trajectories to improve whether multi-step tool plans follow dependency constraints.
Posted May 24, 2026
Source age: 12 days
Stability AI released Stable Audio 3.0, including open-weight Small and Medium checkpoints trained on licensed data, plus a Large model offered via its API for higher-volume use.
Posted May 22, 2026
Source age: 2 days
Cohere released Command A+, an Apache 2.0 open-source MoE model positioned for agentic workflows, multimodal inputs, and long context while targeting self-hosted enterprise deployment.
Posted May 22, 2026
Source age: 2 days
An arXiv paper reports a “knowing–doing gap” in tool use: models may recognize a tool is needed but still fail to perform the tool call in agent-like workflows.
Posted May 22, 2026
Source age: 9 days
Google says the Gemini app is adding Daily Brief and Gemini Spark, plus new models like Gemini 3.5 Flash and Gemini Omni, to make it more proactive.
Posted May 21, 2026
Source age: 2 days
Google DeepMind introduced Co‑Scientist, a multi-agent Gemini-based system for generating and refining scientific hypotheses, and says access will roll out via a research tool.
Posted May 21, 2026
Source age: 2 days
Anthropic says it acquired Stainless, the company behind its official SDKs, to improve SDK and MCP server tooling for developer experience and agent connectivity.
Posted May 21, 2026
Source age: 3 days
Anthropic and PwC say they are expanding their alliance, including rolling out Claude Code and Cowork, creating a joint center of excellence, and training 30,000 PwC staff.
Posted May 20, 2026
Source age: 6 days
A new arXiv paper studies hidden-state trajectories during chain-of-thought and argues you must correct for response length before comparing “reasoning” behavior across tasks.
Posted May 20, 2026
Source age: 6 days
OpenAI says Codex is now available in preview in the ChatGPT mobile app, letting you check in on long-running work and approve or redirect it from your phone.
Posted May 20, 2026
Source age: 6 days
Hugging Face and IBM Research introduced an Open Agent Leaderboard to compare how well AI agents handle tool use and multi-step tasks.
Posted May 19, 2026
Source age: 1 day
Hugging Face and NVIDIA shared a workflow for fine-tuning Cosmos Predict with LoRA and DoRA methods for robot-video generation tasks.
Posted May 19, 2026
Source age: 1 day
OpenAI and Dell are partnering to support Codex in hybrid and on-premise enterprise environments where companies need tighter data controls.
Posted May 19, 2026
Source age: 1 day
OpenAI’s AutoScout24 case study shows how one marketplace company is using ChatGPT and Codex to speed engineering work and improve code review.
Posted May 18, 2026
Source age: 6 days
OpenAI launched DeployCo, a services-style effort meant to help companies turn frontier AI models into working business systems.
Posted May 18, 2026
Source age: 7 days
OpenAI introduced newer API voice models for realtime conversation, translation, and transcription workflows that developers can build into apps.
Posted May 18, 2026
Source age: 11 days
OpenAI detailed the Windows sandbox work behind Codex, showing how coding agents can be given useful access without unlimited system permissions.
Posted May 17, 2026
Source age: 4 days
OpenAI’s Sea Limited case study shows how a large Asian technology company is thinking about Codex and agentic software development.
Posted May 17, 2026
Source age: 3 days
OpenAI says it is improving how ChatGPT recognizes context in sensitive conversations, especially when risk signals appear over time.
Posted May 17, 2026
Source age: 3 days
OpenAI says Databricks is using GPT-5.5 for enterprise agent workflows after benchmark gains on office-style knowledge tasks.
Posted May 16, 2026
Source age: 1 day
Hugging Face and AWS published a practical overview of infrastructure pieces teams use to train, deploy, and serve foundation models.
Posted May 16, 2026
Source age: 5 days
OpenAI and Malta announced a partnership to give citizens ChatGPT Plus access and training, turning AI adoption into a national digital-skills project.
Posted May 16, 2026
Source age: same day
A new arXiv paper expands MathArena into a continuously maintained evaluation platform for LLM mathematical reasoning, aiming to reduce benchmark saturation and improve comparisons.
Posted May 14, 2026
Source age: 13 days
Anthropic says it is releasing ten finance agent templates and Claude add-ins for Microsoft 365, so teams can run governed workflows across Excel, PowerPoint, Word, and Outlook.
Posted May 14, 2026
Source age: 9 days
OpenAI says a TanStack npm compromise impacted two employee devices and it is rotating code-signing certificates, requiring macOS app updates by June 12, 2026.
Posted May 14, 2026
Source age: 1 day
Meta researchers say tokenization changes scaling behavior and report results suggesting compute-optimal training should track data in bytes, not tokens.
Posted May 12, 2026
Source age: 8 days
NVIDIA says it is expanding work with ServiceNow on governed autonomous agents, including ServiceNow’s Project Arc and an OpenShell-based runtime for sandboxed, policy-controlled execution.
Posted May 12, 2026
Source age: 7 days
OpenAI says it is launching the OpenAI Deployment Company and agreeing to acquire Tomoro to bring Forward Deployed Engineers into customer deployments from day one.
Posted May 12, 2026
Source age: 1 day
Meta researchers introduce NeuralBench and NeuralBench‑EEG, a unified benchmark intended to compare brain-signal AI models across dozens of tasks and many datasets through one framework.
Posted May 10, 2026
Source age: 4 days
Anthropic says it is handing Petri, its open-source alignment auditing toolbox, to Meridian Labs and releasing Petri 3.0 with more adaptable and realistic behavior tests.
Posted May 10, 2026
Source age: 3 days
AWS says its MCP Server is generally available, letting AI agents call AWS APIs and read current documentation under IAM guardrails with CloudTrail and CloudWatch visibility.
Posted May 10, 2026
Source age: 4 days
Google DeepMind says AlphaEvolve, a Gemini-powered coding agent, found algorithm and infrastructure improvements, citing gains in genomics, grid optimization, and systems tuning.
Posted May 8, 2026
Source age: 1 day
Anthropic describes Model Spec Midtraining (MSM), a training stage that teaches models their behavior spec, and reports large drops in agentic misalignment on scenario tests.
Posted May 8, 2026
Source age: 3 days
OpenAI says three Realtime API audio models—GPT‑Realtime‑2, GPT‑Realtime‑Translate, and GPT‑Realtime‑Whisper—support voice agents that reason, translate, and transcribe in real time.
Posted May 8, 2026
Source age: 1 day
OpenAI added an opt-in Advanced Account Security mode that requires passkeys or security keys, tightens recovery, and shortens sessions.
Posted May 7, 2026
Source age: 7 days
A new arXiv paper introduces AgentFloor, a 30-task tool-use benchmark, and reports many routine agent steps work well on smaller open-weight models.
Posted May 7, 2026
Source age: 6 days
Google says Gemini API File Search now supports images plus text, metadata filtering, and page citations to ground RAG responses.
Posted May 7, 2026
Source age: 2 days
OpenAI says GPT‑5.5 Instant, ChatGPT’s default model, is more accurate, cuts hallucinated claims in internal tests, and adds visibility into what context was used for personalization.
Posted May 6, 2026
Source age: 1 day
NIST’s CAISI says its evaluation of DeepSeek V4 Pro finds the model lags the frontier by about eight months, based on benchmarks spanning cyber, coding, science, reasoning, and math.
Posted May 6, 2026
Source age: 5 days
NIST’s CAISI signed new agreements with Google DeepMind, Microsoft, and xAI to run pre-deployment evaluations and expand federal research on AI security.
Posted May 6, 2026
Source age: 1 day
Meta Reality Labs released RL-R CHAT, an egocentric multimodal dataset of group conversations to support hearing-assist and speech enhancement research.
Posted May 5, 2026
Source age: 4 days
ReasoningBank stores distilled reasoning strategies from both successes and failures, improving tool-using agent performance on web navigation and coding benchmarks.
Posted May 5, 2026
Source age: 14 days
OpenAI describes a relay-plus-transceiver WebRTC design that keeps voice sessions stable while avoiding huge public UDP port ranges in Kubernetes.
Posted May 5, 2026
Source age: 1 day
Anthropic researchers report that a small, roughly constant number of poisoned fine-tuning examples can install a backdoor in constitutional classifiers without obvious robustness losses.
Posted May 4, 2026
Source age: 10 days
Anthropic updated its Responsible Scaling Policy to version 3.2, expanding how its Long-Term Benefit Trust can request and approve external review of risk reports.
Posted May 4, 2026
Source age: 5 days
OpenAI says AWS customers can access its frontier models, Codex, and Bedrock Managed Agents in limited preview inside existing AWS security and billing workflows.
Posted May 4, 2026
Source age: 6 days
OpenAI says it surpassed its 10GW by 2029 infrastructure milestone early and is evaluating additional data-center sites to meet rising AI demand.
Posted May 1, 2026
Source age: 2 days