Topic hub

AI Agents

AI agents are systems that can plan steps and use tools. This desk tracks where agentic workflows are becoming real and where they still need caution.

10 posts Latest May 19, 2026 Readers want to know what AI agents can actually do and where they are being adopted.

Plain-English primer

Terms that appear in this desk

AI agent tool use sandbox workflow automation

AI Agents 36 sec Verified

Hugging Face and IBM open a leaderboard for AI agent testing

Hugging Face and IBM Research introduced an Open Agent Leaderboard to compare how well AI agents handle tool use and multi-step tasks.

Posted May 19, 2026 Source age: 1 day

Read brief Original source

AI Agents 48 sec Verified

OpenAI and Dell bring Codex closer to enterprise infrastructure

OpenAI and Dell are partnering to support Codex in hybrid and on-premise enterprise environments where companies need tighter data controls.

Posted May 19, 2026 Source age: 1 day

Read brief Original source

AI Agents 47 sec Verified

OpenAI explains how it sandboxes Codex on Windows

OpenAI detailed the Windows sandbox work behind Codex, showing how coding agents can be given useful access without unlimited system permissions.

Posted May 17, 2026 Source age: 4 days

Read brief Original source

AI Agents 45 sec Verified

Sea Limited describes using Codex across engineering teams

OpenAI’s Sea Limited case study shows how a large Asian technology company is thinking about Codex and agentic software development.

Posted May 17, 2026 Source age: 3 days

Read brief Original source

AI Agents 45 sec Verified

Databricks brings GPT-5.5 into enterprise agent workflows

OpenAI says Databricks is using GPT-5.5 for enterprise agent workflows after benchmark gains on office-style knowledge tasks.

Posted May 16, 2026 Source age: 1 day

Read brief Original source

AI Agents 2 min Verified

Anthropic releases finance agents and Microsoft 365 add-ins

Anthropic says it is releasing ten finance agent templates and Claude add-ins for Microsoft 365, so teams can run governed workflows across Excel, PowerPoint, Word, and Outlook.

Posted May 14, 2026 Source age: 9 days

Read brief Original source

AI Agents 2 min Verified

NVIDIA and ServiceNow expand partnership for governed autonomous agents

NVIDIA says it is expanding work with ServiceNow on governed autonomous agents, including ServiceNow’s Project Arc and an OpenShell-based runtime for sandboxed, policy-controlled execution.

Posted May 12, 2026 Source age: 7 days

Read brief Original source

AI Agents 2 min Verified

DeepMind highlights new impact results for AlphaEvolve, its Gemini-powered coding agent

Google DeepMind says AlphaEvolve, a Gemini-powered coding agent, found algorithm and infrastructure improvements, citing gains in genomics, grid optimization, and systems tuning.

Posted May 8, 2026 Source age: 1 day

Read brief Original source

AI Agents 2 min Verified

AgentFloor benchmark tests how far small open-weight models go in tool use

A new arXiv paper introduces AgentFloor, a 30-task tool-use benchmark, and reports many routine agent steps work well on smaller open-weight models.

Posted May 7, 2026 Source age: 6 days

Read brief Original source

AI Agents 2 min Verified

Google’s ReasoningBank aims to help agents learn from past runs

ReasoningBank stores distilled reasoning strategies from both successes and failures, improving tool-using agent performance on web navigation and coding benchmarks.

Posted May 5, 2026 Source age: 14 days

Read brief Original source

Nearby topics

AI Agents

Terms that appear in this desk

Keep browsing