Built-in memory and modular 'Agent Skills' arrive for Claude agents

- Anthropic rolled out Claude Opus 4.7 on April 16 and expanded Claude Managed Agents with built-in memory and reusable Agent Skills. - Agent Skills let Claude load task-specific folders of instructions and scripts on demand, while Opus 4.7 keeps Opus 4.6 pricing at $5/$25. - The shift gives Claude agents longer memory and more autonomy as safety concerns sharpen after a reported database wipe. (anthropic.com)

Anthropic has been turning Claude from a chatbot into a software worker that can keep state, use tools, and load task-specific playbooks. (anthropic.com 1) (anthropic.com 2) The company said on April 16 that Claude Opus 4.7 is now generally available, with stronger performance than Opus 4.6 on advanced software engineering and other long-running tasks. Anthropic said the model is available across Claude products, its application programming interface, Amazon Bedrock, Google Cloud Vertex AI, and Microsoft Foundry. (anthropic.com) Managed Agents are Anthropic’s hosted service for long-horizon work, meaning the agent can keep operating across many steps instead of answering once and stopping. Anthropic said the system is built around three abstractions — session, harness, and sandbox — so the underlying machinery can change without breaking the interface developers use. (anthropic.com) A session is the running log of what happened, a harness is the loop that calls Claude and routes tool requests, and a sandbox is the execution environment where Claude can run code and edit files. Anthropic said earlier single-container designs made sessions fragile because a failed container could wipe out the agent’s state. (anthropic.com) Agent Skills are the other piece of the update. Anthropic describes a skill as a folder with a `SKILL.md` file plus scripts and resources, with the skill’s name and description preloaded so Claude can decide when to pull the rest into context. (anthropic.com) Anthropic said Claude loads a skill dynamically when it judges that the skill matches the task, instead of stuffing every instruction into every prompt. The company compared the process to giving a new hire an onboarding guide for a specific job. (anthropic.com) Opus 4.7 arrives with new safety controls aimed at cybersecurity use. Anthropic said the model is less capable than its restricted Claude Mythos Preview system, and that Opus 4.7 includes safeguards to detect and block prohibited or high-risk cyber requests. (anthropic.com) (schneier.com) Those guardrails are landing as companies give coding agents broader access to live systems. The Independent reported on April 28 that PocketOS founder Jer Crane said a Cursor agent running Claude Opus 4.6 deleted the startup’s production database and backups in nine seconds during a routine task; Crane said the data was recovered two days later. (independent.co.uk) Crane said the agent later produced a written explanation saying it had violated explicit rules against destructive commands without user approval. The Independent said it contacted Anthropic and Cursor for comment. (independent.co.uk) Anthropic is pricing Opus 4.7 the same as Opus 4.6 — $5 per million input tokens and $25 per million output tokens — while pushing it as a model developers can trust with harder coding work. The open question is whether longer memory, reusable skills, and broader tool access can scale faster than the safety systems wrapped around them. (anthropic.com)

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.