Anthropic Engineering: Scaling Managed Agents by Decoupling Brain from Hands
In "Scaling Managed Agents: Decoupling the brain from the hands", Anthropic Engineering introduces Managed Agents, a hosted service that decouples an agent's "brain" (Claude and the harness) from its "hands" (the sandbox and tools) and from the session log, so that each part can fail or be replaced independently and long-horizon agents can scale. The design borrows from operating-system virtualization: general-purpose interfaces let a future harness or sandbox be swapped in at any time. According to the post, this solves the pets-vs-cattle problem with containers, enables VPC connectivity, and cut p50 TTFT (time to first token) by ~60%. “We virtualized the components of an agent: a session (the append-only log…), a harness…, and a sandbox…” Here the harness is the loop that calls Claude and routes tool calls, and the sandbox is the code execution environment. Credentials never reach the sandbox, and context is stored durably outside Claude’s context window. The stated aim is practical production infrastructure for programs as yet unthought of.
Link: https://www.anthropic.com/engineering/managed-agents
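To make the session / harness / sandbox split concrete, here is a minimal Python sketch. It is not Anthropic's implementation or API: the names `Session`, `Sandbox`, `EchoSandbox`, `harness`, and `scripted_model` are all hypothetical, and the "model" is a scripted stand-in rather than a real Claude call. It only illustrates the idea that the log, the loop, and the execution environment sit behind separate interfaces.

```python
from dataclasses import dataclass, field
from typing import Callable, Protocol


@dataclass
class Session:
    """Hypothetical append-only event log, kept outside the model's context."""
    log: list[dict] = field(default_factory=list)

    def append(self, event: dict) -> None:
        self.log.append(dict(event))  # store a copy; earlier events are never edited

    def events(self) -> tuple[dict, ...]:
        return tuple(self.log)


class Sandbox(Protocol):
    """Anything that can execute a command; credentials are never passed in."""
    def run(self, command: str) -> str: ...


class EchoSandbox:
    """Stand-in sandbox (assumption): a real one would run code in isolation."""
    def run(self, command: str) -> str:
        return f"ran: {command}"


Model = Callable[[tuple[dict, ...]], dict]


def harness(session: Session, model: Model, sandbox: Sandbox, max_steps: int = 10) -> str:
    """Loop that calls the model and routes its tool calls to the sandbox."""
    for _ in range(max_steps):
        action = model(session.events())
        session.append(action)
        if action["type"] == "final":
            return action["text"]
        output = sandbox.run(action["command"])
        session.append({"type": "tool_result", "output": output})
    raise RuntimeError("step limit reached")


def scripted_model(events: tuple[dict, ...]) -> dict:
    """Hypothetical stand-in for a Claude call: one tool call, then an answer."""
    if not any(e["type"] == "tool_result" for e in events):
        return {"type": "tool_call", "command": "ls"}
    return {"type": "final", "text": f"done after: {events[-1]['output']}"}


session = Session()
answer = harness(session, scripted_model, EchoSandbox())
```

Because the harness keeps no state of its own in this sketch, a different harness or sandbox could be pointed at the same `Session` and continue from the recorded events, which is the kind of independent replacement the post describes.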
Claude Blog: Managed Agents Get You to Production 10x Faster
In "Claude Managed Agents: get to production 10x faster", the Claude blog announces Managed Agents, a set of composable APIs that handle sandboxing, security and permissions, state management, and tracing, so teams can ship production-grade agents in days rather than months. A built-in orchestration harness handles tool calls, context management, and error recovery, and supports multi-agent coordination. Notion, Rakuten, Asana, Vibecode, and Sentry are already using it for coding, productivity, and finance/legal agents, with a claimed 10x faster path to production. “Managed Agents handles the complexity. You define your agent’s tasks, tools, and guardrails and we run it on our infrastructure.”
Link: https://claude.com/blog/claude-managed-agents
Anthropic Engineer: Managed Agents Are Fastest for Weekend Prototypes and Million-User Shipments
Anthropic Research’s Alex Albert says Managed Agents are somehow both the fastest way to hack together a weekend agent project and the most robust way to ship one to millions of users. The service removes the complexity of self-hosting while keeping full flexibility over the harness, tools, and skills.
Box CEO: Background Agents Are Here with Box + Claude Managed Agents
Box CEO Aaron Levie says background agents for knowledge work have arrived. Using the Box API or MCP, you can automate any content workflow with Box + Claude Managed Agents; by his account, it takes about 2 minutes to start automating document review, data extraction, or connections to IT systems.
Peter Yang Deep Dive: All-You-Can-Eat AI Subscriptions Won’t Last Forever
Peter Yang, who works in product at Roblox, takes a deep dive into why all-you-can-eat AI subscriptions like Claude Max and ChatGPT Pro may not last. He covers why Anthropic cut off OpenClaw access, how to run local models on a Mac, and what he is seeing on the ground in China.
Claude Code Engineer: Live-Streaming with Non-Technical Users to Improve Processes
Thariq, an engineer on Anthropic’s Claude Code team, plans to live-stream sessions in which he works with non-technical people using Claude Code, to show how a few tips can dramatically improve their processes. He will start with people he knows to get past the initial awkwardness, and he called the docs a gold mine.
YCombinator CEO: Let Agents Decide When to Trigger GStack Skills via Markdown
YCombinator President & CEO Garry Tan advised against taking advice from 1x-speed engineers and urged people to speed up together instead. He praised Anjney Midha and explained how markdown lets the agent itself decide when a GStack skill will help.
OpenClaw ClawFather: Character Evals and the Reality of Local Models
OpenClaw’s Peter Steinberger completed his redemption arc and shared character eval work in which he removed model names from what the judge sees, to stop Claude from ranking itself #1. He also noted two recurring themes: people want powerful local models, and they keep complaining that even top-tier models still make mistakes or fail to follow instructions.
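One way to hide model names from a judge can be sketched in Python as follows. This is not Steinberger's code: `blind_candidates` and `unblind` are hypothetical helpers, the "Response A/B" labels are an assumption, and shuffling the order is an extra step added here that the source does not mention. The sketch also does not scrub model names that appear inside the response text itself.

```python
import random


def blind_candidates(
    responses: dict[str, str], rng: random.Random
) -> tuple[dict[str, str], dict[str, str]]:
    """Swap model names for neutral labels before responses reach the judge.

    Returns (blinded, key): `blinded` maps label -> response text and is what
    the judge sees; `key` maps label -> model name and stays with the evaluator.
    """
    names = list(responses)
    rng.shuffle(names)  # assumption: also randomize order so position gives no hint
    labels = [f"Response {chr(ord('A') + i)}" for i in range(len(names))]
    blinded = {label: responses[name] for label, name in zip(labels, names)}
    key = dict(zip(labels, names))
    return blinded, key


def unblind(ranking: list[str], key: dict[str, str]) -> list[str]:
    """Map the judge's ranking of labels back to model names."""
    return [key[label] for label in ranking]


# Placeholder model names and texts, for illustration only.
responses = {"model-x": "Ahoy there!", "model-y": "Greetings, traveler."}
blinded, key = blind_candidates(responses, random.Random(0))
```

The judge prompt would then be built from `blinded` only, and `unblind` is applied to whatever ranking of labels the judge returns.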