Anthropic 发布 Claude Opus 4.7 及 Claude Code 重大更新Anthropic Launches Claude Opus 4.7 with Major Claude Code Enhancements
Anthropic 研究团队的 Alex Albert 分享了新发布的 Claude Opus 4.7 的亮点功能,包括优秀的异步工作和指令遵循能力、更可预测的 effort levels(新增 xhigh 选项)、不再对高分辨率图像进行降采样,以及在 UI、幻灯片和文档上明显提升的品味。Claude 官方账号宣布该模型已在 claude.ai、Claude Platform 及各大云平台可用,同时 Claude Code 迎来更新,如新增 /ultrareview 命令进行专用代码审查,并为 Max 用户扩展 auto mode。Anthropic 的 Cat Wu 和 Thariq 分享了实用技巧,例如将默认 effort 设置为 xhigh、在 claude.md 中添加验证工作流,以及文档中新增的「What's New」部分和网络研讨会。这些更新让开发者构建代理式编码更加可靠和高效。
Anthropic Research's Alex Albert highlights favorite features in the new Claude Opus 4.7, including excellent async work and instruction following, more predictable effort levels with a new xhigh option, no downscaling of high-res images, and noticeably better taste in UIs, slides, and docs. The official Claude account announces the model is now available on claude.ai, the Platform, and major clouds, with Claude Code updates like the new /ultrareview command for dedicated code reviews and extended auto mode for Max users. Anthropic's Cat Wu and Thariq share practical tips such as setting default effort to xhigh, adding verification workflows to claude.md, and new docs sections for what's new and webinars. These updates make agentic coding more reliable and efficient for developers.
查看原文 →查看原文 →查看原文 →查看原文 →
OpenAI 推出 Codex 重大升级和 GPT-Rosalind 科学模型OpenAI Rolls Out Major Codex Upgrades and GPT-Rosalind for Scientific Research
OpenAI 科学副总裁 Kevin Weil 和 CEO Sam Altman 宣布 Codex 迎来重大改进,现在具备惊人出色的 computer use 能力、内置浏览器、众多新插件,并能从经验中学习并主动建议操作。Codex 可在 Mac 上并行使用所有应用,而不会干扰用户工作。此外,他们推出了 GPT-Rosalind,这是一款专为生物学、药物发现和转化医学打造的前沿模型,内置相关数据库和工具知识,同时为 Codex 推出 Life Sciences 插件供所有人使用。Box CEO Aaron Levie 表示,这让代理能够执行长期任务,如起草报告、审查合同,并使用 Box 插件处理企业内容。
OpenAI VP Science Kevin Weil and CEO Sam Altman announce major improvements to Codex, now with shockingly good computer use capabilities, an in-app browser, many new plugins, and the ability to learn from experience and proactively suggest actions. Codex can now use all apps on your Mac in parallel without interfering with your work. Additionally, they launched GPT-Rosalind, a frontier model specialized for biology, drug discovery, and translational medicine with built-in knowledge of relevant databases and tools, plus a Life Sciences plugin for Codex available to everyone. Box CEO Aaron Levie notes this enables agents to execute long-running tasks like drafting reports, reviewing contracts, and processing enterprise content using the Box plugin.
查看原文 →查看原文 →查看原文 →
Anthropic 推出 Claude Managed Agents,实现 10 倍更快上线生产Anthropic Launches Claude Managed Agents for 10x Faster Production Deployment
Anthropic Engineering 解释了 Managed Agents 如何将「大脑」(Claude 和 harness)与「双手」(沙盒和工具)以及持久会话日志解耦,通过虚拟化代理组件实现耐用性、可扩展性和未来兼容性。这解决了之前的宠物容器问题,支持 VPC 集成而无需网络对等,大幅降低首次 token 时间,并支持多个大脑和双手。Claude Blog 强调它能让团队 10 倍更快地将生产级代理上线,配备安全沙盒、长期运行会话、多代理协调和可信治理。Notion、Rakuten、Asana、Vibecode 和 Sentry 等早期采用者已将其用于编码代理、生产力代理、财务/法律工作流以及 bug 修复。
Anthropic Engineering explains how Managed Agents decouples the 'brain' (Claude and harness) from the 'hands' (sandboxes and tools) and persistent session log, virtualizing agent components for durability, scalability, and future-proofing. This solves previous pet-container issues, enables VPC integrations without peering, drastically reduces time-to-first-token, and supports many brains and hands. The Claude Blog highlights how it lets teams ship production agents 10x faster with secure sandboxing, long-running sessions, multi-agent coordination, and trusted governance. Early adopters like Notion, Rakuten, Asana, Vibecode, and Sentry are already using it for coding agents, productivity agents, finance/legal workflows, and bug fixing.
查看原文 →查看原文 →
Notion 团队深度分享自定义代理、MCP vs CLI 及软件工厂未来Notion Team Deep Dive on Custom Agents, MCP vs CLIs, and the Software Factory Future
Notion 致力于成为企业工作最佳的记录系统。在 Latent Space 播客中,Notion 的 Simon Last 和 Sarah Sachs 讨论了自 2022 年底以来对自定义代理的四五次重构,揭示早期努力因模型不成熟、缺少 function calling 和足够上下文而失败。重大突破来自先进的 reasoning models,使可靠的后台代理和复杂权限管理成为可能。Simon 看好 CLI 用于终端环境中强大的自调试编码代理,同时认为 MCP 在窄范围、严格权限的轻量级代理中仍有价值。他们对「软件工厂」充满热情,在那里代理团队协作开发、调试、审查和维护代码库。关键领导经验包括低 ego 文化、敢于删除代码、聚焦用户旅程(如邮件分类)而非酷工具,以及通过 evals 分布工具所有权。「MCP 就是那个简单有效的东西,而且它相当不错,」Simon 指出。自定义代理的推出是 Notion 最成功的发布之一,得益于免费试用。
Notion is dedicated to being the best system of record for enterprise work. In the Latent Space podcast, Simon Last and Sarah Sachs from Notion discuss their four or five rebuilds of custom agents since late 2022, revealing that early efforts failed due to immature models lacking function calling and sufficient context. The big unlock came with advanced reasoning models, enabling reliable background agents with sophisticated permissioning. Simon is bullish on CLIs for powerful self-debugging coding agents in terminal environments, while still seeing value in MCPs for narrow, tightly permissioned lightweight agents. They are excited about the 'software factory' where teams of agents collaboratively develop, debug, review, and maintain codebases. Key leadership lessons include a low-ego culture comfortable deleting code, focusing on user journeys like email triaging rather than cool tools, and distributing tool ownership through evals. 'MCP is just the dumb simple thing that works, and it is pretty good,' notes Simon. The custom agents launch was Notion's most successful yet, boosted by free trials.
查看原文 →