AI Builders Digest — 2026-04-17

🔥 热点话题

Anthropic 发布 Claude Opus 4.7 及 Claude Code 重大更新Anthropic Launches Claude Opus 4.7 with Major Claude Code Enhancements

Anthropic 研究团队的 Alex Albert 分享了新发布的 Claude Opus 4.7 的亮点功能，包括优秀的异步工作和指令遵循能力、更可预测的 effort levels（新增 xhigh 选项）、不再对高分辨率图像进行降采样，以及在 UI、幻灯片和文档上明显提升的品味。Claude 官方账号宣布该模型已在 claude.ai、Claude Platform 及各大云平台可用，同时 Claude Code 迎来更新，如新增 /ultrareview 命令进行专用代码审查，并为 Max 用户扩展 auto mode。Anthropic 的 Cat Wu 和 Thariq 分享了实用技巧，例如将默认 effort 设置为 xhigh、在 claude.md 中添加验证工作流，以及文档中新增的「What's New」部分和网络研讨会。这些更新让开发者构建代理式编码更加可靠和高效。

Anthropic Research's Alex Albert highlights favorite features in the new Claude Opus 4.7, including excellent async work and instruction following, more predictable effort levels with a new xhigh option, no downscaling of high-res images, and noticeably better taste in UIs, slides, and docs. The official Claude account announces the model is now available on claude.ai, the Platform, and major clouds, with Claude Code updates like the new /ultrareview command for dedicated code reviews and extended auto mode for Max users. Anthropic's Cat Wu and Thariq share practical tips such as setting default effort to xhigh, adding verification workflows to claude.md, and new docs sections for what's new and webinars. These updates make agentic coding more reliable and efficient for developers.

查看原文 →查看原文 →查看原文 →查看原文 →

OpenAI 推出 Codex 重大升级和 GPT-Rosalind 科学模型OpenAI Rolls Out Major Codex Upgrades and GPT-Rosalind for Scientific Research

OpenAI 科学副总裁 Kevin Weil 和 CEO Sam Altman 宣布 Codex 迎来重大改进，现在具备惊人出色的 computer use 能力、内置浏览器、众多新插件，并能从经验中学习并主动建议操作。Codex 可在 Mac 上并行使用所有应用，而不会干扰用户工作。此外，他们推出了 GPT-Rosalind，这是一款专为生物学、药物发现和转化医学打造的前沿模型，内置相关数据库和工具知识，同时为 Codex 推出 Life Sciences 插件供所有人使用。Box CEO Aaron Levie 表示，这让代理能够执行长期任务，如起草报告、审查合同，并使用 Box 插件处理企业内容。

OpenAI VP Science Kevin Weil and CEO Sam Altman announce major improvements to Codex, now with shockingly good computer use capabilities, an in-app browser, many new plugins, and the ability to learn from experience and proactively suggest actions. Codex can now use all apps on your Mac in parallel without interfering with your work. Additionally, they launched GPT-Rosalind, a frontier model specialized for biology, drug discovery, and translational medicine with built-in knowledge of relevant databases and tools, plus a Life Sciences plugin for Codex available to everyone. Box CEO Aaron Levie notes this enables agents to execute long-running tasks like drafting reports, reviewing contracts, and processing enterprise content using the Box plugin.

查看原文 →查看原文 →查看原文 →

Anthropic 推出 Claude Managed Agents，实现 10 倍更快上线生产Anthropic Launches Claude Managed Agents for 10x Faster Production Deployment

Anthropic Engineering 解释了 Managed Agents 如何将「大脑」（Claude 和 harness）与「双手」（沙盒和工具）以及持久会话日志解耦，通过虚拟化代理组件实现耐用性、可扩展性和未来兼容性。这解决了之前的宠物容器问题，支持 VPC 集成而无需网络对等，大幅降低首次 token 时间，并支持多个大脑和双手。Claude Blog 强调它能让团队 10 倍更快地将生产级代理上线，配备安全沙盒、长期运行会话、多代理协调和可信治理。Notion、Rakuten、Asana、Vibecode 和 Sentry 等早期采用者已将其用于编码代理、生产力代理、财务/法律工作流以及 bug 修复。

Anthropic Engineering explains how Managed Agents decouples the 'brain' (Claude and harness) from the 'hands' (sandboxes and tools) and persistent session log, virtualizing agent components for durability, scalability, and future-proofing. This solves previous pet-container issues, enables VPC integrations without peering, drastically reduces time-to-first-token, and supports many brains and hands. The Claude Blog highlights how it lets teams ship production agents 10x faster with secure sandboxing, long-running sessions, multi-agent coordination, and trusted governance. Early adopters like Notion, Rakuten, Asana, Vibecode, and Sentry are already using it for coding agents, productivity agents, finance/legal workflows, and bug fixing.

查看原文 →查看原文 →

Notion 团队深度分享自定义代理、MCP vs CLI 及软件工厂未来Notion Team Deep Dive on Custom Agents, MCP vs CLIs, and the Software Factory Future

Notion 致力于成为企业工作最佳的记录系统。在 Latent Space 播客中，Notion 的 Simon Last 和 Sarah Sachs 讨论了自 2022 年底以来对自定义代理的四五次重构，揭示早期努力因模型不成熟、缺少 function calling 和足够上下文而失败。重大突破来自先进的 reasoning models，使可靠的后台代理和复杂权限管理成为可能。Simon 看好 CLI 用于终端环境中强大的自调试编码代理，同时认为 MCP 在窄范围、严格权限的轻量级代理中仍有价值。他们对「软件工厂」充满热情，在那里代理团队协作开发、调试、审查和维护代码库。关键领导经验包括低 ego 文化、敢于删除代码、聚焦用户旅程（如邮件分类）而非酷工具，以及通过 evals 分布工具所有权。「MCP 就是那个简单有效的东西，而且它相当不错，」Simon 指出。自定义代理的推出是 Notion 最成功的发布之一，得益于免费试用。

Notion is dedicated to being the best system of record for enterprise work. In the Latent Space podcast, Simon Last and Sarah Sachs from Notion discuss their four or five rebuilds of custom agents since late 2022, revealing that early efforts failed due to immature models lacking function calling and sufficient context. The big unlock came with advanced reasoning models, enabling reliable background agents with sophisticated permissioning. Simon is bullish on CLIs for powerful self-debugging coding agents in terminal environments, while still seeing value in MCPs for narrow, tightly permissioned lightweight agents. They are excited about the 'software factory' where teams of agents collaboratively develop, debug, review, and maintain codebases. Key leadership lessons include a low-ego culture comfortable deleting code, focusing on user journeys like email triaging rather than cool tools, and distributing tool ownership through evals. 'MCP is just the dumb simple thing that works, and it is pretty good,' notes Simon. The custom agents launch was Notion's most successful yet, boosted by free trials.

查看原文 →

💰 创业成功案例

Replit CEO Amjad Masad 分享网页转 iOS 应用和欧盟部署新功能Replit CEO Amjad Masad Shares Web-to-iOS App Conversion and EU Deployment

Replit CEO Amjad Masad 强调，使用他们的工具可以将网页应用转为 iOS 应用，成本不到 10 美元。他还宣布对运行并行代理特别有用的功能打 5 折，以加快项目进度，并推出新的欧盟部署功能以提升全球性能。

Replit CEO Amjad Masad highlights how you can turn your web app into an iOS app for less than $10 using their tools. He also announced 50% off on features especially useful for running parallel agents to accelerate project progress, and new Deploy to EU capabilities for better global performance.

查看原文 →查看原文 →查看原文 →

Vercel CEO Guillermo Rauch 推出 Workflow SDK 解决代理持久性难题Vercel CEO Guillermo Rauch Launches Workflow SDK to Solve Agent Durability

Vercel CEO Guillermo Rauch 将 Workflow SDK 作为解决代理和后端最难问题——持久性——的方案。它以 SQS/Kafka/微服务般的可靠性处理 LLM 宕机、速率限制和数据库变慢，同时避免复杂性，并从第一天起支持自托管和多云部署。他还称赞 Vercel AI SDK 和 AI Gateway 能抓住 AI 进步波动带来的机会，包括 Anthropic 的最新发布。

Vercel CEO Guillermo Rauch introduces Workflow SDK as the solution for the hardest problem in agents and backends: durability. It handles LLM downtime, rate limits, and database slowdowns with the reliability of SQS/Kafka/microservices but without the complexity, featuring self-hosting and multi-cloud support from day one. He also praises the Vercel AI SDK and AI Gateway for capturing the upside of AI progress volatility across models like the new Anthropic ship.

查看原文 →查看原文 →

Cursor 设计负责人 Ryo Lu 推出 Baby Glass 原型设计环境Cursor Design Lead Ryo Lu Launches Baby Glass Prototyping Environment

Cursor_ai 设计负责人 Ryo Lu 与 Flowstated 合作开发了 Baby Glass，这是一个新的原型设计环境，让设计师使用 Cursor 3 新界面共享组件快速想象、重混和测试代码想法。他还分享了自己的工作流：用 Opus 4.7 规划、Composer 2 构建和迭代、Codex/GPT-5.4 处理难题——全部在 Cursor_ai 中完成。

Cursor_ai Design Lead Ryo Lu has been working on Baby Glass with Flowstated, a new prototyping environment that lets designers quickly imagine, remix, and test ideas in code using the same shared components powering Cursor 3’s new interface. He also shares his personal workflow using Opus 4.7 for planning, Composer 2 for building and iterations, and Codex/GPT-5.4 for hard bugs—all within Cursor_ai.

查看原文 →查看原文 →

🛠️ 开发者工具与技巧

Anthropic Claude Code 实用技巧与更新Practical Tips and Updates for Anthropic Claude Code

Anthropic 的 Cat Wu 分享 Claude Code 技巧：通过在 claude.md 中放入测试工作流或添加 /verify-app skill 来告诉模型如何验证变更，他们已将 Opus 4.7 的默认 effort level 设置为 xhigh。Thariq 宣布更新，包括文档中精选的 What’s New 部分、每月「what we shipped」网络研讨会，以及更多 /usage 内容，帮助用户最大化利用 Claude Code。

Anthropic's Cat Wu shares Claude Code tips: tell the model how to verify changes by putting testing workflows in claude.md or adding /verify-app skill, and they've set default effort level for Opus 4.7 to xhigh. Thariq announces updates including a curated What’s New section in docs, monthly 'what we shipped' webinars, and more on /usage to help users maximize Claude Code.

查看原文 →查看原文 →

Peter Yang 分享 AI 技能构建中的评估代理技巧Peter Yang Shares Eval Agent Techniques for AI Skill Building

Roblox 产品负责人 Peter Yang 建议在构建 AI 技能时，总是让 AI 启动一个独立的 eval 代理对第一个代理的输出进行 yes/no 检查，如果未全部通过则让第一个代理继续工作——目前正在为 YouTube 缩略图和标题构建一个。他还建议 Claude 和 Codex 应用应支持轻松查看和编辑 markdown 文件，以便手工完善最后 10% 的指令，并幽默地指出在 openclaw、hermes、perplexity computer 之间迁移的循环。

Product at Roblox Peter Yang recommends always asking AI to spin up a separate eval agent for yes/no checks on the first agent's output, looping until all checks pass—currently building one for YouTube thumbnails and titles. He also suggests Claude and Codex apps should support easy viewing and editing of markdown files for hand-crafting final instructions, and humorously notes the cycle of migrating between tools like openclaw, hermes, perplexity computer, and back.

查看原文 →查看原文 →查看原文 →

🌍 其他动态

Swyx 点评 AI 工程社区的分裂与重要演讲Swyx Comments on AI Engineering Split and Key Talks

AI 构建者 Swyx 指出，在 @aiDotEngineer 的演讲日程中，第一天安排 slop cannons 如 @_lopopolo，第二天安排 grown ups 如 @badlogicgames，完美代表了当前 AI 工程最重要的分裂。他分享了 Slow Down 和 Harness Engineering 演讲的链接。

AI builder Swyx notes that scheduling slop cannons like @_lopopolo on day 1 and grown ups like @badlogicgames on day 2 at @aiDotEngineer perfectly represents the most important split in AI engineering right now. He shares links to Slow Down and Harness Engineering talks.

查看原文 →查看原文 →查看原文 →

YC CEO Garry Tan 发布 GBrain 安全修复和 Gemini Live 测试YC CEO Garry Tan Releases GBrain Security Fixes and Gemini Live Tests

Y Combinator 总裁兼 CEO Garry Tan 分享了 /ship skill 的 bug 修复和 GBrain 的重大安全修复。他还与他的 claw 一起为 GBrain Voice 构建了完整的端到端 Gemini Live 测试，很快将开源发布。

Y Combinator President & CEO Garry Tan shares bug fixes for the /ship skill and major GBrain security fixes. He also built proper e2e Gemini Live tests for GBrain Voice with his claw, with open source release coming soon.

查看原文 →查看原文 →查看原文 →

OpenClaw 社区在 Michigan 举办 ClawCon 大会，近 2000 人参与OpenClaw Community Hosts ClawCon Michigan with Nearly 2000 Attendees

OpenClaw 的 Peter Steinberger 庆祝 Michigan ClawCon 大会有近 2000 人参与，称之为奇怪的龙虾邪教。他还反思 OpenClaw 的众多 GHSA 实际上是即将到来的 AI 代理安全风暴的指标。

OpenClaw's Peter Steinberger celebrates almost 2000 people at ClawCon Michigan, calling it a weird lobster cult. He also reflects that OpenClaw's many GHSAs are actually an indicator of the coming storm in security for AI agents.

查看原文 →查看原文 →

AI 行业其他观察：Zara Zhang、Dan Shipper 等动态Other AI Industry Observations from Zara Zhang, Dan Shipper, and More

构建者 Zara Zhang 分享了她时间投资的三大重点：深度对话、深度阅读和与 AI 的深度玩耍，同时将前端幻灯片转为 HTML 视频，宣称「HTML 正在吞噬一切」。Every CEO Dan Shipper 举办了一场「哲学家选秀」，讨论每个模型实验室会雇佣哪位历史哲学家。Aditya Agarwal 幽默地说要用 4.7 重写所有代码。Linear 产品负责人 Nan Yu 观察到 AI 行业像加密货币一样快速重塑现有金融概念，但速度更快，包括针对工程师和产品经理的。

Builder Zara Zhang shares her top three time investments: deep talks, deep reads, and deep play with AI, while turning frontend slides into HTML videos declaring 'HTML is eating everything.' Every CEO Dan Shipper ran a philosopher draft on which historical philosopher each model lab would hire. Aditya Agarwal jokes about rewriting all code with 4.7. Linear Head of Product Nan Yu observes the AI industry rapidly reinventing financial concepts like crypto did, but faster, including for engineers and PMs.

查看原文 →查看原文 →查看原文 →查看原文 →查看原文 →查看原文 →查看原文 →查看原文 →