AI Builders Digest — 2026-04-25

🔥 Hot Topics

SAP CTO Philipp Herzig on Bringing the Enterprise 'Operating System' into the AI Era

SAP CTO Philipp Herzig frames AI as a business model transition rather than just a technology shift. As the leader in end-to-end enterprise operating systems serving 400,000 customers, SAP is embedding AI agents and generative UIs and reengineering its data layer to deliver verifiable outcomes across finance, HR, supply chain, and more. In his view the biggest challenges in enterprise adoption are not the models themselves but scaling across complex landscapes, data fragmented by M&A or legacy systems, and security: prompt injections and similar vulnerabilities can expose credentials if agents are not properly isolated.

Herzig says LLMs excel in unstructured domains (text, documents, support) but fall short on predictive, tabular tasks such as demand forecasting or cash flow prediction, which call for classical ML or newer architectures such as SAP's RPT1 (relational pretrained transformers). Agentic coding succeeds only with evals and verifiability, which he likens to a revival of test-driven development now that coding is automated. SAP products such as Joule for consultants show a 30% effort reduction in complex migrations.

Key insight: "AI adoption in the enterprise is still not where we wanna see it... there's this Gartner curve... the AI innovation race, and then there's this AI outcome race." He adds that the gap between innovation and outcomes "almost increases... versus getting narrow." Herzig expects the winning enterprise software companies to focus on delivering measurable business outcomes for customers rather than showing off technology. SAP is moving from seat-based licensing to hybrid consumption- and outcome-based pricing as trust and verifiability grow, to match the productivity gains AI brings, while upleveling roles so humans focus on strategic work. It is also exploring quantum computing for hard optimization problems such as logistics and the traveling salesman problem.

Read the original →

Anthropic Engineering: Scaling Managed Agents by Decoupling Brain from Hands

Anthropic's Engineering blog introduces Managed Agents, a hosted service that runs long-horizon agents behind minimal, durable interfaces designed to outlast any specific implementation and to accommodate future model improvements. It virtualizes three components: the session (an append-only, durable log of events that lives outside Claude's context window), the harness (the orchestration loop), and the sandbox (the execution environment for code and file edits).

In the earlier design everything ran in a single container, which created "pet" servers that had to be nursed through failures. Now the brain (Claude + harness) calls the hands (tools and sandboxes) through a simple execute(name, input) → string interface, so each part can fail, be replaced, or scale independently. Containers become cattle: a failure is handled as a tool error, and a new container is provisioned on demand through a tool call. The harness is stateless, so a harness failure is recovered by rebooting from the session log. Security boundaries ensure credentials never reach the sandbox where generated code runs.

Results: p50 TTFT down ~60% and p95 down more than 90%, easier integration with customer resources such as a VPC without network peering, and support for many brains and many heterogeneous hands. The session can be queried flexibly as an event stream, which supports context engineering without irreversible compaction. The design borrows from operating systems, where abstractions such as process and file outlasted the hardware beneath them; Managed Agents aims to do the same for agents, including programs nobody has thought of yet.
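
To make the brain/hands split concrete, here is a minimal sketch in Python. It is not Anthropic's API: the only detail taken from the post is the shape of the execute(name, input) → string call, and every class and function name below (Session, Harness, FlakySandbox, make_provisioner) is hypothetical. It shows a sandbox failure surfacing as a tool error, a replacement sandbox being provisioned on the next call, and a fresh harness resuming from the same session log.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional, Protocol


class Hand(Protocol):
    """Anything the brain can call; the only contract is execute(name, input) -> str."""
    def execute(self, name: str, input: str) -> str: ...


@dataclass
class Session:
    """Hypothetical append-only event log kept outside the model's context window."""
    events: list = field(default_factory=list)

    def append(self, kind: str, payload: str) -> None:
        self.events.append({"seq": len(self.events), "kind": kind, "payload": payload})

    def query(self, kind: str) -> list:
        # Events are filtered on read, never compacted or rewritten.
        return [e for e in self.events if e["kind"] == kind]


class FlakySandbox:
    """Toy sandbox; with fail=True it raises, standing in for a dead container."""
    def __init__(self, fail: bool) -> None:
        self.fail = fail

    def execute(self, name: str, input: str) -> str:
        if self.fail:
            raise RuntimeError("container died")
        return f"{name} ok: {input.upper()}"


class Harness:
    """Control loop that holds no durable state of its own; history lives in the session."""
    def __init__(self, session: Session, provision: Callable[[], Hand]) -> None:
        self.session = session
        self.provision = provision
        self.hand: Optional[Hand] = None  # provisioned lazily, on the first tool call

    def call_tool(self, name: str, input: str) -> str:
        if self.hand is None:
            self.hand = self.provision()
        self.session.append("tool_call", f"{name}:{input}")
        try:
            result = self.hand.execute(name, input)
        except Exception as exc:
            self.hand = None  # cattle, not pets: discard it and provision anew next time
            result = f"tool_error: {exc}"
        self.session.append("tool_result", result)
        return result


def make_provisioner() -> Callable[[], Hand]:
    # First sandbox fails, its replacement works.
    sandboxes = iter([FlakySandbox(fail=True), FlakySandbox(fail=False)])
    return lambda: next(sandboxes)


session = Session()
harness = Harness(session, make_provisioner())
first = harness.call_tool("bash", "ls")   # the failure comes back as a tool error
second = harness.call_tool("bash", "ls")  # a fresh sandbox handles the retry

# Simulate a harness crash: a new harness attaches to the same session log.
recovered = Harness(session, lambda: FlakySandbox(fail=False))
third = recovered.call_tool("bash", "pwd")
```

Because the harness in this sketch keeps nothing but a reference to the session and a provisioning function, replacing it loses no history; the log alone is enough to see what was called and what came back.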

Link: https://www.anthropic.com/engineering/managed-agents

Read the original →

🛠️ Developer Tools & Tips

Ryo Lu (Cursor Design) Switches Fully to GPT-5.5 + Composer 2

Ryo Lu, head of design at Cursor, says he has fully switched to GPT-5.5 + Composer 2, which he calls the ideal balance of intelligence, speed, and cost in Cursor. He also recommends the /multitask command, which lets the agent break out of the queue and work on many tasks in parallel.

Read the original →

Josh Woodward (Google Labs) on NotebookLM Auto-Labeling and Categorizing Sources

Josh Woodward, VP at Google Labs / Gemini / AI Studio, announced that NotebookLM can now automatically label and categorize sources, which should make organizing notes considerably faster.

Read the original →

Replit CEO Amjad Masad on Importing Vercel and Lovable Apps

Replit CEO Amjad Masad announced that Vercel or Lovable apps can now be imported into Replit with a few clicks.

Read the original →

Peter Steinberger Releases discrawl 0.6.0

Peter Steinberger (Polyagentmorous ClawFather at OpenClaw + OpenAI) released discrawl 0.6.0. The biggest new feature is reading Discord DMs without custom login tricks. Access is read-only: the tool does not write, to avoid sending spam.

Read the original →

💰 Startup Success Stories

Box CEO Aaron Levie on the Jevons Paradox and AI Increasing Labor Demand

Box CEO Aaron Levie applies the Jevons Paradox to AI: if AI makes employees more productive, which is now widely accepted, companies will want as many productive units of labor as possible. As AI improves, companies can take on more tasks and hire more people for the surrounding work. Small businesses that could not afford complex software can now justify investing in engineering, because AI lets each engineer have 5-10x the impact. Sales teams that automate lead generation hire more reps to work the extra leads, and marketing teams that can produce higher-quality video hire more video editors. Levie expects this trend to accelerate and to show up in surprising ways.

Read the original →

🌍 Other Updates

Sam Altman on the GPT-5.5 Release and a Good Week

OpenAI CEO Sam Altman announced that GPT-5.5 and GPT-5.5 Pro are now available in the API. "This was a good week. Proud of the team. Happy building!"

Read the original →

Swyx on Vercel AI Engineering from the C-Suite

Swyx praised the AI engineering work Vercel presented at JSConf EU, noting that it came directly from the C-suite, which is still uncommon today and, in his view, should be more common.

Read the original →

Dan Shipper (Every) on Human vs. AI Learning Speed

Dan Shipper, CEO at Every, noted: "Any AI knows more than any individual human. But any individual human learns faster than any AI."

Read the original →

Matt Turck on Reluctance to Outsource Intelligence to US Platforms

Matt Turck, a VC at FirstMark, noted that the combination of Cohere and Aleph Alpha highlights strong reluctance outside the US, Europe included, to outsource intelligence to a handful of US platforms amid geopolitical chaos.

Read the original →

Peter Yang on Building Star Fox with GPT-5.5 and Codex

Peter Yang showed a Star Fox game built with GPT-5.5 and Codex after 15 minutes of prompting; the agent even playtests its own game for fun.

Read the original →