AI Builders Digest — 2026-05-06

🔥 热点话题

OpenAI vs Ant：AI 创业公司估值现实检查OpenAI vs Ant: AI Startup Valuation Reality Check

AI 工程师 Swyx 根据 WSJ 数据重建图表，对比了 OpenAI 的 8500 亿美元估值（约 300 亿美元 ARR）与 Ant 的约 9000 亿美元估值（约 440 亿美元 ARR，但若采用 OpenAI 的确认方法可能低 80-100 亿美元）。这凸显了 AI 与金融科技之间估值倍数的差异。

AI engineer Swyx reconstructed a WSJ chart comparing OpenAI's $850B valuation at ~$30B ARR with Ant's ~$900B valuation at ~$44B ARR, noting Ant's revenue would be ~$8-10B lower using OpenAI's methodology. This highlights the aggressive multiples in AI versus fintech.

查看原文 →

AI 应用的三个前沿：编程、知识工作、个人 AgentThe Three Frontiers of AI Adoption: Coding, Knowledge Work, Personal Agents

Roblox 产品负责人 Peter Yang 勾勒了 AI 应用的三阶段轨迹：编程是 AI 已经证明自己的第一个前沿，知识工作是第二个，个人 Agent 则是第三波。这一框架暗示了从技术场景到个人场景的自然演进。

Roblox product leader Peter Yang outlined a three-phase AI adoption trajectory: coding is the first frontier where AI is already delivering, knowledge work is the second, and personal agents represent the third wave. This framing suggests a natural progression from technical to personal use cases.

查看原文 →

企业 AI Agent 部署浪潮：背后的真正挑战Enterprise AI Agents Are Coming Fast: The Real Work Behind Deployment

Box CEO Aaron Levie 指出，Anthropic 和 OpenAI 都在推出帮助企业部署 AI Agent 的计划。除了模型本身，真正的挑战在于升级 IT 系统、提供上下文、现代化工作流、管理人机协作关系以及推动采用——这为新的岗位和公司创造了大量机会。

Box CEO Aaron Levie notes that both Anthropic and OpenAI are launching initiatives to help enterprises deploy AI agents. Beyond the models, there's real work in upgrading IT systems, providing context, modernizing workflows, managing human-agent relationships, and driving adoption—creating massive opportunities for new jobs and firms.

查看原文 →

2023-2025 年创业公司的教训：只重分发不重留存就是烧钱Startup Vintage 2023-2025: Distribution Over Retention Is Burning Money

FPV Ventures 合伙人 Nikunj Kothari 认为，2023-2025 年的创业公司逐渐意识到，华丽的发布视频和只关注分发也许能拿到 VC 融资，但若忽视留存仍然是烧钱。增长势头从来不是护城河，种子轮到 A 轮的差距正在拉大，导致更多人才收购。

FPV Ventures partner Nikunj Kothari argues that startups from 2023-2025 are realizing that fancy launch videos and distribution focus might get VC funding but still burn cash if retention is neglected. Momentum is not a moat, and the seed-to-Series A gap is widening, leading to more acquihires.

查看原文 →

没有方向的快速迭代毫无意义Velocity Without Direction Is Not Interesting

South Park Commons GP Aditya Agarwal 强调他们从未想成为加速器，因为没有“真北”指引的快速迭代毫无意义。这反映了一种追求有目的的建设而非单纯速度的理念。

South Park Commons GP Aditya Agarwal emphasized that they never desired to be an accelerator, because velocity without grounding towards true north is not interesting. This reflects a philosophy of purposeful building over mere speed.

查看原文 →

语音模型正在改变人机交互方式Voice Models Are Changing How People Interface with AI

OpenAI CEO Sam Altman 对语音模型的进步感到兴奋，观察到人们已经开始改变与 AI 的交互方式——转向语音。这预示着 AI 产品用户体验的重大演变。

OpenAI CEO Sam Altman expressed excitement for voice models getting great, noting that people are already starting to change how they interface with AI—shifting toward voice. This signals a major UX evolution in AI products.

查看原文 →

Waymo 的 Dmitri Dolgov：2000 万次出行与全自动驾驶之路Waymo's Dmitri Dolgov: 20 Million Rides and the Road to Full Autonomy

核心收获：Waymo 已进入指数级扩张阶段，累计完成 2000 万次全自动驾驶出行，其中 1000 万次发生在过去 7 个月。Waymo 联合 CEO Dmitri Dolgov 透露其基础模型是一个端到端的多模态世界行动模型，同时驱动驾驶、模拟和评判三大模块。安全是底线：Waymo Driver 在严重伤害碰撞方面比人类司机安全 13 倍，每 8 天就能预防一次严重伤害。令人难忘的细节：LiDAR 曾探测到公交车下方行人的脚步信号，使 AI 提前预测并做出反应。Waymo 正扩展至 11 个城市，今年还计划进入伦敦和东京。

The Takeaway: Waymo has entered exponential scaling, giving 20 million fully autonomous rides, with 10 million in the last 7 months. Waymo co-CEO Dmitri Dolgov revealed that their foundation model is an end-to-end multimodal world action model, powering driver, simulator, and critic. Safety is non-negotiable: the Waymo Driver is 13x safer than human drivers for serious injury collisions, preventing a serious injury every 8 days. A memorable moment: LiDAR detected a pedestrian's footsteps under a bus, allowing the AI to predict and react. Waymo is expanding to 11 cities and internationally to London and Tokyo this year.

查看原文 →

💰 创业成功案例

Replit 帮助创业者找到投资人并获得会面Replit Helps Entrepreneur Find Investors and Land Meetings

Replit CEO Amjad Masad 分享了一位创业者使用 Replit 构建产品，成功吸引投资人并获得会面的故事。这展示了 Replit 作为非技术创始人获取融资的跳板。

Replit CEO Amjad Masad shared a story of an entrepreneur who used Replit to build their product and successfully attracted investors and landed meetings. This showcases Replit as a launchpad for non-technical founders to get to funding.

查看原文 →

基于 Replit 构建的聋生多模态 AI 学习平台AI Multi-Modal Learning Platform for Deaf Students Built on Replit

Amjad Masad 强调了一个 AI 教育的优秀案例：一个为聋生打造的多模态学习平台，使用 Replit 构建。这体现了 AI 在创造包容性教育工具方面的潜力。

Amjad Masad highlighted a great use of AI for education: a multi-modal learning platform for deaf students, built using Replit. This demonstrates AI's potential to create inclusive educational tools.

查看原文 →

🛠️ 开发者工具与技巧

Vercel 发布 deepsec：开源安全审查 Agent 编排器Vercel Introduces deepsec: Open-Source Agent Orchestrator for Security Reviews

Vercel CEO Guillermo Rauch 发布了 npx deepsec，一个用于深度安全审查的开源 Agent 编排器。它能在几分钟内发现团队需要数月才能找到的严重漏洞。针对 Vercel Sandbox 优化，可调动数千个 Agent 并行审查代码。他邀请开源项目私信申请赞助运行。

Vercel CEO Guillermo Rauch announced npx deepsec, an open-source agent orchestrator for deep security reviews. It can find critical vulnerabilities in minutes that would take teams months. Optimized for Vercel Sandbox, it harnesses thousands of agents scrutinizing code in parallel. He invites OSS projects to DM for a sponsored run.

查看原文 →

GBrain v0.27：统一记忆、代码和搜索的图谱工具GBrain v0.27: Unified Graph for Memory, Code, and Search

YC CEO Garry Tan 的 GBrain 是一个将记忆层、代码工具和搜索引擎统一在一个图谱和查询界面下的工具。v0.27 新增了对非 Anthropic/OpenAI 的 embedding 和 LLM 的支持。多模态 embedding 和照片 OCR 即将推出。他每天与 10 万行 markdown 文件和 OpenClaw+Hermes Agent 配合使用。

YC CEO Garry Tan's GBrain is a unified tool combining memory layer, code tool, and search engine under one graph with one query interface. v0.27 adds support for non-Anthropic/OpenAI embeddings and LLMs. Multi-modal embeddings and photo OCR coming soon. He uses it daily with a 100k markdown file and OpenClaw+Hermes Agent setup.

查看原文 →查看原文 →

Gemini Flash：超便宜、1M 上下文、结构化输出的模型Gemini Flash: 1M Context, Structured Outputs, and Incredibly Cheap

FPV Ventures 合伙人 Nikunj Kothari 称 Gemini Flash“便宜得离谱且好用”，拥有 1M 上下文窗口和结构化输出，是他生产环境中最常用的模型。他还盛赞 Google 的新实时语音模型好到令人震惊。

FPV Ventures partner Nikunj Kothari calls Gemini Flash 'criminally cheap and good,' with 1M context windows and structured outputs. It's his most used model in production workloads. He also praises Google's new live voice model as mindblowingly good.

查看原文 →

Crabbox 0.5.0：支持 WebVNC 和 Windows 的远程 CI 盒子Crabbox 0.5.0: Remote CI Boxes with WebVNC and Windows Support

OpenClaw 创建者 Peter Steinberger 发布了 Crabbox 0.5.0，支持通过 WebVNC 创建临时远程 CI 盒子以复现问题。Agent 可设置精确状态、测试、修复并在 PR 上发布视频。现已支持桌面/浏览器租赁、AWS Windows 与 WSL2、截图和应用启动。

OpenClaw creator Peter Steinberger released Crabbox 0.5.0, enabling ephemeral remote CI boxes with WebVNC for reproducing issues. Agents can set up exact state, test, fix, and post videos on PRs. Now supports desktop/browser leases, AWS Windows with WSL2, screenshots, and app launch.

查看原文 →查看原文 →

Claude Code Auto Mode：更安全的权限跳过方式Claude Code Auto Mode: A Safer Way to Skip Permissions

Anthropic 工程团队为 Claude Code 推出 auto mode，利用基于模型的分类器委托审批。它能捕捉过度主动的行为，如删除远程分支或泄露数据。两级分类器（快速过滤加推理）将误报率降至 0.4%，同时拦截大部分危险操作，并包含 prompt injection 防御。Auto mode 旨在安全地替代 --dangerously-skip-permissions。

Anthropic Engineering introduced auto mode for Claude Code, using model-based classifiers to delegate approvals. It catches overeager actions like deleting remote branches or exfiltrating data. A two-stage classifier (fast filter then reasoning) reduces false positives to 0.4% while blocking most dangerous actions. It also includes prompt injection defense. Auto mode is designed to replace --dangerously-skip-permissions safely.

查看原文 →