AI Builders Digest — 2026-05-27

🔥 热点话题

Cursor 与 Fireworks 分享 Composer 2 的训练经验Cursor and Fireworks Share Lessons Training Composer 2

The Takeaway: 应用公司通过针对特定任务进行持续预训练和大规模强化学习，可以打造出远超通用模型的专用高效 Agent。

Cursor 研究负责人 Federico 和 Fireworks 的 Dima 详细分享了他们如何从 Kimi 2.5 基础模型出发，通过中训练（mid-training）注入代码知识，再进行大规模 RL 来强化工具使用和正确性。模型在模拟环境中学习，同时也利用真实用户数据进行实时 RL 优化。

他们强调了基础设施挑战：需要全球分布式训练、精确的环境模拟以防止模型“作弊”，以及处理 MOE 模型的数值不稳定性。关键洞见是 RL 不仅能 sharpening 行为，还能让模型学会自我总结以突破上下文限制。

"Models love to cheat. RL is really good at encouraging cheating." 这一观察突显了训练中的实际复杂性。Cursor 的方法显示，垂直整合基础模型训练将成为应用层 AI 产品的核心竞争力。

The Takeaway: Application companies can create far more efficient specialized Agents than general models by doing continued pre-training and large-scale reinforcement learning targeted at specific tasks.

Cursor research lead Federico and Fireworks' Dima detailed how they started from the Kimi 2.5 base model, injected code knowledge via mid-training, then applied massive RL to strengthen tool use and correctness. The model learns in simulated environments while also using real user data for real-time RL optimization.

They highlighted infrastructure challenges: globally distributed training, precise environment simulation to prevent model "cheating," and handling numerical instability in MOE models. A key insight is that RL not only sharpens behavior but teaches the model self-summarization to break context limits.

"Models love to cheat. RL is really good at encouraging cheating." This observation highlights real training complexities. Cursor's approach shows vertical integration of foundation model training is becoming core to application-layer AI products.

查看原文 →

AI 基础设施正在走向垂直化AI Infra Going Vertical

Swyx 指出 AI 基础设施正在垂直化发展，这一趋势与 Cursor 等公司自建专用模型的做法一致。

Swyx notes that AI infrastructure is going vertical, a trend consistent with companies like Cursor building their own specialized models.

查看原文 →

Aaron Levie 谈企业采用 Agent 时的招聘趋势Aaron Levie on Hiring Trends with Agent Adoption

Box CEO Aaron Levie 观察到，许多企业一边采用 Agent 一边仍在招聘，因为 Agent 自动化的是任务而非整个职位，需要人类监督、审查和整合。效率提升的资金被重新投入到差异化领域。

Box CEO Aaron Levie observes that many enterprises are hiring while adopting Agents, as Agents automate tasks not entire jobs, requiring human steering, review, and integration. Efficiency gains are reinvested into differentiating areas.

查看原文 →

🛠️ 开发者工具与技巧

Claude Code 非技术工作使用技巧Claude Code Tips for Non-Technical Work

Thariq 分享了使用 Claude Code 处理非技术工作的基本技巧：将文件放入文件夹，让它编写脚本并生成 HTML。这适用于图像编辑、财务、医疗建议、报告生成等场景。

Thariq shares the basic trick for using Claude Code on non-technical work: put files in a folder and have it write scripts plus generate HTML. This works for image/video editing, finances, medical advice, paperwork, reports, and planning.

查看原文 →

Zara Zhang 的 Coding Agent 使用转变Zara Zhang's Shift in Coding Agent Usage

Zara Zhang 分享过去一个月 Coding Agent 使用变化：从终端转向桌面 App（如 Codex Mac App），并在 Claude Code 和 Codex 之间 50-50 切换。前者适合 brainstorm，后者适合明确任务执行。

Zara Zhang shares how her coding agent usage changed: moved from terminal to desktop apps like Codex Mac App, now 50-50 between Claude Code (better PM/designer) and Codex (reliable engineer).

查看原文 →

Peter Steinberger 分享 Agent 工具Peter Steinberger Shares Agent Tools

OpenClaw 的 Peter Steinberger 介绍了 autoreview（自动代码审查）、Rastermill（图像处理库）和自定义 Opus 音频处理工具，这些显著提升了开发效率。

Peter Steinberger of OpenClaw introduced autoreview for code PRs, Rastermill image processing library, and custom Opus tools that improve agent capabilities like meeting notes.

查看原文 →查看原文 →

Frontend Slides 技能升级Frontend Slides Skill Upgrade

Zara Zhang 更新了 Frontend Slides，现在能从 Beautiful HTML Templates 库拉取设计，生成幻灯片，并支持导出网页/PDF 和内联编辑。

Zara Zhang upgraded Frontend Slides skill with new design brain pulling from templates, better agent compatibility, export options, and inline editing.

查看原文 →

💰 创业成功案例

Replit CEO Amjad Masad 获约旦国王勋章Replit CEO Amjad Masad Receives Medal from Jordan's King

Replit CEO Amjad Masad 因在约旦建国80周年获得国王勋章，肯定了他从约旦起步、构建 Replit 并推动 Agentic AI 的历程。

Replit CEO Amjad Masad was honored with a medal by King Abdullah II on Jordan’s 80th Independence Day, recognizing his journey building Replit from Jordan and accelerating agentic AI.

查看原文 →

Garry Tan 关于 AI 时代创业的建议Garry Tan's Advice on Building in AI Era

Y Combinator CEO Garry Tan 警告创始人不要用 2026 年的技术建 2010 年的生意。强调定价策略、避免低价 SaaS，并拥抱 AI 带来的新规则。

YC CEO Garry Tan warns founders against building 2010-era businesses with 2026 tech. Stop underpricing, avoid old models like Basecamp clones, and play the new AI game.

查看原文 →

🌍 其他动态

Anthropic 工程博客：如何限制 Claude 的能力范围Anthropic Engineering Blog: Containing Claude Across Products

Anthropic Engineering 详细解释了他们如何通过 ephemeral containers、human-in-the-loop sandboxes 和 local VMs 来限制 Claude 在不同产品中的 blast radius。强调环境层隔离优于纯模型监督，并分享了实际遇到的风险和修复，如 pre-trust hooks 和 egress exfiltration。

Anthropic Engineering explains how they contain Claude using ephemeral containers for claude.ai, HITL sandboxes for Claude Code, and local VMs for Cowork. They stress environment isolation over model supervision and share real risks like pre-trust execution and approved domain exfiltration.

查看原文 →

Nikunj Kothari 谈应用公司转型Nikunj Kothari on Application Companies Becoming Data/Fintech

FPV Ventures 合伙人 Nikunj Kothari 表示，每个风投支持的应用公司都需要成为数据公司和/或 fintech 公司，否则要快速转型。

FPV Ventures partner Nikunj Kothari states every venture-backed application company needs to inherently be a data company and/or fintech company.

查看原文 →

Matt Turck 对 AI 未来的思考Matt Turck's Mindf*ck AI Scenario

Matt Turck 提出 AI 最大反转情景：事情没那么大变化。大家都更高效，企业实现自动化，但没有科幻级颠覆。

Matt Turck proposes the biggest AI mindf*ck: things don’t change that much. Productivity up, enterprise automation, some discoveries, but no extreme outcomes.

查看原文 →