推文总结_2026-05-01 - 𓀚 转了码的刘公子

# 推文总结 2026-05-01 ## 总览 - 账号范围：18 个 - 活跃账号：13 个 - 总推文数：179 条 - 主要主题：中文 AI 圈 / AI 编码 / Agent / 模型与研究 ## 今日洞察 ### 1. Codex 正在从 CLI 工具变成持续运行的工作台 - 判断：这不是一次小功能发布，而是在把 Codex 往“可持续追目标的 agent 工作台”推进。 - 为什么重要：如果目标循环、长期上下文和应用内交互稳定下来，用户会开始把更多非一次性的任务交给 Codex，而不只是让它改一段代码。 - 建议动作：优先试跑 /goal 类长任务，并记录失败点：上下文丢失、权限、工具调用、重复执行和结果验收。 - 证据： - @gdb: codex now has a built in Ralph loop++: (赞 1683 / 转 69 / 回 96 / 看 179658) [链接](https://x.com/gdb/status/2050194039077495089) - @dotey: OpenAI 官方推出 Ralph loop 功能了，给 Codex CLI 加了个 /goal 命令。也就是说：你定个目标，它就一直跑，跨多轮不丢，不达目的不停。这是 0.128.0 版本里的新东西，要在 ~/.codex/config.toml 的 [features]… (赞 761 / 转 107 / 回 37 / 看 106837) [链接](https://x.com/dotey/status/2050028108787450148) - @gdb: Codex as the everything productivity app (赞 683 / 转 18 / 回 73 / 看 66309) [链接](https://x.com/gdb/status/2050233125276451243) ### 2. Agent 叙事从炫技转向公司级 workflow - 判断：当天的 agent 信息流不只在讲模型能力，而是在讲“如何用 agents 组装业务流程”。 - 为什么重要：这意味着下一阶段的差异点会落在流程编排、数据源、验收、权限和成本控制，而不是单点 prompt 技巧。 - 建议动作：把自己的自动化任务按“输入源、执行器、验收标准、失败恢复”四列盘点，找最适合 agent 化的重复流程。 - 证据： - @gdb: Codex as the everything productivity app (赞 683 / 转 18 / 回 73 / 看 66309) [链接](https://x.com/gdb/status/2050233125276451243) - @gregisenberg: How to build an entire company with AI agents using Paperclip https://t.co/LlTo36PPlk (赞 304 / 转 26 / 回 57 / 看 21740) [链接](https://x.com/gregisenberg/status/2050205362356134054) - @lidangzzz: 为什么OpenAI犯了个错误？ codex虽然把/goal写进去了，把goal driven抄走了（https://t.co/mJld9XcBjp），但是很明显这个东西很可能并不work，因为我反复讲过无数次，只有goal是不够的，必须要定义一个criteria，要让这个cr… (赞 82 / 转 7 / 回 30 / 看 29695) [链接](https://x.com/lidangzzz/status/2050059631054008417) ### 3. 模型能力讨论开始回到可解释与硬基准 - 判断：模型圈的讨论一边追新基准，一边补可解释性工具，说明单纯刷榜已经不够。 - 为什么重要：对产品和工作流来说，能否解释、调节和验证模型行为，会直接影响能不能进入更高风险的生产场景。 - 建议动作：关注“可解释调参 + 任务级评测”的组合，不要只记录模型名和分数。 - 证据： - @nash_su: 大模型终于不再是个“黑盒”了！阿里刚开源了一个叫 Qwen-Scope 的模型，直接赋予了大模型可解释性，就像给 AI 装上了“透视眼”和“遥控器”，主要能干这几件大事：不用写 Prompt，直接控场：想让它说话客气点、或者别提某些敏感词？不需要费劲巴拉地调指令，直接调节内… (赞 124 / 转 20 / 回 12 / 看 12722) [链接](https://x.com/nash_su/status/2050073203284869590) - @garrytan: GBrain v0.25 just dropped - this is mainly for me and contributors to GBrain to be able to benchmark evals against our own real queries in… (赞 54 / 转 3 / 回 11 / 看 7762) [链接](https://x.com/garrytan/status/2050244734241964214) - @dongxi_nlp: ARC-AGI-3 benchmark 5月1日 GPT-5.5: 0.43% Opus 4.7: 0.18% 3月25日 Opus 4.6 0.2% GPT-5.4 0.3% Gemini 3.1 0.2% Grok 4.20 0 一个月后，人类自信依然存在。 (赞 6 / 转 0 / 回 0 / 看 3315) [链接](https://x.com/dongxi_nlp/status/2050309104627769673) ### 4. 高互动内容明显偏向组织、城市和政治表达 - 判断：热门榜里非 AI 内容占了不少位置，说明关注列表不只是技术雷达，也在反映社会情绪和组织治理话题。 - 为什么重要：这些内容互动高，但和日常工作流的直接相关性较弱；适合当背景信号，不适合挤占技术跟进时间。 - 建议动作：阅读时把它们标成“背景/观点”，只保留能迁移到组织、产品或个人决策的部分。 - 证据： - @signulll: the contrast between zohran mamdani & vivek ramaswamy is undeniable. zohran almost always speaks from a coherent moral grammar (left populi… (赞 7578 / 转 371 / 回 186 / 看 1198340) [链接](https://x.com/signulll/status/2050345747275780140) - @garrytan: We can’t let the alt left talking point “fuck them, leave” take hold in California like it has in Washington After the billionaires leave,… (赞 2931 / 转 169 / 回 216 / 看 170889) [链接](https://x.com/garrytan/status/2050205726682079345) - @signulll: with john ternus taking over as apple ceo, every mag 7 company is now effectively run by someone with an engineering background, the lone e… (赞 1728 / 转 26 / 回 45 / 看 92054) [链接](https://x.com/signulll/status/2050037343839555783) ## 信号矩阵 | 信号 | 强度 | 代表账号 | 处理建议 | |---|---:|---|---| | Codex 正在从 CLI 工具变成持续运行的工作台 | 17 条 | @gdb、@dotey、@gdb | 优先试跑 /goal 类长任务，并记录失败点：上下文丢失、权限、工具调用、重复执行和结果验收。 | | Agent 叙事从炫技转向公司级 workflow | 8 条 | @gdb、@gregisenberg、@lidangzzz | 把自己的自动化任务按“输入源、执行器、验收标准、失败恢复”四列盘点，找最适合 agent 化的重复流程。 | | 模型能力讨论开始回到可解释与硬基准 | 3 条 | @nash_su、@garrytan、@dongxi_nlp | 关注“可解释调参 + 任务级评测”的组合，不要只记录模型名和分数。 | | 高互动内容明显偏向组织、城市和政治表达 | 11 条 | @signulll、@garrytan、@signulll | 阅读时把它们标成“背景/观点”，只保留能迁移到组织、产品或个人决策的部分。 | ## 今天该做什么 - 优先试跑 /goal 类长任务，并记录失败点：上下文丢失、权限、工具调用、重复执行和结果验收。 - 把自己的自动化任务按“输入源、执行器、验收标准、失败恢复”四列盘点，找最适合 agent 化的重复流程。 - 关注“可解释调参 + 任务级评测”的组合，不要只记录模型名和分数。 ## 重点账号动态 - @bcherny: 当日未抓到新推文 - @karpathy: 1 条；重点：@willccbb @FilipoGiovanni Very tempting due to how well this works though I still find that some slop leaks through in the concept space that gets inc [链接](https://x.com/karpathy/status/2050240810403410211) - @trq212: 2 条；重点：@CAISconf @AnthropicAI excited to join! [链接](https://x.com/trq212/status/2050232133705437652) - @gdb: 10 条；重点：codex now has a built in Ralph loop++: [链接](https://x.com/gdb/status/2050194039077495089) - @dotey: 13 条；重点：OpenAI 官方推出 Ralph loop 功能了，给 Codex CLI 加了个 /goal 命令。也就是说：你定个目标，它就一直跑，跨多轮不丢，不达目的不停。这是 0.128.0 版本里的新东西，要在 ~/.codex/config.toml 的 [features] 段写一句 goals [链接](https://x.com/dotey/status/2050028108787450148) - @oran_ge: 21 条；重点：很多人喜欢用「熵增」解释一切。关系会变差，公司会腐败，人会变懒，一切终将走向混乱。我平时最烦这种半懂不懂的表达方式...今天终于知道为什么了。熵增定律有一个大前提：它只适用于孤立系统。完全封闭的、不与外界交换任何能量和物质的系统。在这种条件下，混乱程度确实只增不减。但问题在于，大部分事物，都不 [链接](https://x.com/oran_ge/status/2050077799978074139) - @AnthropicAI: 当日未抓到新推文 - @dongxi_nlp: 2 条；重点：ARC-AGI-3 benchmark 5月1日 GPT-5.5: 0.43% Opus 4.7: 0.18% 3月25日 Opus 4.6 0.2% GPT-5.4 0.3% Gemini 3.1 0.2% Grok 4.20 0 一个月后，人类自信依然存在。 [链接](https://x.com/dongxi_nlp/status/2050309104627769673) - @jiangydev: 当日未抓到新推文 - @lifesinger: 3 条；重点：丰饶时代匮乏无处不在 [链接](https://x.com/lifesinger/status/2050194509489938630) - @gregisenberg: 7 条；重点：How to build an entire company with AI agents using Paperclip https://t.co/LlTo36PPlk [链接](https://x.com/gregisenberg/status/2050205362356134054) - @garrytan: 29 条；重点：We can’t let the alt left talking point “fuck them, leave” take hold in California like it has in Washington After the billionaires leave, the bureauc [链接](https://x.com/garrytan/status/2050205726682079345) - @signulll: 32 条；重点：the contrast between zohran mamdani & vivek ramaswamy is undeniable. zohran almost always speaks from a coherent moral grammar (left populist, materia [链接](https://x.com/signulll/status/2050345747275780140) - @thedankoe: 当日未抓到新推文 - @lidangzzz: 32 条；重点：我一般不看乱七八糟的美女，但是这位选美冠军实至名归，简直是跳动的生命力 https://t.co/PRkqiSGpgk [链接](https://x.com/lidangzzz/status/2050148489967886799) - @HiTw93: 20 条；重点：https://t.co/1mKzVu5vdv [链接](https://x.com/HiTw93/status/2050189572999618982) - @Khazix0918: 当日未抓到新推文 - @nash_su: 7 条；重点：大模型终于不再是个“黑盒”了！阿里刚开源了一个叫 Qwen-Scope 的模型，直接赋予了大模型可解释性，就像给 AI 装上了“透视眼”和“遥控器”，主要能干这几件大事：不用写 Prompt，直接控场：想让它说话客气点、或者别提某些敏感词？不需要费劲巴拉地调指令，直接调节内部的“特征开关”，精准 [链接](https://x.com/nash_su/status/2050073203284869590) ## 重点推文 - @signulll: the contrast between zohran mamdani & vivek ramaswamy is undeniable. zohran almost always speaks from a coherent moral grammar (left populist, materialist, “rent is too high”, etc. (赞 7578 / 转 371 / 回 186 / 看 1198340) [链接](https://x.com/signulll/status/2050345747275780140) - @garrytan: We can’t let the alt left talking point “fuck them, leave” take hold in California like it has in Washington After the billionaires leave, the bureaucrats turn their sights on the (赞 2931 / 转 169 / 回 216 / 看 170889) [链接](https://x.com/garrytan/status/2050205726682079345) - @signulll: one of the most refreshing things on the planet is talking to someone who just *gets it*. like you don’t need a preamble, & you don’t need to articulate the shape of the thought be (赞 2870 / 转 224 / 回 98 / 看 109445) [链接](https://x.com/signulll/status/2050258767946608808) - @gdb: codex now has a built in Ralph loop++: (赞 1683 / 转 69 / 回 96 / 看 179658) [链接](https://x.com/gdb/status/2050194039077495089) - @signulll: with john ternus taking over as apple ceo, every mag 7 company is now effectively run by someone with an engineering background, the lone exception being andy jassy. (赞 1728 / 转 26 / 回 45 / 看 92054) [链接](https://x.com/signulll/status/2050037343839555783) - @gdb: openai logo, scribblified https://t.co/NjdKrfFxOP (赞 1500 / 转 48 / 回 150 / 看 126110) [链接](https://x.com/gdb/status/2050099778072047756) - @dotey: OpenAI 官方推出 Ralph loop 功能了，给 Codex CLI 加了个 /goal 命令。也就是说：你定个目标，它就一直跑，跨多轮不丢，不达目的不停。这是 0.128.0 版本里的新东西，要在 ~/.codex/config.toml 的 [features] 段写一句 goals = true 才能启用。 [features] goals (赞 761 / 转 107 / 回 37 / 看 106837) [链接](https://x.com/dotey/status/2050028108787450148) - @lidangzzz: 我一般不看乱七八糟的美女，但是这位选美冠军实至名归，简直是跳动的生命力 https://t.co/PRkqiSGpgk (赞 751 / 转 32 / 回 129 / 看 155753) [链接](https://x.com/lidangzzz/status/2050148489967886799) - @lidangzzz: 2026全球十大宜居城市 1 维也纳 ( 奥地利 ) 2 哥本哈根 ( 丹麦 ) 3 苏黎世 ( 瑞士 ) 4 卡尔加里 ( 加拿大 ) 5 温哥华 ( 加拿大 ) 6 日内瓦 ( 瑞士 ) 7 沈阳市中街富龙自助盒饭周边一小圈 ( 中国 ) 8 多伦多 ( 加拿大 ) 9 阿姆斯特丹 ( 荷兰 ) 10 大阪 ( 日本 ) (赞 677 / 转 17 / 回 243 / 看 195225) [链接](https://x.com/lidangzzz/status/2050050580622594388) - @gdb: Codex as the everything productivity app (赞 683 / 转 18 / 回 73 / 看 66309) [链接](https://x.com/gdb/status/2050233125276451243) - @signulll: it’s looking like this will finally get openai to 1b mau. https://t.co/anytxE5Caq (赞 643 / 转 25 / 回 33 / 看 34982) [链接](https://x.com/signulll/status/2050285261624365223) - @gdb: such a fun launch — try "/pet hi" in Codex app: (赞 582 / 转 23 / 回 100 / 看 50540) [链接](https://x.com/gdb/status/2050284610656035297) ## 标签分布 - china: 98 条 - ai: 93 条 - agent: 85 条 - ai-coding: 76 条 - product: 73 条 - tools: 59 条 - startup: 36 条 - yc: 29 条 - engineering: 23 条 - openai: 10 条 - gpt: 10 条 - nlp: 2 条 ## 一句话判断 - 2026-05-01 的主线不是“有哪些热推”，而是：这不是一次小功能发布，而是在把 Codex 往“可持续追目标的 agent 工作台”推进。 ## 文件 - 原始抓取：`/Users/bytedance/Downloads/twitter_output/每日推文总结_raw_2026-05-01.json` - 信息源：`/Users/bytedance/myCronTask/run/daily_tweet_summary/resources/info_source.md`