Agent & LLM · 2026年6月29日

每日论文速递 · Agent & LLM

💡 一句话：把 Agent 安全从“外围防御/训练期对齐”推进到运行时认知循环内部，专门处理 memory poisoning、tool-chain manipulation、多 Agent 协议攻击。

2026-06-29 09:02:417 篇论文条目

📄 每日论文速递 · Agent & LLM

日期：2026-06-29

1. Agent-Native Immune System: Architecture, Taxonomy, and Engineering

Agent 原生免疫系统：架构、分类与工程化

🔗 https://arxiv.org/abs/2606.28270

💡 一句话：把 Agent 安全从“外围防御/训练期对齐”推进到运行时认知循环内部，专门处理 memory poisoning、tool-chain manipulation、多 Agent 协议攻击。

🎯 关联：很高。Anna 做 Agent 平台一定会遇到 runtime safety、tool 权限、memory 污染，这篇可以直接启发 InternOS 的“组织免疫/执行防线”设计。

📅 提交：2026-06-26

2. GBC: Gradient-Based Connections for Optimizing Multi-Agent Systems

GBC：用梯度连接优化多智能体系统

🔗 https://arxiv.org/abs/2606.28187

💡 一句话：把多 Agent 系统建模成计算图，用 token-level attribution 找出哪个 Agent、哪一步交互导致错误，再定向优化 prompt。

🎯 关联：极高。Anna 的 Agent 平台如果要做可调试、可归因、可优化的多 Agent 编排，这篇比普通 “multi-agent collaboration” 论文更有工程价值。

📅 提交：2026-06-26

3. QueenBee Planner: Skill-Evolving Communication Topologies for Token-Efficient LLM Multi-Agent Systems

QueenBee Planner：面向 Token 高效多 Agent 系统的自进化通信拓扑

🔗 https://arxiv.org/abs/2606.27492

💡 一句话：让外层 planner 学会生成“谁和谁通信、几轮通信、谁聚合、谁输出”的 DAG，并把成功 trace 蒸馏成可复用的拓扑设计规则。

🎯 关联：极高。这个方向正中 Anna 的组织协调系统：不是让 Agent 瞎聊，而是把通信结构本身变成可学习、可复用、可评估的组织能力。

📅 提交：2026-06-25

4. Glite ARF: Verifier-Driven Research with Parallel LLM Coding Agents

Glite ARF：用验证器驱动并行 LLM 编码 Agent 做研究

🔗 https://arxiv.org/abs/2606.27416

💡 一句话：提出一个开源框架，用 deterministic verifier scripts 管住一批并行 coding agents，保证任务隔离、可复现、可审计，而不是靠 prompt 祈祷。

🎯 关联：极高。Anna 做 AI Agent 平台和代码生成 workflow，最该盯的就是这种“规则写进代码、失败就 loud fail”的工程范式。

📅 提交：2026-06-25

5. Delayed Verification Destabilizes Multi-Agent LLM Belief: Instability Thresholds and Optimal Corrector Placement

延迟验证会破坏多 Agent 信念稳定性：不稳定阈值与纠错器最优放置

🔗 https://arxiv.org/abs/2606.27409

💡 一句话：多 Agent 系统里的 verifier/critic 如果反馈太晚，错误信念会先在网络里扩散，甚至导致振荡；论文给了稳定性阈值和纠错节点放置策略。

🎯 关联：很高。InternOS 做组织协作时，review / verification 不能只是“最后检查”，要设计在信息流关键节点上，否则系统会集体跑偏。

📅 提交：2026-06-25

6. GenWorld: Empirically Grounded Urban Simulation Infrastructure for Scalable LLM-Agent Studies

GenWorld：面向大规模 LLM Agent 研究的真实约束城市仿真基础设施

🔗 https://arxiv.org/abs/2606.27650

💡 一句话：用真实 census/geospatial 数据构建城市级 Agent 仿真，并把 LLM 决策离线编译成 lookup policies，解决大规模 rollout 成本问题。

🎯 关联：中高。不是直接做办公 Agent，但它的“LLM 决策离线编译 + 可审计 replanning traces”对 Agent 平台的 scale 和 simulation 设计很有参考价值。

📅 提交：2026-06-26

7. From Tokens to States: LLMs as a Special Case of World Models and the Continuous Path Beyond

从 Token 到 State：LLM 作为 World Model 的特例，以及通向连续状态模型的路径

💡 一句话：主张 LLM 不是 world model 的对立面，而是状态空间退化成 token sequence 的特殊 world model，并讨论从 next-token prediction 走向 latent/state prediction 的连续路径。

🎯 关联：中高。对 Anna 的长期架构判断有用：Agent memory、state、environment interface 不应该永远被塞进文本 token，要开始想显式状态层。

📅 提交：2026-06-26